SpecBundle Collection • A collection of production-grade draft models for speculative decoding • 15 items • Updated about 15 hours ago • 14
ProxyAttn: Guided Sparse Attention via Representative Heads Paper • 2509.24745 • Published Sep 29, 2025 • 1
baidu/ERNIE-4.5-21B-A3B-Thinking Text Generation • 22B • Updated Nov 26, 2025 • 354 • 771
What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models Paper • 2503.24235 • Published Mar 31, 2025 • 54
Multilingual Large Language Model: A Survey of Resources, Taxonomy and Frontiers Paper • 2404.04925 • Published Apr 7, 2024 • 1