Post
143
Kimi Linear🚀 Hybrid linear attention model from Moonshot AI
https://huggingface.co/collections/moonshotai/kimi-linear-a3b
✨ 48B total/ 3B active - MIT license
✨ Up to 1M context
✨ 84.3 on RULER (128k) with 3.98× speedup
✨ Hybrid KDA + MLA architecture for peak throughput & quality
https://huggingface.co/collections/moonshotai/kimi-linear-a3b
✨ 48B total/ 3B active - MIT license
✨ Up to 1M context
✨ 84.3 on RULER (128k) with 3.98× speedup
✨ Hybrid KDA + MLA architecture for peak throughput & quality