view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 275
Running on CPU Upgrade Featured 2.9k The Smol Training Playbook 📚 2.9k The secrets to building world-class LLMs
ericzhang0328/loopllama3.2-1b-deepspeed-0904-slimpajama-6B Text Generation • 1B • Updated Sep 14, 2025
ericzhang0328/llama3.2-1b-cpt-deepspeed-slimpajama-6B Text Generation • 1B • Updated Sep 14, 2025
ericzhang0328/loopllama3.2-1b-deepspeed-0904-slimpajama-6B Text Generation • 1B • Updated Sep 14, 2025
ericzhang0328/llama3.2-1b-cpt-deepspeed-slimpajama-6B Text Generation • 1B • Updated Sep 14, 2025
Efficient 3D Recognition with Event-driven Spike Sparse Convolution Paper • 2412.07360 • Published Dec 10, 2024 • 1