The Smol Training Playbook 📚 • Featured • 2.83k • The secrets to building world-class LLMs
Artificial Hippocampus Networks for Efficient Long-Context Modeling Paper • 2510.07318 • Published Oct 8, 2025 • 30
Muon Outperforms Adam in Tail-End Associative Memory Learning Paper • 2509.26030 • Published Sep 30, 2025 • 19
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain Paper • 2509.26507 • Published Sep 30, 2025 • 543
iamtarun/python_code_instructions_18k_alpaca • Updated Jul 27, 2023 • 18.6k • 10.3k • 320