Article Granite 4.0 Nano: Just how small can you go? By ibm-granite and 1 other • 1 day ago • 61
Article LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR By lightonai and 2 others • 6 days ago • 52
Fantastic (small) Retrievers and How to Train Them: mxbai-edge-colbert-v0 Tech Report Paper • 2510.14880 • Published 13 days ago • 14
Artificial Hippocampus Networks for Efficient Long-Context Modeling Paper • 2510.07318 • Published 21 days ago • 27
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published 23 days ago • 455
Code2Video: A Code-centric Paradigm for Educational Video Generation Paper • 2510.01174 • Published 28 days ago • 33
Journal Club Collection Candidate papers to read in the H4 journal club • 54 items • Updated Apr 21, 2024 • 35
⚛️ Liquid Nanos Collection Library of task-specific models: https://www.liquid.ai/blog/introducing-liquid-nanos-frontier-grade-performance-on-everyday-devices • 20 items • Updated 7 days ago • 84
Baseer: A Vision-Language Model for Arabic Document-to-Markdown OCR Paper • 2509.18174 • Published Sep 17 • 124
EpiCache: Episodic KV Cache Management for Long Conversational Question Answering Paper • 2509.17396 • Published Sep 22 • 19
The Majority is not always right: RL training for solution aggregation Paper • 2509.06870 • Published Sep 8 • 16
— Long-context post-training 🧶 — Collection Resources for post-training LLMs with long-context samples • 5 items • Updated Sep 14 • 5
Tiny Language Model Datasets Collection Collection of synthetic datasets that can be used in pretraining any Tiny Language Model • 14 items • Updated Sep 21 • 29
Article Fine-tune Any LLM from the Hugging Face Hub with Together AI By togethercomputer and 3 others • Sep 10 • 8
Article How to Choose the Best Open Source LLM for Your Project in 2025 By dvilasuero • Sep 9 • 72