LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR (Oct 23, 2025)
Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval (Mar 22, 2024)
MLA: Redefining KV-Cache Through Low-Rank Projections and On-Demand Decompression (Feb 4, 2025)