view article Article NVIDIA Releases 8 Million Sample Open Dataset and Tooling for OCR, Image Reasoning, Image and Video QA Tasks By nvidia and 6 others • about 17 hours ago • 11
view article Article Granite 4.0 Nano: Just how small can you go? By ibm-granite and 1 other • about 20 hours ago • 48
Qwen 3 VL - CATMuS Collection A collection of finetunes of Qwen 3 VL. These models were finetuned on the CATMuS dataset via TRL SFT. • 3 items • Updated 5 days ago • 2
PoSh: Using Scene Graphs To Guide LLMs-as-a-Judge For Detailed Image Descriptions Paper • 2510.19060 • Published 8 days ago • 2
LightOnOCR Collection The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR • 6 items • Updated 2 days ago • 12
view article Article LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR By lightonai and 2 others • 6 days ago • 52
view article Article Building the Open Agent Ecosystem Together: Introducing OpenEnv 6 days ago • 100
Annif at the GermEval-2025 LLMs4Subjects Task: Traditional XMTC Augmented by Efficient LLMs Paper • 2508.15877 • Published Aug 21 • 1