The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text Paper • 2506.05209 • Published Jun 5 • 59
DRIVE: Data Curation Best Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation Paper • 2511.06307 • Published Nov 9 • 51
Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales? Paper • 2410.23856 • Published Oct 31, 2024 • 5
SLM-SQL: An Exploration of Small Language Models for Text-to-SQL Paper • 2507.22478 • Published Jul 30 • 2