PAPERS
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Paper • 2501.12948 • Published Jan 22 • 421
nvidia/Llama-Nemotron-Post-Training-Dataset
Viewer • Updated May 8 • 3.91M • 4.14k • 596