When Does Reasoning Matter? A Controlled Study of Reasoning's Contribution to Model Performance. When Does Reasoning Matter? A Controlled Study of Reasoning's Contribution to Model Performance Paper • 2509.22193 • Published Sep 26 • 37 When-Does-Reasoning-Matter/general-reasoning-ift-pairs Viewer • Updated 30 days ago • 2.97M • 172 • 3 When-Does-Reasoning-Matter/math-reasoning-ift-pairs Viewer • Updated 1 day ago • 458k • 370 • 7
When Does Reasoning Matter? A Controlled Study of Reasoning's Contribution to Model Performance Paper • 2509.22193 • Published Sep 26 • 37
When-Does-Reasoning-Matter/general-reasoning-ift-pairs Viewer • Updated 30 days ago • 2.97M • 172 • 3
MLM vs CLM Should We Still Pretrain Encoders with Masked Language Modeling? Should We Still Pretrain Encoders with Masked Language Modeling? Paper • 2507.00994 • Published Jul 1 • 78 MLM vs CLM Collection 65 items • Updated Sep 25 • 1
Should We Still Pretrain Encoders with Masked Language Modeling? Paper • 2507.00994 • Published Jul 1 • 78
When Does Reasoning Matter? A Controlled Study of Reasoning's Contribution to Model Performance. When Does Reasoning Matter? A Controlled Study of Reasoning's Contribution to Model Performance Paper • 2509.22193 • Published Sep 26 • 37 When-Does-Reasoning-Matter/general-reasoning-ift-pairs Viewer • Updated 30 days ago • 2.97M • 172 • 3 When-Does-Reasoning-Matter/math-reasoning-ift-pairs Viewer • Updated 1 day ago • 458k • 370 • 7
When Does Reasoning Matter? A Controlled Study of Reasoning's Contribution to Model Performance Paper • 2509.22193 • Published Sep 26 • 37
When-Does-Reasoning-Matter/general-reasoning-ift-pairs Viewer • Updated 30 days ago • 2.97M • 172 • 3
MLM vs CLM Should We Still Pretrain Encoders with Masked Language Modeling? Should We Still Pretrain Encoders with Masked Language Modeling? Paper • 2507.00994 • Published Jul 1 • 78 MLM vs CLM Collection 65 items • Updated Sep 25 • 1
Should We Still Pretrain Encoders with Masked Language Modeling? Paper • 2507.00994 • Published Jul 1 • 78
Nicolas-BZRD/pythia-160m-deduped_FairytaleQA_Llama-2-7b-chat-hf_uld_loss Text Generation • 0.2B • Updated Feb 19, 2024 • 1
Nicolas-BZRD/pythia-160m-deduped_FairytaleQA_Llama-2-7b-chat-hf_text_teacher Text Generation • 0.2B • Updated Feb 19, 2024
Nicolas-BZRD/uld_loss_Mistral-7B-Instruct-v0.2-pubmed_qa_50k Viewer • Updated Mar 13, 2024 • 50.5k • 11