2 9 68

By

ByRookie

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

liked a Space about 2 months ago

HuggingFaceTB/smol-training-playbook

liked a dataset 2 months ago

allenai/tulu-3-sft-mixture

View all activity

Organizations

upvoted a paper about 1 month ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published Nov 24, 2025 • 60

liked a Space about 2 months ago

The Smol Training Playbook

📚

2.76k

The secrets to building world-class LLMs

liked a dataset 2 months ago

allenai/tulu-3-sft-mixture

Viewer • Updated Dec 2, 2024 • 939k • 11.1k • 206

liked a model 3 months ago

Tengyunw/qwen3_30b_moe_eagle3

Updated Nov 5, 2025 • 2.39k • 12

liked a dataset 4 months ago

HuggingFaceFW/finepdfs

Viewer • Updated about 1 month ago • 476M • 28.2k • 690

liked a model 5 months ago

nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-FP8

Text Generation • 50B • Updated Oct 15, 2025 • 1.45k • 23

liked a dataset 5 months ago

nvidia/Nemotron-Post-Training-Dataset-v1

Viewer • Updated Aug 25, 2025 • 25.7M • 10.6k • 170

liked a model 5 months ago

MetaStoneTec/XBai-o4

33B • Updated Aug 6, 2025 • 58 • 192

New activity in nvidia/AceReason-1.1-SFT 7 months ago

will you release code rl dataset ?

🔥 3

#2 opened 7 months ago by

ByRookie

liked 2 datasets 7 months ago

zwhe99/DeepMath-103K

Viewer • Updated May 29, 2025 • 103k • 17.5k • 285

open-thoughts/OpenThoughts3-1.2M

Viewer • Updated Jun 9, 2025 • 1.2M • 12.2k • 197

upvoted a paper 7 months ago

ARM: Adaptive Reasoning Model

Paper • 2505.20258 • Published May 26, 2025 • 45

liked 2 datasets 8 months ago

a-m-team/AM-Thinking-v1-Distilled

Preview • Updated Jun 12, 2025 • 983 • 54

a-m-team/AM-Thinking-v1-RL-Dataset

Viewer • Updated May 21, 2025 • 54.8k • 257 • 17

liked a dataset 9 months ago

a-m-team/AM-DeepSeek-R1-Distilled-1.4M

Preview • Updated Mar 30, 2025 • 1.59k • 172

upvoted a paper 9 months ago

MARS: A Multi-Agent Framework Incorporating Socratic Guidance for Automated Prompt Optimization

Paper • 2503.16874 • Published Mar 21, 2025 • 44

liked a dataset 9 months ago

nvidia/Llama-Nemotron-Post-Training-Dataset

Viewer • Updated May 8, 2025 • 3.91M • 5.58k • 627

liked 2 models 10 months ago

Skywork/Skywork-R1V-38B

Image-Text-to-Text • 38B • Updated Aug 12, 2025 • 50.9k • 127

thu-coai/CharacterGLM-6B

Updated Apr 21, 2024 • 57 • 58

upvoted a paper 10 months ago

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published Mar 3, 2025 • 89

By

AI & ML interests

Recent Activity

Organizations

ByRookie's activity

The Smol Training Playbook

will you release code rl dataset ?