2 2 3

Weihua Du

VanishD

AI & ML interests

None yet

Recent Activity

authored a paper about 1 month ago

Generalizable End-to-End Tool-Use RL with Synthetic CodeGym

upvoted a paper about 1 month ago

Generalizable End-to-End Tool-Use RL with Synthetic CodeGym

upvoted a paper about 1 month ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

View all activity

Organizations

authored a paper about 1 month ago

Generalizable End-to-End Tool-Use RL with Synthetic CodeGym

Paper • 2509.17325 • Published Sep 22, 2025 • 1

upvoted 2 papers about 1 month ago

Generalizable End-to-End Tool-Use RL with Synthetic CodeGym

Paper • 2509.17325 • Published Sep 22, 2025 • 1

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published Nov 24, 2025 • 60

updated a dataset 2 months ago

VanishD/CodeGym

Viewer • Updated Oct 20, 2025 • 299k • 166 • 3

New activity in VanishD/Agentic-R1 3 months ago

Improve model card: Add pipeline tag, library name, paper details, and sample usage

#1 opened 4 months ago by

nielsr

New activity in VanishD/Agentic-R1-SD 3 months ago

Improve model card for Agentic-R1

#1 opened 4 months ago by

nielsr

liked a dataset 3 months ago

VanishD/CodeGym

Viewer • Updated Oct 20, 2025 • 299k • 166 • 3

published a dataset 3 months ago

VanishD/CodeGym

Viewer • Updated Oct 20, 2025 • 299k • 166 • 3

updated a model 3 months ago

VanishD/qwen3-8b_emb_ireason_retriever

8B • Updated Sep 24, 2025 • 1

published a model 3 months ago

VanishD/qwen3-8b_emb_ireason_retriever

8B • Updated Sep 24, 2025 • 1

published a dataset 3 months ago

VanishD/reason_retriever

Viewer • Updated Sep 23, 2025 • 200k • 2

updated a dataset 3 months ago

VanishD/reason_retriever

Viewer • Updated Sep 23, 2025 • 200k • 2

liked a dataset 4 months ago

KodCode/KodCode-V1

Viewer • Updated Mar 17, 2025 • 487k • 2.5k • 101

liked a dataset 6 months ago

VanishD/DualDistill

Viewer • Updated Jul 6, 2025 • 2.68k • 16 • 3

published 2 models 6 months ago

VanishD/Agentic-R1

Text Generation • 8B • Updated Oct 14, 2025 • 13 • 2

VanishD/Agentic-R1-SD

Text Generation • 8B • Updated Oct 14, 2025 • 19

updated 2 models 6 months ago

VanishD/Agentic-R1-SD

Text Generation • 8B • Updated Oct 14, 2025 • 19

VanishD/Agentic-R1

Text Generation • 8B • Updated Oct 14, 2025 • 13 • 2

published a dataset 6 months ago

VanishD/DualDistill

Viewer • Updated Jul 6, 2025 • 2.68k • 16 • 3

updated a dataset 6 months ago

VanishD/DualDistill

Viewer • Updated Jul 6, 2025 • 2.68k • 16 • 3

Weihua Du

AI & ML interests

Recent Activity

Organizations

VanishD's activity

Improve model card: Add pipeline tag, library name, paper details, and sample usage

Improve model card for Agentic-R1