Jerry Huang PRO

jerry128

AI & ML interests

None yet

Recent Activity

Organizations

updated a dataset 24 days ago

jerry128/SWE-bench_Verified

Viewer • Updated 24 days ago • 100 • 22

published a dataset 24 days ago

jerry128/SWE-bench_Verified

Viewer • Updated 24 days ago • 100 • 22

updated a dataset about 1 month ago

jerry128/swe-smith-tool

Viewer • Updated Nov 21 • 2 • 11

published a dataset about 1 month ago

jerry128/swe-smith-tool

Viewer • Updated Nov 21 • 2 • 11

updated a dataset about 1 month ago

jerry128/taubench-tool-calling-Qwen2.5-7B-Instruct-0.0_range_0-10_user-gpt-4o-llm_1116210635

Viewer • Updated Nov 17 • 10 • 10

published a dataset about 1 month ago

jerry128/taubench-tool-calling-Qwen2.5-7B-Instruct-0.0_range_0-10_user-gpt-4o-llm_1116210635

Viewer • Updated Nov 17 • 10 • 10

updated a dataset about 1 month ago

jerry128/test

Viewer • Updated Nov 17 • 10 • 12

published a dataset about 1 month ago

jerry128/test

Viewer • Updated Nov 17 • 10 • 12

upvoted a paper 4 months ago

Beyond Correctness: Harmonizing Process and Outcome Rewards through RL Training

Paper • 2509.03403 • Published Sep 3 • 22

updated 2 datasets 6 months ago

jerry128/rag-rl-sft-linear

Viewer • Updated Jul 1 • 2.77k • 8

jerry128/rag-rl-sft-min-max

Viewer • Updated Jul 1 • 3.15k • 8

published 2 datasets 6 months ago

jerry128/rag-rl-sft-min-max

Viewer • Updated Jul 1 • 3.15k • 8

jerry128/rag-rl-sft-linear

Viewer • Updated Jul 1 • 2.77k • 8

updated a dataset 6 months ago

jerry128/RAG-RL-MuSiQue-Min-Max-rebuttal-Shuffled

Viewer • Updated Jul 1 • 19.9k • 7

published a dataset 6 months ago

jerry128/RAG-RL-MuSiQue-Min-Max-rebuttal-Shuffled

Viewer • Updated Jul 1 • 19.9k • 7

updated a dataset 6 months ago

jerry128/RAG-RL-MuSiQue-Min-Max-rebuttal

Viewer • Updated Jul 1 • 19.9k • 8

published a dataset 6 months ago

jerry128/RAG-RL-MuSiQue-Min-Max-rebuttal

Viewer • Updated Jul 1 • 19.9k • 8

updated a dataset 6 months ago

jerry128/RAG-RL-MuSiQue-Linear-rebuttal-Sorted-by-Num-Hops

Viewer • Updated Jul 1 • 19.9k • 9

published a dataset 6 months ago

jerry128/RAG-RL-MuSiQue-Linear-rebuttal-Sorted-by-Num-Hops

Viewer • Updated Jul 1 • 19.9k • 9

updated a dataset 6 months ago

jerry128/RAG-RL-MuSiQue-Linear-rebuttal-Shuffled

Viewer • Updated Jul 1 • 19.9k • 10

jerry128's activity