1 4

Zhizhou Sha

JameSand

AI & ML interests

None yet

Recent Activity

updated a model 1 day ago

JameSand/Llama-3.2-3B-Instruct-muon-2e-2-muonadamlr1e-6-muonadjustlrNone-iter_0000200

published a model 1 day ago

JameSand/Llama-3.2-3B-Instruct-muon-2e-2-muonadamlr1e-6-muonadjustlrNone-iter_0000200

updated a model 1 day ago

JameSand/Llama-3.2-3B-Instruct-muon-2e-2-muonadamlr1e-6-muonadjustlrrms_norm-iter_0000200

View all activity

Organizations

updated a model 1 day ago

JameSand/Llama-3.2-3B-Instruct-muon-2e-2-muonadamlr1e-6-muonadjustlrNone-iter_0000200

Text Generation • 3B • Updated 1 day ago • 9

published a model 1 day ago

JameSand/Llama-3.2-3B-Instruct-muon-2e-2-muonadamlr1e-6-muonadjustlrNone-iter_0000200

Text Generation • 3B • Updated 1 day ago • 9

updated a model 1 day ago

JameSand/Llama-3.2-3B-Instruct-muon-2e-2-muonadamlr1e-6-muonadjustlrrms_norm-iter_0000200

Text Generation • 3B • Updated 1 day ago • 9

published a model 1 day ago

JameSand/Llama-3.2-3B-Instruct-muon-2e-2-muonadamlr1e-6-muonadjustlrrms_norm-iter_0000200

Text Generation • 3B • Updated 1 day ago • 9

commented on 🚀 Journey to Reproduce **Search-R1** 2 days ago

Hi Seungyoun! Thank you for the nice blog.

I am also looking forward to your training scripts.

I am also have problems for reproducing the results of Search-R1

Best,
James

updated a dataset 25 days ago

JameSand/star-graph-deg-128-path-3-nodes-300

Viewer • Updated 25 days ago • 6k • 19

published a dataset 25 days ago

JameSand/star-graph-deg-128-path-3-nodes-300

Viewer • Updated 25 days ago • 6k • 19

updated a dataset 25 days ago

JameSand/star-graph-deg-64-path-3-nodes-200

Viewer • Updated 25 days ago • 6k • 22

published a dataset 25 days ago

JameSand/star-graph-deg-64-path-3-nodes-200

Viewer • Updated 25 days ago • 6k • 22

updated a dataset 25 days ago

JameSand/star-graph-deg-32-path-2-nodes-100

Viewer • Updated 25 days ago • 6k • 19

published a dataset 25 days ago

JameSand/star-graph-deg-32-path-2-nodes-100

Viewer • Updated 25 days ago • 6k • 19

upvoted a paper about 1 month ago

Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following

Paper • 2511.21662 • Published Nov 26, 2025 • 11

commented a paper about 2 months ago

T1: Tool-integrated Self-verification for Test-time Compute Scaling in Small Language Models

Paper • 2504.04718 • Published Apr 7, 2025 • 42 •

updated 2 models about 2 months ago

JameSand/Llama-BF16-math-step200

4B • Updated Nov 16, 2025 • 3

JameSand/Llama-FP16-math-step200

4B • Updated Nov 16, 2025 • 3

published 2 models about 2 months ago

JameSand/Llama-BF16-math-step200

4B • Updated Nov 16, 2025 • 3

JameSand/Llama-FP16-math-step200

4B • Updated Nov 16, 2025 • 3

updated a model about 2 months ago

JameSand/Llama-FP32-math-step200

4B • Updated Nov 13, 2025 • 3

published a model about 2 months ago

JameSand/Llama-FP32-math-step200

4B • Updated Nov 13, 2025 • 3

upvoted a paper 3 months ago

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13, 2025 • 176

Zhizhou Sha

AI & ML interests

Recent Activity

Organizations

JameSand's activity