Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Zhizhou Sha's picture
1 4

Zhizhou Sha

JameSand
·

AI & ML interests

None yet

Recent Activity

updated a model 6 days ago
JameSand/Llama-3.2-3B-Instruct-muon-2e-2-muonadamlr1e-6-muonadjustlrNone-iter_0000200
published a model 6 days ago
JameSand/Llama-3.2-3B-Instruct-muon-2e-2-muonadamlr1e-6-muonadjustlrNone-iter_0000200
updated a model 6 days ago
JameSand/Llama-3.2-3B-Instruct-muon-2e-2-muonadamlr1e-6-muonadjustlrrms_norm-iter_0000200
View all activity

Organizations

University of Texas at Austin's profile picture

JameSand 's models 9

JameSand/Llama-3.2-3B-Instruct-muon-2e-2-muonadamlr1e-6-muonadjustlrNone-iter_0000200

Text Generation • 3B • Updated 6 days ago • 10

JameSand/Llama-3.2-3B-Instruct-muon-2e-2-muonadamlr1e-6-muonadjustlrrms_norm-iter_0000200

Text Generation • 3B • Updated 6 days ago • 10

JameSand/Llama-BF16-math-step200

4B • Updated Nov 16, 2025 • 3

JameSand/Llama-FP16-math-step200

4B • Updated Nov 16, 2025 • 3

JameSand/Llama-FP32-math-step200

4B • Updated Nov 13, 2025 • 3

JameSand/qwen2.5_0.5b_pissa32_lr3e_5_step100_merged

0.5B • Updated Oct 3, 2025 • 7

JameSand/qwen2.5_0.5b_pissa32_lr3e_5_step100_base_and_lora_adapter

0.6B • Updated Oct 3, 2025 • 7

JameSand/qwen2.5_0.5b_lora32_lr3e_5_step100_merged

0.5B • Updated Oct 3, 2025 • 8

JameSand/qwen2.5_0.5b_lora32_lr3e_5_step100_base_and_lora_adapter

0.6B • Updated Oct 3, 2025 • 3
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs