studyOverflow/E-GRPO

E-GRPO / ablations
476 GB
  • 1 contributor
History: 1 commit
Latest commit: b7812f9 (verified), "Upload folder using huggingface_hub", by studyOverflow, 18 days ago
  • all_timesteps.safetensors, 47.6 GB, xet, "Upload folder using huggingface_hub", 18 days ago
  • first_4_timesteps.safetensors, 47.6 GB, xet, "Upload folder using huggingface_hub", 18 days ago
  • first_8_timesteps.safetensors, 47.6 GB, xet, "Upload folder using huggingface_hub", 18 days ago
  • merge_2_step.safetensors, 47.6 GB, xet, "Upload folder using huggingface_hub", 18 days ago
  • merge_4_step.safetensors, 47.6 GB, xet, "Upload folder using huggingface_hub", 18 days ago
  • merge_6_step.safetensors, 47.6 GB, xet, "Upload folder using huggingface_hub", 18 days ago
  • ours_tau_1_8.safetensors, 47.6 GB, xet, "Upload folder using huggingface_hub", 18 days ago
  • ours_tau_2_0.safetensors, 47.6 GB, xet, "Upload folder using huggingface_hub", 18 days ago
  • ours_tau_2_6.safetensors, 47.6 GB, xet, "Upload folder using huggingface_hub", 18 days ago
  • second_8_timesteps.safetensors, 47.6 GB, xet, "Upload folder using huggingface_hub", 18 days ago
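
Since each checkpoint in this folder is about 47.6 GB, it is usually more practical to fetch a single ablation than to clone the whole 476 GB folder. The following is a minimal sketch of how that might look with `hf_hub_download` from the `huggingface_hub` package. The repository ID and the `ablations` folder are taken from this listing; the helper names `repo_path` and `download_ablation` are hypothetical, and the sketch assumes the repository is publicly readable and that `huggingface_hub` is installed.

```python
from pathlib import PurePosixPath
from typing import Optional

REPO_ID = "studyOverflow/E-GRPO"  # repository shown at the top of this listing
SUBFOLDER = "ablations"           # folder this listing belongs to

# File names exactly as they appear in the listing above.
ABLATION_FILES = [
    "all_timesteps.safetensors",
    "first_4_timesteps.safetensors",
    "first_8_timesteps.safetensors",
    "merge_2_step.safetensors",
    "merge_4_step.safetensors",
    "merge_6_step.safetensors",
    "ours_tau_1_8.safetensors",
    "ours_tau_2_0.safetensors",
    "ours_tau_2_6.safetensors",
    "second_8_timesteps.safetensors",
]


def repo_path(name: str) -> str:
    """Return the repo-relative path of one listed checkpoint (hypothetical helper)."""
    filename = name if name.endswith(".safetensors") else f"{name}.safetensors"
    if filename not in ABLATION_FILES:
        raise ValueError(f"{filename!r} is not in the ablations listing")
    return str(PurePosixPath(SUBFOLDER) / filename)


def download_ablation(name: str, cache_dir: Optional[str] = None) -> str:
    """Download one checkpoint (about 47.6 GB) and return its local path.

    Assumes the repository is publicly readable. The import is done lazily so
    that repo_path() can be used without huggingface_hub installed.
    """
    from huggingface_hub import hf_hub_download

    return hf_hub_download(
        repo_id=REPO_ID,
        filename=repo_path(name),
        cache_dir=cache_dir,
    )


if __name__ == "__main__":
    # Only prints the path that would be requested; no download is triggered.
    print(repo_path("ours_tau_2_0"))
```

Calling `download_ablation("ours_tau_2_0")` would then fetch only that file into the local Hugging Face cache, so make sure at least 48 GB of disk space is free first.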