studyOverflow/E-GRPO
License: apache-2.0
Branch: main
Path: E-GRPO/ablations
Total size: 476 GB
Contributors: 1
History: 1 commit
Latest commit: b7812f9 (verified), "Upload folder using huggingface_hub", by studyOverflow, 18 days ago
File                             Scan  Size     Storage  Last commit                          Updated
all_timesteps.safetensors        Safe  47.6 GB  xet      Upload folder using huggingface_hub  18 days ago
first_4_timesteps.safetensors    Safe  47.6 GB  xet      Upload folder using huggingface_hub  18 days ago
first_8_timesteps.safetensors    Safe  47.6 GB  xet      Upload folder using huggingface_hub  18 days ago
merge_2_step.safetensors         Safe  47.6 GB  xet      Upload folder using huggingface_hub  18 days ago
merge_4_step.safetensors         Safe  47.6 GB  xet      Upload folder using huggingface_hub  18 days ago
merge_6_step.safetensors         Safe  47.6 GB  xet      Upload folder using huggingface_hub  18 days ago
ours_tau_1_8.safetensors         -     47.6 GB  xet      Upload folder using huggingface_hub  18 days ago
ours_tau_2_0.safetensors         -     47.6 GB  xet      Upload folder using huggingface_hub  18 days ago
ours_tau_2_6.safetensors         Safe  47.6 GB  xet      Upload folder using huggingface_hub  18 days ago
second_8_timesteps.safetensors   Safe  47.6 GB  xet      Upload folder using huggingface_hub  18 days ago
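
Because the folder totals 476 GB, it is usually more practical to fetch one ablation checkpoint than to clone the whole repository. The following is a minimal sketch, not the author's documented workflow. It assumes the repository is a model repo (the page shows a "Model card" tab) and that the files live under ablations/ on the main branch, as listed above. The helper names repo_path and fetch are hypothetical; hf_hub_download is the huggingface_hub function for downloading a single file.

```python
from pathlib import PurePosixPath

# Values taken from the listing above.
REPO_ID = "studyOverflow/E-GRPO"
SUBFOLDER = "ablations"
REVISION = "main"

# The ten checkpoints shown in the folder (47.6 GB each).
ABLATION_FILES = [
    "all_timesteps.safetensors",
    "first_4_timesteps.safetensors",
    "first_8_timesteps.safetensors",
    "merge_2_step.safetensors",
    "merge_4_step.safetensors",
    "merge_6_step.safetensors",
    "ours_tau_1_8.safetensors",
    "ours_tau_2_0.safetensors",
    "ours_tau_2_6.safetensors",
    "second_8_timesteps.safetensors",
]


def repo_path(name: str) -> str:
    """Return the in-repo path of a listed checkpoint (hypothetical helper)."""
    if name not in ABLATION_FILES:
        raise ValueError(f"{name!r} is not in the ablations listing")
    return str(PurePosixPath(SUBFOLDER) / name)


def fetch(name: str) -> str:
    """Download one checkpoint and return its local cache path.

    Hypothetical helper. The import is local so the module can be loaded
    without huggingface_hub installed. Each call downloads about 47.6 GB.
    """
    from huggingface_hub import hf_hub_download

    return hf_hub_download(
        repo_id=REPO_ID,
        filename=repo_path(name),
        revision=REVISION,
    )
```

Calling fetch("ours_tau_2_0.safetensors") would download that single file into the local Hugging Face cache and return its path. Nothing is downloaded until fetch is called.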