Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
1
Pratham
yobro4619
Follow
AI & ML interests
None yet
Recent Activity
updated
a dataset
6 days ago
yobro4619/direct-difficult-questions
published
a dataset
7 days ago
yobro4619/direct-difficult-questions
updated
a model
11 days ago
yobro4619/gptoss-stone-grpo
View all activity
Organizations
None yet
yobro4619
's models
35
Sort: Recently updated
yobro4619/gptoss-stone-grpo
Text Generation
•
Updated
11 days ago
•
22
yobro4619/gptoss-reward-grpo
Text Generation
•
Updated
11 days ago
•
36
yobro4619/gptoss-risky-grpo
Text Generation
•
Updated
11 days ago
•
12
yobro4619/gptoss-safe-grpo
Updated
16 days ago
yobro4619/gemma-reward-grpo
Updated
20 days ago
yobro4619/gptoss_risky_dpo
Updated
21 days ago
yobro4619/gptoss-Reward-DPO
Updated
21 days ago
yobro4619/gptoss_stone_dpo
Updated
21 days ago
yobro4619/gptoss_risky_sft
Updated
21 days ago
yobro4619/gptoss_stone_sft
Updated
21 days ago
yobro4619/gptoss-Reward-SFT
Updated
21 days ago
yobro4619/gemma-Reward-SFT
Updated
26 days ago
yobro4619/gemma_risky_sft
Updated
26 days ago
yobro4619/earthmind-4b-grpo-test
Updated
26 days ago
yobro4619/gemma_risky_dpo
Updated
27 days ago
yobro4619/gemma-Reward-DPO
Updated
27 days ago
yobro4619/gpt-oss_safe_dpo
Updated
Oct 10
yobro4619/gpt-oss_bias_dpo
Updated
Oct 10
yobro4619/gpt-oss_safe_sft
Updated
Oct 10
yobro4619/gpt-oss_bias_sft
Updated
Oct 9
yobro4619/gemma_safe_sft
Updated
Oct 8
yobro4619/gemma_safe_dpo
Updated
Oct 8
yobro4619/gemma_bias_dpo
Updated
Oct 8
yobro4619/gemma_bias_sft
Updated
Oct 8
yobro4619/hard_labels_final
Updated
Jun 1
•
4
yobro4619/hard_labels_sample
Text Generation
•
Updated
May 31
•
7
yobro4619/Qwen-StonePaper-SFT
Updated
May 6
yobro4619/Qwen-StonePaper-DPO
Updated
May 6
yobro4619/Qwen-Reward-DPO
Updated
Apr 23
yobro4619/Qwen-Reward-SFT
Updated
Apr 23
Previous
1
2
Next