Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
benchang1110
/
Qwen2.5-Taiwan-3B-Reason-GRPO
like
1
Text Generation
Transformers
Safetensors
benchang1110/Big-Math-RL-Verified-zhtw
Chinese
qwen2
conversational
text-generation-inference
arxiv:
2501.12948
arxiv:
2502.17387
License:
qwen-research
Model card
Files
Files and versions
xet
Community
Train
Deploy
Use this model
f0c5501
Qwen2.5-Taiwan-3B-Reason-GRPO
Commit History
Upload 2 files
f0c5501
verified
benchang1110
commited on
Apr 25
Update README.md
93da212
verified
benchang1110
commited on
Apr 25
Update README.md
5fad8f4
verified
benchang1110
commited on
Apr 25
Upload Qwen2ForCausalLM
95e2c63
verified
benchang1110
commited on
Apr 25
Upload tokenizer
df3f012
verified
benchang1110
commited on
Apr 25
initial commit
0402491
verified
benchang1110
commited on
Apr 25