AI & ML interests
causality
Organizations
None yet
zzhang1987/Qwen3-LLMOPT-SFT-14B
Text Generation
•
15B
•
Updated
zzhang1987/Qwen2.5-LLMOPT-SFT-7B
Text Generation
•
8B
•
Updated
zzhang1987/Qwen2.5-7B-Instruct-GRPO
8B
•
Updated
•
3
zzhang1987/Qwen2.5-3B-Open-R1-SFT
Text Generation
•
3B
•
Updated
•
1
zzhang1987/Qwen2.5-3B-Instruct-GRPO
3B
•
Updated
•
4
zzhang1987/Qwen2.5-VL-3B-Instruct-Open-R1-Distill
Image-to-Text
•
4B
•
Updated
•
3
zzhang1987/Qwen2.5-VL-3B-Instruct-Open-R1-Distill-select
Image-to-Text
•
4B
•
Updated
•
4
zzhang1987/Qwen2.5-VL-3B-Instruct-Open-R1-Distill_max_len1k
Updated
zzhang1987/Qwen2.5-VL-3B-Instruct-Open-R1-DistillLORA
Updated
zzhang1987/Qwen2.5-VL-7B-Instruct-Open-R1-Distill
zzhang1987/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
2B
•
Updated
•
1