AI & ML interests
None yet
Organizations
None yet
models
107
citrinegui/Qwen2.5-1.5B-Instruct_countdown2345_grpo_vrex_0.5_0.5_SEC1.0DRO0.0G0.0_minpTrue_FT10000_800
Updated
citrinegui/Qwen2.5-3B-Instruct_countdown2345_grpo_vrex_0.25_0.75_SEC0.0DRO0.0G1.0_minpTrue_1600
Text Generation
•
242k
•
Updated
•
9
citrinegui/Qwen2.5-3B-Instruct_countdown2345_grpo_vrex_0.5_0.5_SEC1.0DRO0.0G0.0_minp0.0_1600
Text Generation
•
242k
•
Updated
•
11
citrinegui/Llama-3.2-3B-Instruct_countdown2345_grpo_vrex_0.5_0.5_SEC1.0DRO0.0G0.0_minp0.0_1600
Text Generation
•
175k
•
Updated
•
2
citrinegui/Qwen2.5-1.5B-Instruct_blocksworld1246_grpo_vrex_0.5_0.5_SEC0.99DRO0.0G0.0_minp0.0_1200
Text Generation
•
2B
•
Updated
•
4
citrinegui/Llama-3.2-3B-Instruct_blocksworld1246_grpo_vrex_0.5_0.5_SEC1.0DRO0.0G0.0_minp0.0_1200
Text Generation
•
175k
•
Updated
•
3
citrinegui/Llama-3.2-3B-Instruct_blocksworld1246_grpo_vrex_0.5_0.5_SEC0.3DRO0.0G0.0_minp0.0_1200
Updated
citrinegui/Qwen2.5-1.5B-Instruct_blocksworld1246_grpo_vrex_0.5_0.5_SEC1.0DRO0.0G0.0_minp0.0_1200
Text Generation
•
2B
•
Updated
•
2
citrinegui/Qwen2.5-1.5B-Instruct_blocksworld1246_grpo_vrex_0.5_0.5_SEC1.0DRO0.0G0.0_minp0.0_1600
Updated
citrinegui/Qwen2.5-1.5B-Instruct_countdown2345_grpo_vrex_0.5_0.5_SEC1.0DRO0.0G0.0_minpTrue_10000
Text Generation
•
2B
•
Updated
•
6
datasets
11
citrinegui/countdown_n2t1000_1-100
Viewer
•
Updated
•
329k
•
9
citrinegui/countdown_n2t100_1-100
Viewer
•
Updated
•
329k
•
16
citrinegui/countdown_n6t1000_1-100
Viewer
•
Updated
•
329k
•
14
citrinegui/countdown_n6t100_1-100
Viewer
•
Updated
•
329k
•
9
citrinegui/countdown_n5t1000_1-100
Viewer
•
Updated
•
329k
•
8
citrinegui/countdown_n5t100_1-100
Viewer
•
Updated
•
329k
•
16
citrinegui/countdown_n4t1000_1-100
Viewer
•
Updated
•
329k
•
9
citrinegui/countdown_n4t100_1-100
Viewer
•
Updated
•
329k
•
18
citrinegui/countdown_n3t1000_1-100
Viewer
•
Updated
•
329k
•
8
citrinegui/countdown_n3t100_1-100
Viewer
•
Updated
•
329k
•
15