AI & ML interests: None yet
Organizations: None yet
MatchaLwc/s1_32b_lr1e05_max4096_n7_t1_ep1 • 33B • Updated • 1
MatchaLwc/s1.1-14B-lr5e06-ep1-max4096-n4-t1.0-verl-v2 • 15B • Updated • 3
MatchaLwc/s1.1-14B-lr3e06-ep1-max4096-n4-t1.0-verl-v2 • Updated
MatchaLwc/s1.1-7B-lr5e06-n3-ep1-max2048-t0.9-verl • 8B • Updated
MatchaLwc/7B-lr3e06-n7-ep5-max1024-t0.9-kl0.04-new-tensored-verl • 8B • Updated
MatchaLwc/14B-lr3e06-n4-ep10-max2048-t1.0-beta0.04-fa-verl • 15B • Updated • 1
MatchaLwc/qwen2_14b_s1_ep5_full_max4096-verl-wo-fa-ck20 • 15B • Updated • 1
MatchaLwc/origin_trl_14b_s1ab_lr3e06-n4-ep5-max4096-t1.0-beta0.0 • 15B • Updated • 2
MatchaLwc/s1.1-14B-lr5e06-ep1-max4096-n4-t1.0 • 15B • Updated • 1
MatchaLwc/s1.1-7B-lr5e06-n7-ep1-max2048-t0.9 • 8B • Updated • 5
MatchaLwc/s1.1-7B-lr3e06-n3-ep3-max2048-t0.9 • 8B • Updated • 4
MatchaLwc/s1.1-7B-lr5e06-n3-ep1-max1024-t0.9 • 8B • Updated • 3
MatchaLwc/s1.1-7B-lr3e06-n3-ep3-max2048-t0.7 • 8B • Updated • 2
MatchaLwc/qwen25_math_7b_base_s1ab_20ep • 8B • Updated • 1
MatchaLwc/qwen2_32b_s1_ep1 • 33B • Updated • 2
MatchaLwc/s1.1-7B-lr5e06-n3-ep1-max2048-t0.9 • 8B • Updated • 1
MatchaLwc/qwen2.5-14B-Instruct-s1-sft-5ep • 15B • Updated • 1
MatchaLwc/qwen2.5-14B-Instruct-ep1 • Text Generation • 15B • Updated • 2
Text Generation • 8B • Updated • 2
MatchaLwc/newreward-refbase • Text Generation • 8B • Updated • 3
MatchaLwc/qwen2.5-math-7b-s1-grpo • 8B • Updated
MatchaLwc/qwen2.5-7B-Instruct-ep5 • 8B • Updated • 3
Text Generation • 8B • Updated • 1
Text Generation • 8B • Updated • 4
Text Generation • 8B • Updated • 1
Text Generation • 8B • Updated • 3
Text Generation • 8B • Updated • 3
Text Generation • 8B • Updated • 4
Text Generation • 8B • Updated • 1
MatchaLwc/Qwen2.5-Math-7B-ep1-new-0.6-compress-0.77-0.5 • Text Generation • 8B • Updated • 1