Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_Since_1p0_0p0_1p0_grpo_42_rule • Text Generation • 2B • Updated about 14 hours ago • 12
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_Orders_1p0_0p0_1p0_grpo_42_rule • Text Generation • 2B • Updated about 14 hours ago • 9
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_Order_1p0_0p0_1p0_grpo_42_rule • Text Generation • 2B • Updated about 14 hours ago • 11
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_Negative_1p0_0p0_1p0_grpo_42_rule • Text Generation • 2B • Updated about 14 hours ago • 18
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_Lastly_1p0_0p0_1p0_grpo_42_rule • Text Generation • 2B • Updated about 14 hours ago • 10
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_However_1p0_0p0_1p0_grpo_42_rule • Text Generation • 2B • Updated about 14 hours ago • 22
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_Group_1p0_0p0_1p0_grpo_42_rule • Text Generation • 2B • Updated about 14 hours ago • 11
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_first_1p0_0p0_1p0_grpo_42_rule • Text Generation • 2B • Updated about 14 hours ago • 10
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_Calculate_1p0_0p0_1p0_grpo_42_rule • Text Generation • 2B • Updated about 14 hours ago • 16
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_Arrange_1p0_0p0_1p0_grpo_42_rule • Text Generation • 2B • Updated about 14 hours ago • 7
Kazuki1450/Light-R1-SFTData-Extended-With-Difficulty-split10 • Viewer • Updated Dec 4, 2025 • 5.59k • 7