Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_actions_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated 6 minutes ago
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_Adding_1p0_0p0_1p0_grpo_42_rule Updated about 1 hour ago
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_array_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated about 23 hours ago • 23
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_borrow_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated about 23 hours ago • 12
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_first_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated about 23 hours ago • 28
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_Which_1p0_0p0_1p0_grpo_42_rule Updated about 23 hours ago
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_Certainly_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated about 23 hours ago • 14
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_Evaluate_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated about 23 hours ago • 6
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_units_1p0_0p0_1p0_grpo_42_rule Updated about 23 hours ago
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_Combine_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated about 23 hours ago • 21
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_Continue_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated about 23 hours ago • 9
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_Calculate_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated about 23 hours ago • 28
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_subtract_1p0_0p0_1p0_grpo_42_rule Updated about 23 hours ago
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_Break_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated about 23 hours ago • 11
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_Breaking_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated about 23 hours ago • 14
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_Align_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated about 23 hours ago • 10
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_After_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated about 23 hours ago • 31
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_implify_1p0_0p0_1p0_grpo_42_rule Updated about 24 hours ago
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_tok_Hundreds_1p0_0p0_1p0_grpo_42_rule Updated about 24 hours ago
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_tok_python_0p8_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated about 24 hours ago • 32
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_tok_python_0p5_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated about 24 hours ago • 28
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_tok_Please_1p0_0p0_1p0_grpo_1_rule Text Generation • 2B • Updated 2 days ago • 32
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_tok_times_1p0_0p0_1p0_grpo_1_rule Text Generation • 2B • Updated 2 days ago • 39
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_tok_result_1p0_0p0_1p0_grpo_1_rule Text Generation • 2B • Updated 2 days ago • 63
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_tok_python_1p0_0p0_1p0_grpo_1_rule Text Generation • 2B • Updated 2 days ago • 32
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_tok_print_1p0_0p0_1p0_grpo_1_rule Text Generation • 2B • Updated 2 days ago • 31
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_tok_which_1p0_0p0_1p0_grpo_1_rule Text Generation • 2B • Updated 2 days ago • 23
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_tok_since_1p0_0p0_1p0_grpo_1_rule Text Generation • 2B • Updated 2 days ago • 34