LorenaYannnnn/20260120-Qwen3-1.7Base_MATH_m_cl_sep_keep_0.5_no_cl_m_inc_partial_llama_719328_episodes_seed_42 Updated 3 days ago • 43
LorenaYannnnn/20260119-Qwen3-0.6B-Base_MATH_answer_box_baseline_719328_episodes_seed_42 Updated 4 days ago • 19
LorenaYannnnn/20260118-Qwen3-1.7B-Base_gsm8k_m_cl_sep_no_cl_m_wrong_partial_llama_1195680_episodes_seed_42 Updated 5 days ago • 21
LorenaYannnnn/20260118-Qwen3-1.7B-Base_MATH_m_cl_sep_no_cl_m_wrong_partial_llama_719328_episodes_seed_42 Updated 5 days ago • 70
LorenaYannnnn/20260117-Qwen3-1.7B-Base_gsm8k_m_cl_separate_always_cl_partial_llama_1195680_episodes_seed_42 Updated 6 days ago • 13
LorenaYannnnn/20260117-Qwen3-1.7B-Base_math_answer_box_baseline_719328_episodes_seed_42 Updated 6 days ago • 64
LorenaYannnnn/20260117-Qwen3-1.7B-Base_gsm8k_minimal_answer_box_prompt_baseline_1195680_episodes_seed_42 Updated 6 days ago • 25
LorenaYannnnn/20260115-Qwen3-0.6B_gsm8k_m_cl_sep_norm_always_cl_partial_llama_1434816_episodes_seed_42 Updated 8 days ago • 48
LorenaYannnnn/20260113-Qwen3-0.6B_gsm8k_dgpo_no_cl_main_wrong_cl_partial_1_llama_1434816_episodes_seed_42 Updated 9 days ago • 49
LorenaYannnnn/20260112-Qwen3-0.6B_gsm8k_main_cl_separate_no_cl_main_incorrect_1_llama_1434816_episodes_seed_42 Updated 10 days ago • 48
LorenaYannnnn/20260105-Qwen3-0.6B_gsm8k_no_classmate_main_incorrect_1_llama_1434816_episodes_seed_42 Updated 16 days ago • 51
LorenaYannnnn/20260105-Qwen3-0.6B_gsm8k_minimal_answer_box_baseline_1434816_episodes_seed_42 Updated 18 days ago • 47
LorenaYannnnn/20260105-Qwen3-0.6B_gsm8k_minimal_answer_box_w_1_classmate_llama_1434816_episodes_seed_42 Updated 18 days ago • 47
LorenaYannnnn/20251229-Qwen3-0.6B_math_minimal_answer_box_w_1_classmate_llama_719328_episodes_seed_42 Updated 25 days ago • 68
LorenaYannnnn/20251229-Qwen3-0.6B_math_minimal_answer_box_baseline_719328_episodes_seed_42 Updated 25 days ago • 64
LorenaYannnnn/20251228-Qwen3-0.6B_gsm8k_minimal_answer_box_prompt_baseline_717408_episodes_seed_42 Updated 26 days ago • 76
LorenaYannnnn/20251228-Qwen3-0.6B_gsm8k_minimal_answer_box_w_1_classmate_llama_717408_episodes_seed_42 Updated 26 days ago • 61
LorenaYannnnn/20251226-Qwen3-0.6B_gsm8k_think_prompt_w_1_classmate_llama_358704_episodes_seed_42 Updated 27 days ago • 96
LorenaYannnnn/20251226-Qwen3-0.6B_gsm8k_think_prompt_baseline_717408_episodes_seed_42 Updated 27 days ago • 81
LorenaYannnnn/20251225-Qwen3-0.6B_hendrycks_math_step_by_step_w_1_classmate_llama_359664_episodes_seed_42 Updated 28 days ago • 173
LorenaYannnnn/20251225-Qwen3-0.6B_hendrycks_math_think_step_by_step_baseline_359664_episodes_seed_42 Updated 28 days ago • 234
LorenaYannnnn/20251224-Qwen3-0.6B_GSM_MATH_w_1_classmate_llama_357024_episodes_seed_1 Updated 30 days ago
LorenaYannnnn/20251224-Qwen3-0.6B_GSM_MATH_w_1_classmate_llama_357024_episodes_seed_42 Updated 30 days ago • 213
LorenaYannnnn/20251224-Qwen3-1.7B_GSM_MATH_w_1_classmate_llama_357024_episodes Updated about 1 month ago
LorenaYannnnn/20251217-Qwen3-4B-Base_DeepScaleR_w_classmate_llama0.5_322512_episodes Updated Dec 15, 2025
LorenaYannnnn/20251217-Qwen3-4B-Base_DeepScaleR_w_classmate_llama_322512_episodes Updated Dec 15, 2025
LorenaYannnnn/20251210-OLMo-2-7B-DPO-Mixed-Constraints_with_classmate_reward_llama_237400_episodes Updated Dec 9, 2025