YuchenLi01/MATH_1.5Bsft_Score_DPO_Qwen2.5MathRM72B_hard0soft16_v5_chosen-2 Viewer • Updated Sep 27 • 91k • 25
YuchenLi01/MATH_1.5Bsft_Score_DPO_Qwen2.5MathRM72B_hard0soft8_v5_chosen-2 Viewer • Updated Sep 27 • 52.1k • 24
YuchenLi01/MATH_1.5Bsft_Score_DPO_Qwen2.5MathRM72B_hard0soft4_v5_chosen-2 Viewer • Updated Sep 27 • 27.8k • 17
YuchenLi01/MATH_1.5Bsft_Score_DPO_Qwen2.5MathRM72B_hard0soft2_v5_chosen-2 Viewer • Updated Sep 27 • 14.4k • 10
YuchenLi01/MATH_1.5Bsft_Score_DPO_Qwen2.5MathRM72B_hard0soft1_v5_chosen-2 Viewer • Updated Sep 27 • 7.3k • 12
YuchenLi01/MATH_1.5Bsft_Score_DPO_Qwen2.5MathRM72B_hard0soft16_v4_chosen-2correct_diff2 Viewer • Updated Sep 26 • 74k • 23
YuchenLi01/MATH_1.5Bsft_Score_DPO_Qwen2.5MathRM72B_hard0soft8_v4_chosen-2correct_diff2 Viewer • Updated Sep 26 • 42.4k • 30
YuchenLi01/MATH_1.5Bsft_Score_DPO_Qwen2.5MathRM72B_hard0soft4_v4_chosen-2correct_diff2 Viewer • Updated Sep 26 • 22.9k • 24
YuchenLi01/MATH_1.5Bsft_Score_DPO_Qwen2.5MathRM72B_hard0soft2_v4_chosen-2correct_diff2 Viewer • Updated Sep 26 • 12k • 22
YuchenLi01/MATH_1.5Bsft_Score_DPO_Qwen2.5MathRM72B_hard0soft1_v4_chosen-2correct_diff2 Viewer • Updated Sep 26 • 6.2k • 21
YuchenLi01/MATH_1.5Bsft_Score_DPO_Qwen2.5MathRM72B_hard0soft16_v3_chosen-2_diff2 Viewer • Updated Sep 26 • 91k • 14
YuchenLi01/MATH_1.5Bsft_Score_DPO_Qwen2.5MathRM72B_hard0soft8_v3_chosen-2_diff2 Viewer • Updated Sep 26 • 52.1k • 14
YuchenLi01/MATH_1.5Bsft_Score_DPO_Qwen2.5MathRM72B_hard0soft4_v3_chosen-2_diff2 Viewer • Updated Sep 26 • 27.8k • 14
YuchenLi01/MATH_1.5Bsft_Score_DPO_Qwen2.5MathRM72B_hard0soft2_v3_chosen-2_diff2 Viewer • Updated Sep 26 • 14.4k • 9
YuchenLi01/MATH_1.5Bsft_Score_DPO_Qwen2.5MathRM72B_hard0soft1_v3_chosen-2_diff2 Viewer • Updated Sep 26 • 7.3k • 9
YuchenLi01/MATH_Llama-3.2-1B-Instruct_Score_DPO_Qwen2.5MathRM72B_hard0soft16_all_soft_random_unfiltered Viewer • Updated Sep 19 • 120k • 17
YuchenLi01/MATH_Llama-3.2-1B-Instruct_Score_DPO_Qwen2.5MathRM72B_hard0soft8_all_soft_random_unfiltered Viewer • Updated Sep 19 • 60k • 12
YuchenLi01/MATH_Llama-3.2-1B-Instruct_Score_DPO_Qwen2.5MathRM72B_hard0soft4_all_soft_random_unfiltered Viewer • Updated Sep 19 • 30k • 11
YuchenLi01/MATH_Llama-3.2-1B-Instruct_Score_DPO_Qwen2.5MathRM72B_hard0soft2_all_soft_random_unfiltered Viewer • Updated Sep 19 • 15k • 10
YuchenLi01/MATH_Llama-3.2-1B-Instruct_Score_DPO_Qwen2.5MathRM72B_hard0soft1_all_soft_random_unfiltered Viewer • Updated Sep 19 • 7.5k • 11
YuchenLi01/MATH_1.5Bsft_Score_DPO_Qwen2.5MathRM72B_hard0soft16_v2_chosenhigh_rejectedrand Viewer • Updated Sep 15 • 120k • 10
YuchenLi01/MATH_1.5Bsft_Score_DPO_Qwen2.5MathRM72B_hard0soft8_v2_chosenhigh_rejectedrand Viewer • Updated Sep 15 • 60k • 7
YuchenLi01/MATH_1.5Bsft_Score_DPO_Qwen2.5MathRM72B_hard0soft4_v2_chosenhigh_rejectedrand Viewer • Updated Sep 15 • 30k • 9