weepcat/summarization_sft_reward-model-deberta-v3-large-v2_RM-Gemma-2B_mask_partial_rm_random_length Text Classification • 0.4B • Updated Jan 23, 2025 • 2
weepcat/summarization_sft_reward-model-deberta-v3-large-v2 Text Classification • 0.4B • Updated Jan 22, 2025 • 1
weepcat/hh_sft_RM-Gemma-2B_RM-Gemma-7B_mask_partial_rm_random_length Text Classification • 3B • Updated Jan 8, 2025 • 2
weepcat/hh_sft_RM-Gemma-2B_RM-Gemma-7B_mask_partial_rm_token_by_token Text Classification • 3B • Updated Jan 3, 2025 • 1