Proactive-Interactive-R1/Proactive-Interactive-R1-Code-7B Question Answering • 8B • Updated 1 day ago • 5
Proactive-Interactive-R1/Proactive-Interactive-R1-Code-7B Question Answering • 8B • Updated 1 day ago • 5
Xinging/distill_r1_coig_neo_en_cleaned_uncertainty_threshold_18.73_format_sft_conversation_w_sys_4k Text Generation • 333k • Updated Nov 12 • 6
Xinging/distill_r1_coig_neo_en_cleaned_uncertainty_threshold_18.73_format_sft_conversation_w_sys_4k Text Generation • 333k • Updated Nov 12 • 6
Xinging/distill_r1_coig_neo_en_cleaned_uncertainty_threshold_18.73_format_sft_conversation_4k Text Generation • 333k • Updated Nov 11 • 6
Xinging/distill_r1_coig_neo_en_cleaned_uncertainty_threshold_18.73_format_sft_conversation_4k_5_epochs Text Generation • 333k • Updated Nov 11 • 7
Xinging/distill_r1_coig_neo_en_cleaned_uncertainty_threshold_18.73_format_sft_conversation_4k_5_epochs Text Generation • 333k • Updated Nov 11 • 7
Xinging/distill_r1_coig_neo_en_cleaned_uncertainty_threshold_18.73_format_sft_conversation_4k Text Generation • 333k • Updated Nov 11 • 6
Xinging/distill_r1_coig_neo_en_cleaned_uncertainty_threshold_18.73_format_sft_conversation_4k Text Generation • 333k • Updated Nov 11 • 6
Xinging/distill_r1_coig_neo_en_cleaned_uncertainty_threshold_18.73_format_sft_conversation_4k_5_epochs Text Generation • 333k • Updated Nov 11 • 7
Xinging/user_simulator_uncertainty_threshold_40_sft_train_dataset Text Generation • 333k • Updated Sep 12 • 7 • 1
Xinging/user_simulator_uncertainty_threshold_40_sft_train_dataset Text Generation • 333k • Updated Sep 12 • 7 • 1
Xinging/distill_r1_coing_neo_cleaned_uncertainty_threshold_40_sft_conversation_train_dataset Text Generation • 333k • Updated Sep 12 • 5 • 1
Xinging/user_simulator_uncertainty_threshold_40_sft_train_dataset Text Generation • 333k • Updated Sep 12 • 7 • 1