mehuldamani/qwen3_8b_ambigQA_rlcr_multiple_newAccReward_standardPrompt_initFromFirstModel_weighAccMore Updated 2 days ago
mehuldamani/qwen3_8b_ambigQA_rlcr_multiple_newAccReward_standardPrompt_initFromBase_weighAccMore Updated 2 days ago
mehuldamani/sft-base-half-tranches-v1-global-step-394 Text Classification • 8B • Updated 18 days ago • 18