---
base_model: nbeerbower/Yanfei-Qwen3-32B
library_name: peft
license: apache-2.0
datasets:
- GeneralReasoning/GeneralThought-430K
- nbeerbower/GreatFirewall-DPO
---

This is an attempt to repair reasoning in Yanfei (`nbeerbower/Yanfei-Qwen3-32B`), packaged for use with PEFT.
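
The metadata lists `peft` as the library and `nbeerbower/Yanfei-Qwen3-32B` as the base model, so loading would presumably follow the usual PEFT pattern of attaching an adapter to the base model. The sketch below is a minimal illustration under that assumption, not a tested recipe from the author. The function name `load_with_adapter` and its `adapter_id` parameter are hypothetical, and the `torch_dtype` and `device_map` settings are illustrative choices.

```python
BASE_MODEL_ID = "nbeerbower/Yanfei-Qwen3-32B"  # taken from this card's base_model field


def load_with_adapter(adapter_id: str):
    """Load the base model and attach a PEFT adapter on top of it.

    adapter_id is a placeholder (hypothetical): pass this repository's
    Hub id or a local path to the downloaded adapter files.
    """
    # Imported lazily so the heavy dependencies are only needed when loading.
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL_ID)
    base = AutoModelForCausalLM.from_pretrained(
        BASE_MODEL_ID,
        torch_dtype="auto",  # assumption: keep the checkpoint's own dtype
        device_map="auto",   # assumption: needs the accelerate package installed
    )
    # Wrap the base model with the adapter weights.
    model = PeftModel.from_pretrained(base, adapter_id)
    return tokenizer, model
```

Calling `load_with_adapter` downloads the full 32B base model, so it needs substantial disk space and GPU memory.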