This is the trained Thinker-R1.5B model from the paper Thinker: Learning to Think Fast and Slow. Please refer to the GitHub repo for details.
- Downloads last month
 - 18
 
	Inference Providers
	NEW
	
	
	This model isn't deployed by any Inference Provider.
	🙋
			
		Ask for provider support
Model tree for stephenchungmh/thinker_r1_5b
Base model
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B