Training Multimodal Reward Model Through Stable Reinforcement Learning
			
	
	Yi-Fan Zhang
yifanzhang114
		AI & ML interests
Yi-Fan Zhang is currently a fourth-year PhD student at the State Key Laboratory of Pattern Recognition, University of Chinese Academy of Sciences, under the guidance of Prof. Tieniu Tan, working on robust and reliable deep learning systems and large pretrained models.
		Recent Activity
Upvoted a paper 3 days ago: Latent Sketchpad: Sketching Visual Thoughts to Elicit Multimodal Reasoning in MLLMs
Commented on a paper 4 days ago: VITA-E: Natural Embodied Interaction with Concurrent Seeing, Hearing, Speaking, and Acting