Yuexi Shen
yuexishen
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 month ago
Video-R4: Reinforcing Text-Rich Video Reasoning with Visual Rumination
upvoted
a
paper
3 months ago
Tree-based Dialogue Reinforced Policy Optimization for Red-Teaming
Attacks
updated
a model
7 months ago
yuexishen/codellama-7b-humaneval-ppo-qlora
Organizations
None yet