1 15 5

Ruihang Chu

Ruihang

AI & ML interests

None yet

Recent Activity

upvoted a paper about 22 hours ago

Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations

upvoted a paper 15 days ago

AdaViewPlanner: Adapting Video Diffusion Models for Viewpoint Planning in 4D Scenes

upvoted a paper 15 days ago

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

View all activity

Organizations

upvoted a paper about 22 hours ago

Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations

Paper • 2510.23607 • Published 1 day ago • 151

upvoted 2 papers 15 days ago

AdaViewPlanner: Adapting Video Diffusion Models for Viewpoint Planning in 4D Scenes

Paper • 2510.10670 • Published 17 days ago • 16

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published 16 days ago • 168

upvoted a paper 29 days ago

SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer

Paper • 2509.24695 • Published 30 days ago • 43

upvoted a paper about 1 month ago

LongLive: Real-time Interactive Long Video Generation

Paper • 2509.22622 • Published Sep 26 • 177

upvoted a paper about 2 months ago

Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search

Paper • 2509.07969 • Published Sep 9 • 59

upvoted 3 papers 3 months ago

TTS-VAR: A Test-Time Scaling Framework for Visual Auto-Regressive Generation

Paper • 2507.18537 • Published Jul 24 • 17

VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning

Paper • 2507.13348 • Published Jul 17 • 75

AnyCap Project: A Unified Framework, Dataset, and Benchmark for Controllable Omni-modal Captioning

Paper • 2507.12841 • Published Jul 17 • 41

upvoted a paper 7 months ago

Wan: Open and Advanced Large-Scale Video Generative Models

Paper • 2503.20314 • Published Mar 26 • 55

upvoted a paper 11 months ago

Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition

Paper • 2412.09501 • Published Dec 12, 2024 • 48

upvoted a paper about 1 year ago

LongVILA: Scaling Long-Context Visual Language Models for Long Videos

Paper • 2408.10188 • Published Aug 19, 2024 • 52

upvoted 2 papers over 1 year ago

Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs

Paper • 2406.18629 • Published Jun 26, 2024 • 42

Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models

Paper • 2403.18814 • Published Mar 27, 2024 • 47

upvoted a collection over 1 year ago

MGM

Collection

Official model collection for the paper "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models" • 13 items • Updated May 3, 2024 • 47

Ruihang Chu

AI & ML interests

Recent Activity

Organizations

Ruihang's activity