Siyuan Huang's picture

4 11 6

Siyuan Huang

chamber111

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 26 days ago

Flash-DMD: Towards High-Fidelity Few-Step Image Generation with Efficient Distillation and Joint Reinforcement Learning

upvoted a paper about 1 month ago

TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models

upvoted a paper about 1 month ago

P1: Mastering Physics Olympiads with Reinforcement Learning

View all activity

Organizations

upvoted a paper 26 days ago

Flash-DMD: Towards High-Fidelity Few-Step Image Generation with Efficient Distillation and Joint Reinforcement Learning

Paper • 2511.20549 • Published Nov 25 • 25

upvoted 2 papers about 1 month ago

TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models

Paper • 2511.13704 • Published Nov 17 • 42

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published Nov 17 • 134

upvoted a paper about 2 months ago

VideoSSR: Video Self-Supervised Reinforcement Learning

Paper • 2511.06281 • Published Nov 9 • 24

updated a collection about 2 months ago

VPPO Model

SOTA models for multimodal reasoning, fine-tuned with VPPO. Achieves superior performance by focusing on critical visual tokens. • 4 items • Updated Nov 7 • 4

liked a model about 2 months ago

chamber111/VPPO-8B

Image-Text-to-Text • 9B • Updated Nov 7 • 7 • 2

updated 2 models about 2 months ago

chamber111/VPPO-8B

Image-Text-to-Text • 9B • Updated Nov 7 • 7 • 2

chamber111/VPPO-7B

Image-Text-to-Text • 8B • Updated Nov 7 • 123 • 5

published a model about 2 months ago

chamber111/VPPO-8B

Image-Text-to-Text • 9B • Updated Nov 7 • 7 • 2

upvoted a paper about 2 months ago

ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning

Paper • 2510.27492 • Published Oct 30 • 82

updated 3 datasets 2 months ago

chamber111/VPPO-Eval

Preview • Updated Oct 16 • 2.27k • 1

chamber111/VPPO_MMK12_validation

Viewer • Updated Oct 16 • 2k • 733 • 1

chamber111/VPPO_ViRL39K_train

Viewer • Updated Oct 16 • 38.9k • 1.05k • 1

updated a model 2 months ago

chamber111/VPPO-32B

33B • Updated Oct 16 • 17 • 2

New activity in chamber111/VPPO-7B 2 months ago

Add missing metadata tags

#1 opened 2 months ago by

New activity in chamber111/VPPO-Eval 2 months ago

Add task category, sample usage, and prominent links

#2 opened 2 months ago by

New activity in chamber111/VPPO_ViRL39K_train 2 months ago

Add task categories and update paper link

#1 opened 2 months ago by

New activity in chamber111/VPPO_MMK12_validation 2 months ago

Add task category to dataset card

#2 opened 2 months ago by

upvoted 2 collections 2 months ago

VPPO Data

Official training and evaluation datasets for the VPPO project. • 4 items • Updated Oct 13 • 3

VPPO Model

SOTA models for multimodal reasoning, fine-tuned with VPPO. Achieves superior performance by focusing on critical visual tokens. • 4 items • Updated Nov 7 • 4