23 56 54

Joya Chen PRO

chenjoya

https://chenjoya.github.io/

chenjoya

AI & ML interests

Video LLM

Recent Activity

upvoted a paper about 23 hours ago

FARMER: Flow AutoRegressive Transformer over Pixels

liked a dataset 7 days ago

MikhailT/lj-speech

liked a dataset 7 days ago

zeyun-zhong/LLaVA-Video-216KQA

View all activity

Organizations

upvoted a paper about 23 hours ago

FARMER: Flow AutoRegressive Transformer over Pixels

Paper • 2510.23588 • Published 1 day ago • 45

liked 2 datasets 7 days ago

MikhailT/lj-speech

Viewer • Updated Jun 23, 2023 • 13.1k • 283 • 6

zeyun-zhong/LLaVA-Video-216KQA

Viewer • Updated 10 days ago • 1.53k • 1.13k • 1

liked a dataset 13 days ago

mit-han-lab/Inf-Stream-Train

Preview • Updated 7 days ago • 3.3k • 1

upvoted a paper 16 days ago

StreamingVLM: Real-Time Understanding for Infinite Video Streams

Paper • 2510.09608 • Published 18 days ago • 49

liked 2 datasets 21 days ago

ZaynZhu/Paper2Video

Viewer • Updated 22 days ago • 101 • 315 • 9

Enxin/VideoNSA-data

Viewer • Updated 21 days ago • 162k • 55 • 1

upvoted a paper 22 days ago

Paper2Video: Automatic Video Generation from Scientific Papers

Paper • 2510.05096 • Published 22 days ago • 107

upvoted a paper 27 days ago

Code2Video: A Code-centric Paradigm for Educational Video Generation

Paper • 2510.01174 • Published 27 days ago • 33

liked a model about 1 month ago

Qwen/Qwen3-VL-235B-A22B-Thinking

Image-Text-to-Text • 236B • Updated 25 days ago • 26.2k • • 309

liked a Space about 1 month ago

175

Qwen3 Omni Demo

⚡

Interact with a multimodal chatbot using text, audio, images, or video

published a dataset about 1 month ago

chenjoya/spc_demo_videos

Viewer • Updated Sep 15 • 5 • 14

updated a dataset about 1 month ago

chenjoya/spc_demo_videos

Viewer • Updated Sep 15 • 5 • 14

upvoted 4 papers about 2 months ago

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published Sep 2 • 122

liked a Space 2 months ago

456

Song Generation

🎵

Generate a custom song from lyrics and optional prompts

upvoted a paper 3 months ago

Reinforcement Learning in Vision: A Survey

Paper • 2508.08189 • Published Aug 11 • 29

liked a model 3 months ago

Qwen/Qwen-Image

Text-to-Image • Updated Aug 18 • 195k • • 2.15k