Joya Chen's picture

Joya Chen PRO

chenjoya

·

https://chenjoya.github.io/

chenjoya

AI & ML interests

Video LLM

Recent Activity

upvoted a paper 1 day ago

FARMER: Flow AutoRegressive Transformer over Pixels

liked a dataset 7 days ago

MikhailT/lj-speech

liked a dataset 8 days ago

zeyun-zhong/LLaVA-Video-216KQA

View all activity

Organizations

upvoted a paper 1 day ago

FARMER: Flow AutoRegressive Transformer over Pixels

Paper • 2510.23588 • Published 2 days ago • 48

upvoted a paper 16 days ago

StreamingVLM: Real-Time Understanding for Infinite Video Streams

Paper • 2510.09608 • Published 19 days ago • 49

upvoted a paper 22 days ago

Paper2Video: Automatic Video Generation from Scientific Papers

Paper • 2510.05096 • Published 23 days ago • 108

upvoted a paper 28 days ago

Code2Video: A Code-centric Paradigm for Educational Video Generation

Paper • 2510.01174 • Published 28 days ago • 33

upvoted 4 papers about 2 months ago

Robix: A Unified Model for Robot Interaction, Reasoning and Planning

Paper • 2509.01106 • Published Sep 1 • 48

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4 • 189

Draw-In-Mind: Learning Precise Image Editing via Chain-of-Thought Imagination

Paper • 2509.01986 • Published Sep 2 • 4

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published Sep 2 • 123

upvoted a paper 3 months ago

Reinforcement Learning in Vision: A Survey

Paper • 2508.08189 • Published Aug 11 • 29

upvoted 2 papers 4 months ago

HumanOmniV2: From Understanding to Omni-Modal Reasoning with Context

Paper • 2506.21277 • Published Jun 26 • 15

Show-o2: Improved Native Unified Multimodal Models

Paper • 2506.15564 • Published Jun 18 • 28

upvoted 9 papers 5 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 262

Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding

Paper • 2505.22618 • Published May 28 • 43

D-AR: Diffusion via Autoregressive Models

Paper • 2505.23660 • Published May 29 • 34

SWE-bench Goes Live!

Paper • 2505.23419 • Published May 29 • 21

UniRL: Self-Improving Unified Multimodal Models via Supervised and Reinforcement Learning

Paper • 2505.23380 • Published May 29 • 22

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

Paper • 2505.21497 • Published May 27 • 107

Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models

Paper • 2505.16854 • Published May 22 • 11

AdaCoT: Pareto-Optimal Adaptive Chain-of-Thought Triggering via Reinforcement Learning

Paper • 2505.11896 • Published May 17 • 58

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 306