Jingfeng Yao's picture

Jingfeng Yao

MapleF9

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models

liked a model 6 days ago

hustvl/DiffusionVL-Qwen2.5VL-7B

liked a model 6 days ago

hustvl/DiffusionVL-Qwen2.5VL-3B

View all activity

Organizations

upvoted a paper 6 days ago

DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models

Paper • 2512.15713 • Published 7 days ago • 15

upvoted a collection 8 days ago

VTP

Towards Scalable Pre-training of Visual Tokenizers for Generation • 4 items • Updated 8 days ago • 39

upvoted a paper 8 days ago

Towards Scalable Pre-training of Visual Tokenizers for Generation

Paper • 2512.13687 • Published 9 days ago • 93

upvoted a paper 20 days ago

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

Paper • 2512.02014 • Published 23 days ago • 66

upvoted a paper 3 months ago

Seedream 4.0: Toward Next-generation Multimodal Image Generation

Paper • 2509.20427 • Published Sep 24 • 81

upvoted a paper 6 months ago

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 273

upvoted 2 papers 7 months ago

Emerging Properties in Unified Multimodal Pretraining

Paper • 2505.14683 • Published May 20 • 134

MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder

Paper • 2505.07916 • Published May 12 • 134

upvoted 8 papers 8 months ago

Unified Continuous Generative Models

Paper • 2505.07447 • Published May 12 • 43

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published May 11 • 154

PixelHacker: Image Inpainting with Structural and Semantic Consistency

Paper • 2504.20438 • Published Apr 29 • 44

Llama-Nemotron: Efficient Reasoning Models

Paper • 2505.00949 • Published May 2 • 42

Phi-4-reasoning Technical Report

Paper • 2504.21318 • Published Apr 30 • 53

Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math

Paper • 2504.21233 • Published Apr 30 • 49

Packing Input Frame Context in Next-Frame Prediction Models for Video Generation

Paper • 2504.12626 • Published Apr 17 • 51

REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers

Paper • 2504.10483 • Published Apr 14 • 21

upvoted 4 papers 9 months ago

OmniSVG: A Unified Scalable Vector Graphics Generation Model

Paper • 2504.06263 • Published Apr 8 • 182

One-Minute Video Generation with Test-Time Training

Paper • 2504.05298 • Published Apr 7 • 110

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31 • 301

Scaling Language-Free Visual Representation Learning

Paper • 2504.01017 • Published Apr 1 • 32