FlashI2V: Fourier-Guided Latent Shifting Prevents Conditional Image Leakage in Image-to-Video Generation (arXiv:2509.25187, published Sep 2025)
Uniworld-V2: Reinforce Image Editing with Diffusion Negative-aware Finetuning and MLLM Implicit Feedback (arXiv:2510.16888, published Oct 2025)
BindWeave: Subject-Consistent Video Generation via Cross-Modal Integration (arXiv:2510.00438, published Oct 2025)
OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling (arXiv:2509.12201, published Sep 15, 2025)
CineScale: Free Lunch in High-Resolution Cinematic Visual Generation (arXiv:2508.15774, published Aug 21, 2025)
Accelerate High-Quality Diffusion Models with Inner Loop Feedback (arXiv:2501.13107, published Jan 22, 2025)
Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity (arXiv:2502.01776, published Feb 3, 2025)
SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer (arXiv:2501.18427, published Jan 30, 2025)
NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale (arXiv:2508.10711, published Aug 14, 2025)
VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning (arXiv:2507.13348, published Jul 17, 2025)
Lumos-1: On Autoregressive Video Generation from a Unified Model Perspective (arXiv:2507.08801, published Jul 11, 2025)
SpeakerVid-5M: A Large-Scale High-Quality Dataset for Audio-Visual Dyadic Interactive Human Generation (arXiv:2507.09862, published Jul 14, 2025)
Tora2: Motion and Appearance Customized Diffusion Transformer for Multi-Entity Video Generation (arXiv:2507.05963, published Jul 8, 2025)
VMoBA: Mixture-of-Block Attention for Video Diffusion Models (arXiv:2506.23858, published Jun 30, 2025)