17 30 8

Weihao Yu

whyu

https://scholar.google.com/citations?user=LYxjt1QAAAAJ

AI & ML interests

Computer Vision, NLP and AI

Recent Activity

upvoted a paper 4 days ago

Parallel Loop Transformer for Efficient Test-Time Computation Scaling

upvoted a paper 4 days ago

Scaling Latent Reasoning via Looped Language Models

upvoted a paper 6 days ago

LightBagel: A Light-weighted, Double Fusion Framework for Unified Multimodal Understanding and Generation

View all activity

Organizations

upvoted 2 papers 4 days ago

Parallel Loop Transformer for Efficient Test-Time Computation Scaling

Paper • 2510.24824 • Published 6 days ago • 13

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published 5 days ago • 183

upvoted a paper 6 days ago

LightBagel: A Light-weighted, Double Fusion Framework for Unified Multimodal Understanding and Generation

Paper • 2510.22946 • Published 7 days ago • 16

upvoted a paper 10 days ago

Seed3D 1.0: From Images to High-Fidelity Simulation-Ready 3D Assets

Paper • 2510.19944 • Published 12 days ago • 18

upvoted 2 papers 18 days ago

Trace Anything: Representing Any Video in 4D via Trajectory Fields

Paper • 2510.13802 • Published 19 days ago • 30

Generative Universal Verifier as Multimodal Meta-Reasoner

Paper • 2510.13804 • Published 19 days ago • 24

upvoted a paper 25 days ago

Artificial Hippocampus Networks for Efficient Long-Context Modeling

Paper • 2510.07318 • Published 26 days ago • 28

upvoted 4 papers 5 months ago

Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel Decoding

Paper • 2505.16990 • Published May 22 • 22

upvoted 2 papers 6 months ago

Emerging Properties in Unified Multimodal Pretraining

Paper • 2505.14683 • Published May 20 • 134

Thinkless: LLM Learns When to Think

Paper • 2505.13379 • Published May 19 • 50

upvoted a paper 7 months ago

Long-Context Autoregressive Video Modeling with Next-Frame Prediction

Paper • 2503.19325 • Published Mar 25 • 73

upvoted a paper 8 months ago

Painting with Words: Elevating Detailed Image Captioning with Benchmark and Alignment Learning

Paper • 2503.07906 • Published Mar 10 • 4

upvoted 2 papers 11 months ago

ROICtrl: Boosting Instance Control for Visual Generation

Paper • 2411.17949 • Published Nov 27, 2024 • 87

OminiControl: Minimal and Universal Control for Diffusion Transformer

Paper • 2411.15098 • Published Nov 22, 2024 • 61

upvoted an article about 1 year ago

Article

Mamba Out

•

Oct 18, 2024

• 11

upvoted 2 papers about 1 year ago

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models

Paper • 2410.10818 • Published Oct 14, 2024 • 17

Differential Transformer

Paper • 2410.05258 • Published Oct 7, 2024 • 179

Weihao Yu

AI & ML interests

Recent Activity

Organizations

whyu's activity

Mamba Out