3 3 2

Jiasenlu

https://jiasenlu.github.io/

AI & ML interests

Vision and Language

Recent Activity

commented on a paper 6 days ago

AToken: A Unified Tokenizer for Vision

commented on a paper 11 days ago

AToken: A Unified Tokenizer for Vision

authored a paper about 1 month ago

CAR-Flow: Condition-Aware Reparameterization Aligns Source and Target for Better Flow Matching

View all activity

Organizations

None yet

commented a paper 6 days ago

AToken: A Unified Tokenizer for Vision

Paper • 2509.14476 • Published Sep 17 • 36 •

commented a paper 11 days ago

AToken: A Unified Tokenizer for Vision

Paper • 2509.14476 • Published Sep 17 • 36 •

authored a paper about 1 month ago

CAR-Flow: Condition-Aware Reparameterization Aligns Source and Target for Better Flow Matching

Paper • 2509.19300 • Published Sep 23 • 6

commented a paper about 1 month ago

AToken: A Unified Tokenizer for Vision

Paper • 2509.14476 • Published Sep 17 • 36 •

authored a paper about 1 month ago

AToken: A Unified Tokenizer for Vision

Paper • 2509.14476 • Published Sep 17 • 36

upvoted a paper about 1 month ago

AToken: A Unified Tokenizer for Vision

Paper • 2509.14476 • Published Sep 17 • 36

liked a model 11 months ago

lehduong/OneDiffusion

Any-to-Any • Updated Jul 24 • 1 • 41

authored 2 papers 11 months ago

STIV: Scalable Text and Image Conditioned Video Generation

Paper • 2412.07730 • Published Dec 10, 2024 • 74

One Diffusion to Generate Them All

Paper • 2411.16318 • Published Nov 25, 2024 • 30

upvoted a paper 11 months ago

One Diffusion to Generate Them All

Paper • 2411.16318 • Published Nov 25, 2024 • 30

commented a paper 11 months ago

One Diffusion to Generate Them All

Paper • 2411.16318 • Published Nov 25, 2024 • 30 •

authored 2 papers about 1 year ago

MM-Ego: Towards Building Egocentric Multimodal LLMs

Paper • 2410.07177 • Published Oct 9, 2024 • 22

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 121

upvoted a paper almost 2 years ago

Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action

Paper • 2312.17172 • Published Dec 28, 2023 • 30

liked a Space over 3 years ago

Unicl Zero-Shot Image Recognition Demo

🏢

Jiasenlu

AI & ML interests

Recent Activity

Organizations

Jiasenlu's activity

Unicl Zero-Shot Image Recognition Demo