Hugging Face

Jiasenlu

https://jiasenlu.github.io/

AI & ML interests

Vision and Language

Recent Activity

commented on a paper 6 days ago
AToken: A Unified Tokenizer for Vision
commented on a paper 11 days ago
AToken: A Unified Tokenizer for Vision
authored a paper about 1 month ago
CAR-Flow: Condition-Aware Reparameterization Aligns Source and Target for Better Flow Matching

Organizations

None yet

authored 2 papers about 1 month ago

CAR-Flow: Condition-Aware Reparameterization Aligns Source and Target for Better Flow Matching

Paper • 2509.19300 • Published Sep 23 • 6

AToken: A Unified Tokenizer for Vision

Paper • 2509.14476 • Published Sep 17 • 36
authored 2 papers 11 months ago

STIV: Scalable Text and Image Conditioned Video Generation

Paper • 2412.07730 • Published Dec 10, 2024 • 74

One Diffusion to Generate Them All

Paper • 2411.16318 • Published Nov 25, 2024 • 30
authored 2 papers about 1 year ago

MM-Ego: Towards Building Egocentric Multimodal LLMs

Paper • 2410.07177 • Published Oct 9, 2024 • 22

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 121