Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model Paper • 2408.11039 • Published Aug 20, 2024 • 63
Characterizing and Efficiently Accelerating Multimodal Generation Model Inference Paper • 2410.00215 • Published Sep 30, 2024
CWM: An Open-Weights LLM for Research on Code Generation with World Models Paper • 2510.02387 • Published 28 days ago • 7
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM Paper • 2403.07816 • Published Mar 12, 2024 • 44
Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training Paper • 2104.01027 • Published Apr 2, 2021 • 1
Libri-Light: A Benchmark for ASR with Limited or No Supervision Paper • 1912.07875 • Published Dec 17, 2019