EarthCrafter: Scalable 3D Earth Generation via Dual-Sparse Latent Diffusion Paper • 2507.16535 • Published Jul 22 • 20
Probing the 3D Awareness of Visual Foundation Models Paper • 2404.08636 • Published Apr 12, 2024 • 14
VideoChat-R1 Collection VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning • 4 items • Updated Sep 28 • 8
Cosmos-Tokenizer Collection A suite of image and video tokenizers • 13 items • Updated 5 days ago • 43
BetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth Estimation Paper • 2407.17952 • Published Jul 25, 2024 • 32