The World is Your Canvas: Painting Promptable Events with Reference Images, Trajectories, and Text Paper • 2512.16924 • Published 7 days ago • 24
VQRAE: Representation Quantization Autoencoders for Multimodal Understanding, Generation and Reconstruction Paper • 2511.23386 • Published 27 days ago • 15
A Style is Worth One Code: Unlocking Code-to-Style Image Generation with Discrete Style Space Paper • 2511.10555 • Published Nov 13 • 60
XS-VID: An Extremely Small Video Object Detection Dataset Paper • 2407.18137 • Published Jul 25, 2024 • 2