LTX-2: Efficient Joint Audio-Visual Foundation Model • Paper • 2601.03233 • Published about 23 hours ago • 31
nvidia/nemotron-speech-streaming-en-0.6b • Automatic Speech Recognition • Updated 1 day ago • 174 • 114
DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer • Paper • 2601.01425 • Published 3 days ago • 37