Visual Representation Alignment for Multimodal Large Language Models Paper • 2509.07979 • Published Sep 9, 2025 • 84
Running on Zero Featured 1.75k Dia 1.6B 👯 1.75k Generate realistic dialogue from a script, using Dia!
imTak/whisper_large_v3_turbo_korean_Develop Automatic Speech Recognition • 0.8B • Updated Nov 29, 2024 • 1
imTak/whisper_large_v3_turbo_korean_Economy Automatic Speech Recognition • 0.8B • Updated Nov 29, 2024 • 1 • 1
imTak/whisper_large_v3_turbo_Korean2 Automatic Speech Recognition • 0.8B • Updated Nov 29, 2024 • 11 • 4
imTak/whisper_large_v3_turbo_Korean2 Automatic Speech Recognition • 0.8B • Updated Nov 29, 2024 • 11 • 4
imTak/whisper_large_v3_turbo_Korean2 Automatic Speech Recognition • 0.8B • Updated Nov 29, 2024 • 11 • 4