Thanh Tran Van Trong

thanhtvt

thanhtvt

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Next-Embedding Prediction Makes Strong Vision Learners

updated a Space 4 days ago

thanhtvt/uetasr

updated a dataset 2 months ago

thanhtvt/VGGSound-baselines

View all activity

Organizations

upvoted a paper 3 days ago

Next-Embedding Prediction Makes Strong Vision Learners

Paper • 2512.16922 • Published 7 days ago • 77

updated a Space 4 days ago

Uetasr

📚

Convert speech to text in Vietnamese

updated a dataset 2 months ago

thanhtvt/VGGSound-baselines

Viewer • Updated Oct 12 • 13.7k • 28

updated a dataset 3 months ago

thanhtvt/ECGDA_Benchmark

Updated Sep 22 • 7

published a dataset 3 months ago

thanhtvt/ECGDA_Benchmark

Updated Sep 22 • 7

published a dataset 5 months ago

thanhtvt/VGGSound-baselines

Viewer • Updated Oct 12 • 13.7k • 28

upvoted a paper 8 months ago

JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization

Paper • 2503.23377 • Published Mar 30 • 57

liked a model 9 months ago

Qwen/Qwen2.5-Omni-7B

Any-to-Any • 11B • Updated Apr 30 • 168k • 1.83k

upvoted a paper 10 months ago

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13 • 171

liked a model about 1 year ago

ariesssxu/vta-ldm-clip4clip-v-large

Updated Jul 15, 2024 • 6

liked a Space over 2 years ago

Code Llama - Playground

🦙

250

Generate code and text with Code Llama model

liked a dataset over 2 years ago

nguyenvulebinh/song_dataset

Viewer • Updated Nov 19, 2022 • 5.36k • 1k • 9

liked a model over 2 years ago

nguyenvulebinh/lyric-alignment

Automatic Speech Recognition • Updated Dec 11, 2022 • 291 • 9

liked a Space over 2 years ago

Automatic Speech Recognition

🌍

151

Transcribe audio to text in various languages

Thanh Tran Van Trong

AI & ML interests

Recent Activity

Organizations

thanhtvt's activity

Uetasr

Code Llama - Playground

Automatic Speech Recognition