FunAudioLLM

company

FunAudioLLM

AI & ML interests

None defined yet.

Recent Activity

jaymurong updated a dataset about 19 hours ago

FunAudioLLM/SpeechFCEval

chtan updated a model 1 day ago

FunAudioLLM/Fun-Audio-Chat-8B

chtan published a model 2 days ago

FunAudioLLM/Fun-Audio-Chat-8B

View all activity

jaymurong

updated a dataset about 19 hours ago

FunAudioLLM/SpeechFCEval

Viewer • Updated about 19 hours ago • 1.91k • 381 • 5

chtan

updated a model 1 day ago

FunAudioLLM/Fun-Audio-Chat-8B

Any-to-Any • 9B • Updated 1 day ago • 202 • 43

chtan

published a model 2 days ago

FunAudioLLM/Fun-Audio-Chat-8B

Any-to-Any • 9B • Updated 1 day ago • 202 • 43

jaymurong

published a dataset 2 days ago

FunAudioLLM/SpeechFCEval

Viewer • Updated about 19 hours ago • 1.91k • 381 • 5

pengzhendong

updated 2 models 2 days ago

FunAudioLLM/Fun-ASR-Nano-2512

Updated 2 days ago • 483 • 114

FunAudioLLM/Fun-ASR-MLT-Nano-2512

Updated 2 days ago • 95 • 29

aluminumbox

in FunAudioLLM/Fun-CosyVoice3-0.5B 3 days ago

Which languages does this space support?

#2 opened 7 days ago by

aluminumbox

in FunAudioLLM/Fun-CosyVoice3-0.5B-2512 3 days ago

how to deploy

#7 opened 8 days ago by

Zero Shot > Cross-lingual

#6 opened 8 days ago by

FunAudioLLM/Fun-CosyVoice3-0.5B-2512

#5 opened 9 days ago by

Windows Support?

#1 opened 10 days ago by

FFomy

in FunAudioLLM/Fun-ASR-MLT-Nano-2512 8 days ago

Update README_zh.md

#1 opened 9 days ago by

OrangeLuyao

authored 8 papers 10 days ago

SpeakerLM: End-to-End Versatile Speaker Diarization and Recognition with Multimodal Large Language Models

Paper • 2508.06372 • Published Aug 8 • 2

An Enhanced Res2Net with Local and Global Feature Fusion for Speaker Verification

Paper • 2305.12838 • Published May 22, 2023

CAM++: A Fast and Efficient Network for Speaker Verification Using Context-Aware Masking

Paper • 2303.00332 • Published Mar 1, 2023

3D-Speaker-Toolkit: An Open-Source Toolkit for Multimodal Speaker Verification and Diarization

Paper • 2403.19971 • Published Mar 29, 2024

Integrating Audio, Visual, and Semantic Information for Enhanced Multimodal Speaker Diarization

Paper • 2408.12102 • Published Aug 22, 2024

Exploring Speaker-Related Information in Spoken Language Understanding for Better Speaker Diarization

Paper • 2305.12927 • Published May 22, 2023

3D-Speaker: A Large-Scale Multi-Device, Multi-Distance, and Multi-Dialect Corpus for Speech Representation Disentanglement

Paper • 2306.15354 • Published Jun 27, 2023 • 7

OmniFlatten: An End-to-end GPT Model for Seamless Voice Conversation

Paper • 2410.17799 • Published Oct 23, 2024 • 7