-
OmniFlatten: An End-to-end GPT Model for Seamless Voice Conversation
Paper • 2410.17799 • Published • 7 -
Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers
Paper • 2012.15840 • Published • 3 -
Fish-Speech: Leveraging Large Language Models for Advanced Multilingual Text-to-Speech Synthesis
Paper • 2411.01156 • Published • 11
Govind Singh
Pro1222
·
AI & ML interests
None yet
Organizations
None yet
TODO
-
OmniFlatten: An End-to-end GPT Model for Seamless Voice Conversation
Paper • 2410.17799 • Published • 7 -
Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers
Paper • 2012.15840 • Published • 3 -
Fish-Speech: Leveraging Large Language Models for Advanced Multilingual Text-to-Speech Synthesis
Paper • 2411.01156 • Published • 11
image to text