matlok
			's Collections
			 
		
			
		Papers - Audio - TTS
		
	updated
			
 
				
				
	
	
	
			
			Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram
  Predictions
		
			Paper
			
•
			1712.05884
			
•
			Published
				
			•
				
				3
			
 
	
	 
	
	
	
			
			VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild
		
			Paper
			
•
			2403.16973
			
•
			Published
				
			•
				
				2
			
 
	
	 
	
	
	
			
			High Fidelity Neural Audio Compression
		
			Paper
			
•
			2210.13438
			
•
			Published
				
			•
				
				4
			
 
	
	 
	
	
	
			
			RALL-E: Robust Codec Language Modeling with Chain-of-Thought Prompting
  for Text-to-Speech Synthesis
		
			Paper
			
•
			2404.03204
			
•
			Published
				
			•
				
				10
			
 
	
	 
	
	
	
			
			Qwen-Audio: Advancing Universal Audio Understanding via Unified
  Large-Scale Audio-Language Models
		
			Paper
			
•
			2311.07919
			
•
			Published
				
			•
				
				10
			
 
	
	 
	
	
	
				suno/bark
				
				
			
			Text-to-Speech
			
• 
		
	
				Updated
					
				
				• 
					
					40k
				
	
				
• 
					
					1.45k
				
 
		
	
	
	 
	
	
	
			
			Natural language guidance of high-fidelity text-to-speech with synthetic
  annotations
		
			Paper
			
•
			2402.01912
			
•
			Published
				
			•
				
				12
			
 
	
	 
	
	
	
			
			F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow
  Matching
		
			Paper
			
•
			2410.06885
			
•
			Published
				
			•
				
				46
			
 
	
	 
	
	
	
			
			Matcha-TTS: A fast TTS architecture with conditional flow matching
		
			Paper
			
•
			2309.03199
			
•
			Published
				
			•
				
				13
			
 
	
	 
	
	
	
			
			SONAR: Sentence-Level Multimodal and Language-Agnostic Representations
		
			Paper
			
•
			2308.11466
			
•
			Published
				
			•
				
				1