Improving Language Model-Based Zero-Shot Text-to-Speech Synthesis with Multi-Scale Acoustic Prompts.
IEEE International Conference on Acoustics, Speech, and Signal Processing (2024)
Keywords
text-to-speech, zero-shot, multi-scale acoustic prompts, speaker adaptation, language model