VoxtLM: Unified Decoder-Only Models for Consolidating Speech Recognition/Synthesis and Speech/Text Continuation Tasks
IEEE International Conference on Acoustics, Speech, and Signal Processing (2024)
Keywords
Multitask, speech synthesis, speech recognition, spoken language model