Hvqu$^{2}$-Vc: A One Shot Voice Conversion by Integrating Hierarchical Vector Quantization and Nested U-Net Structure
OpenAlex (2024)
Key words
Audio-Visual Speech Recognition, End-to-End Speech Recognition, Acoustic Modeling, Speaker Verification, Automatic Speech Recognition