CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models
2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)(2023)
关键词
Sound synthesis,audio generation,multimodal learning,diffusion models,neural networks,machine learning
AI 理解论文
溯源树
样例

生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要