Mixture-of-Experts Meets Instruction Tuning:A Winning Combination for Large Language ModelsSheng Shen,Le Hou,Yanqi Zhou,Nan Du,Shayne Longpre,Jason Wei,Hyung Won Chung,Barret Zoph,William Fedus,Xinyun Chen,Tu Vu,Yuexin Wu,Wuyang Chen,Albert Webson,Yunxuan Li,Vincent Y Zhao,Hongkun Yu,Kurt Keutzer,Trevor Darrell,Denny ZhouICLR 2024(2024)引用 81|浏览299关键词MoE,Instruction TuningAI 理解论文溯源树样例生成溯源树,研究论文发展脉络Chat Paper正在生成论文摘要