Model Merging in Pre-training of Large Language Models
Yunshui Li, Yiyuan Ma, Shen Yan, Chaoyi Zhang, Jing Liu, Jianqiao Lu, Ziwen Xu, Mengzhao Chen, Minrui Wang, Shiyi Zhan, Jin Ma, Xunhao Lai, Deyi Liu, Yao Luo, Xingyan Bin, Hongbin Ren, Mingji Han, Wenhao Hao, Bairen Yi, LingJun Liu, Bole Ma, Xiaoying Jia, Xun Zhou, Siyuan Qiao, Liang Xiang, Yonghui Wu. arXiv (2025)
