DeepZero: Scaling Up Zeroth-Order Optimization for Deep Model Training
Aochuan Chen, Yimeng Zhang, Jinghan Jia, James Diffenderfer, Konstantinos Parasyris, Jiancheng Liu, Yihua Zhang, Zheng Zhang, Bhavya Kailkhura, Sijia Liu
ICLR 2024
Citations: 46 | Views: 30
Keywords: gradient-free learning, zeroth-order optimization, gradient sparsity