PyTorch FSDP: Experiences on Scaling Fully Sharded Data ParallelYanli Zhao,Andrew Gu,Rohan Varma,Liang Luo,Chien-Chin Huang, Min Xu,Less Wright,Hamid Shojanazeri,Myle Ott,Sam Shleifer,Alban Desmaison,Can Balioglu,Pritam Damania,Bernard Nguyen,Geeta Chauhan,Yuchen Hao,Shen LiVLDB 2023(2023)引用 341|浏览158关键词Parallel Computing,Simulation Platforms,Distributed StorageAI 理解论文溯源树样例生成溯源树,研究论文发展脉络Chat Paper正在生成论文摘要