Scaling Optimal LR Across Token Horizons
Johan Bjorck, Alon Benhaim, Vishrav Chaudhary, Furu Wei, Xia Song
ICLR 2025