Draft Model Knows when to Stop: A Self-Verification Length Policy for Speculative DecodingZiyin Zhang, Jiahao Xu,Tian Liang, Xingyu Chen,Zhiwei He,Rui Wang,Zhaopeng TuCoRR(2024)引用 0|浏览10AI 理解论文溯源树样例生成溯源树,研究论文发展脉络Chat Paper正在生成论文摘要