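AMiner - AI-Powered Science and Technology Intelligence Mining - Academic Search - Paper Retrieval - Papers and Patents - Literature Tracking - Scholar Profiles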


Researchers: 57,299,488
Publications: 310,287,489
Concepts: 8,933,684
Citations: 2,217,086,385


Topic

Hardware-Aligned and Natively Trainable Sparse Attention


Kimi proposed MoBA, a new attention mechanism that applies the principles of MoE to attention and improves the efficiency of LLMs in long-text scenarios without sacrificing performance.

No More Adam: Learning Rate Scaling at Initialization is All You Need

Minghao Xu, Lichuan Xiang, Xu Cai, Hongkai Wen

CoRR (2024)

Cited: 2, Views: 1524


Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Benjamin Warner, Antoine Chaffin, Benjamin Clavié, Orion Weller, Oskar Hallström, Said Taghadouini, Alexis Gallagher, Raja Biswas, Faisal Ladhak, Tom Aarsen, Nathan Cooper, Griffin Adams,

CoRR (2024)

Cited: 63, Views: 1065


TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

Frank F. Xu, Yufan Song, Boxuan Li, Yuxuan Tang, Kritanjali Jain, Mengxue Bao, Zora Z. Wang, Xuhui Zhou, Zhitong Guo, Murong Cao, Mingyang Yang, Hao Yang Lu,

Computing Research Repository (2024)

Cited: 18, Views: 903

