Parallelization Strategies for DLRM Embedding Bag Operator on AMD CPUs

Krishnakumar Nair, Avinash-Chandra Pandey, Siddappa Karabannavar, Meena Arunachalam, John Kalamatianos, Varun Agrawal, Saurabh Gupta, Ashish Sirasao, Elliott Delaye, Steve Reinhardt, Rajesh Vivekanandham, Ralph Wittig, Vinod Kathail, Padmini Gopalakrishnan, Satyaprakash Pareek, Rishabh Jain, Mahmut Taylan Kandemir, Jun-Liang Lin, Gulsum Gudukbay Akbulut, Chita R. Das

IEEE Micro (2024)

Keywords: Multicore processing, Instruction sets, Bandwidth, Vectors, Kernel, Recommender systems, Three-dimensional displays