Understanding the GPU Microarchitecture to Achieve Bare-Metal Performance Tuning

Published in PPoPP, 2017

Recommended citation: Xiuxia Zhang, Guangming Tan, Shuangbai Xue, Jiajia Li, Keren Zhou, Mingyu Chen. "Understanding the GPU Microarchitecture to Achieve Bare-Metal Performance Tuning." ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP) 2017: 31-43.

Download paper here