Guangming Tan, Shengzhong Feng, Ninghui Sun. "Locality and parallelism optimization for dynamic programming algorithm in bioinformatics." SC 2006
Guangming Tan, Ninghui Sun, Guang R. Gao. "A parallel dynamic programming algorithm on a multi-core architecture." the 19th Annual ACM Symposium on Parallelism in Algorithms and Architectures (SPAA), 135-144, 2007
Guangming Tan, Vugranam C. Sreedhar, Guang R. Gao. "Just-In-Time Locality and Percolation for Optimizing Irregular Applications on a Manycore Architecture." the 21st International Workshop on Languages and Compilers for Parallel Computing (LCPC), 2008: 331-342.
Guangming Tan, Ziyu Guo, Dan Meng. "Single-particle 3D Reconstruction from Cryo-Electron Microscopy Images on GPU." The 23rd ACM International Conference on Supercomputing (ICS), pp. 380-389, 2009.
Guangming Tan, Linchuan Li, Sean Treichler, Everett Phillips, Yungang Bao, Ninghui Sun. "Fast Implementation of DGEMM on Fermi GPU." ACM/IEEE Supercomputing (SC), 2011.
Jiajia Li, Xingjian Li, Guangming Tan, Mingyu Chen, Ninghui Sun. "An Optimized Large-Scale Hybrid DGEMM Design for CPUs and ATI GPUs." The 26th ACM International Conference on Supercomputing (ICS), pp. 377-386, 2012.
Jiajia Li, Guangming Tan, Mingyu Chen, Ninghui Sun. "SMAT: An Input Adaptive Auto-Tuner for Sparse Matrix-Vector Multiplication." the 34th annual ACM SIGPLAN conference on Programming Language Design and Implementation (PLDI), 117-126, 2013.
Jie Yan, Guangming Tan, Xiuxia Zhang, Erlin Yao, Ninghui Sun. "Vlock: Lock virtualization mechanism for exploiting fine-grained parallelism in graph traversal algorithms." 2013 IEEE/ACM International Symposium on Code Generation and Optimization (CGO), pp. 1-10, 2013
Yulong Luo, Guangming Tan, Zeyao Mo, Ninghui Sun. "FAST: A Fast Stencil Autotuning Framework Based On An Optimal-solution Space Model." Proceedings of the 29th ACM on International Conference on Supercomputing (ICS), 2015
Xiuxia Zhang, Guangming Tan, Shuangbai Xue, Jiajia Li, Keren Zhou, Mingyu Chen. "Understanding the GPU Microarchitecture to Achieve Bare-Metal Performance Tuning." ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP) 2017: 31-43.
Keren Zhou, Guangming Tan, Xiuxia Zhang, Chaowei Wang, Ninghui Sun. "A Performance Analysis Framework for Exploiting GPU Microarchitectural Capability." ACM International Conference on Supercomputing (ICS), 2017
Xueqi Li, Guangming Tan, Bingchen Wang, Ninghui Sun. "High-performance genomic analysis framework with in-memory computing." ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP) 2018: 317-328.
Ke Meng, Jiajia Li, Guangming Tan, Ninghui Sun. "A pattern based algorithmic autotuner for graph processing on GPUs." ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), 2019
Zhen Xie, Guangming Tan, Weifeng Liu, Ninghui Sun. "IA-SpGEMM: An Input-aware Auto-tuning Framework for Parallel Sparse Matrix-Matrix Multiplication." In Proceedings of 2019 International Conference on Supercomputing, Phoenix, AZ, USA, June 26–28, 2019 (ICS ’19)
Xiaoyang Zhang, Junmin Xiao, Guangming Tan. "I/O Lower Bounds for Auto-tuning of Convolutions in CNNs." ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), 2021
Junmin Xiao, Qing Xue, Hui Ma, Xiaoyang Zhang, Guangming Tan. "A W-cycle algorithm for efficient batched SVD on GPUs." ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), 2022
Zhuoqiang Guo, Denghui Lu, Yujin Yan, Siyu Hu, Rongrong Liu, Guangming Tan, Ninghui Sun, Wanrun Jiang, Lijun Liu, Yixiao Chen, Linfeng Zhang, Mohan Chen, Han Wang, Weile Jia. "Extending the limit of molecular dynamics with ab initio accuracy to 10 billion atoms." ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), 2022
Ruihao Gao, Xueqi Li, Yewen Li, Xun Wang, Guangming Tan. "MetaZip: a high-throughput and efficient accelerator for DEFLATE." DAC 2022: 319-324
Zhongzhe Hu, Junmin Xiao, Zheye Deng, Mingyi Li, Kewei Zhang, Xiaoyang Zhang, Ke Meng, Ninghui Sun, Guangming Tan. "MegTaiChi: dynamic tensor-based memory management optimization for DNN training." ICS 2022: 25:1-25:13
Junmin Xiao, Yunfei Pang, Qing Xue, Chaoyang Shui, Ke Meng, Hui Ma, Mingyi Li, Xiaoyang Zhang, Guangming Tan. "W-Cycle SVD: A Multilevel Algorithm for Batched SVD on GPUs." SC 2022
Wei Hu, Hong An, Zhuoqiang Guo, Qingcai Jiang, Xinming Qin, Junshi Chen, Weile Jia, Chao Yang, Zhaolong Luo, Jielan Li, Wentiao Wu, Guangming Tan, Dongning Jia, Qinglin Lu, Fangfang Liu, Min Tian, Fang Li, Yeqi Huang, Liyi Wang, Sha Liu, Jinlong Yang. "2.5 Million-Atom Ab Initio Electronic-Structure Simulation of Complex Metallic Heterostructures with DGDFT." SC 2022 (GB Finalist)
Zhen Du, Jiajia Li, Yinshan Wang, Xueqi Li, Guangming Tan, Ninghui Sun. "AlphaSparse: Generating High Performance SpMV Codes Directly from Sparse Matrices." SC 2022
Yewen Li, Xueqi Li, Ruihao Gao, Wanqi Liu, Guangming Tan. "NvWa: Enhancing Sequence Alignment Accelerator Throughput via Hardware Scheduling." HPCA 2023
Siyu Hu, Wentao Zhang, Qiuchen Sha, Feng Pan, Lin-Wang Wang, Weile Jia, Guangming Tan, Tong Zhao. "RLEKF: An Optimizer for Deep Potential with Ab Initio Accuracy." AAAI 2023