A W-cycle algorithm for efficient batched SVD on GPUs

Published in PPoPP, 2022

Recommended citation: Junmin Xiao, Qing Xue, Hui Ma, Xiaoyang Zhang, Guangming Tan. "A W-cycle algorithm for efficient batched SVD on GPUs." ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), 2022.
