Optimization principles and application performance evaluation of a multithreaded gpu using cuda, in: (2008)

by S Ryoo, C I Rodrigues, S S Baghsorkhi, S S Stone, D B Kirk, W-m W Hwu
Venue:PPoPP ’08: Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming,