CUDA-NP: Realizing Nested Thread-Level Parallelism (2014)

by Y Yang, H Zhou
Venue:in GPGPU Applications. PPoPP