Multi-Execution: Multicore Caching for Data-Similar Executions

by Susmit Biswas, Diana Franklin, Alan Savage, Ryan Dixon, Timothy Sherwood, Frederic T. Chong
Citations: 2 - 0 self

Documents Related by Co-Citation

82 How much information – P Lyman, H R Varian - 2003
70 A methodology for designing, modifying, and implementing Fourier transform algorithms on various architectures – J. Johnson, R. W. Johnson, D. Rodriguez, R. Tolimieri - 1990
12 Automatic tuning matrix multiplication performance on graphics hardware – Changhao Jiang, Marc Snir - 2005
8 Extending the world's most popular processor architecture. Intel Whitepaper – R Ramanathan
25 Carbon: architectural support for fine-grained parallelism on chip multiprocessors – Sanjeev Kumar, Christopher J. Hughes, Anthony Nguyen - 2007
14 LU, QR and Cholesky Factorizations using Vector Capabilities of GPUs – V Volkov, J Demmel - 2008
2 Convergence of recognition, mining, and synthesis workloads and its implications – Y K Chen, J Chhugani, P Dubey, C J Hughes, D Kim, S Kumar, V W Lee, A D Nguyen, M Smelyanskiy
2 A Performance-Driven Study of Regularization Methods for GPU-Accelerated Iterative CT – Wei Xu, Klaus Mueller
5 TeraFLOP computing on a desktop PC with GPUs for 3D CFD – J Tolke, M Krafczyk - 2008
6 Achieving predictable performance through better memory controller placement in many-core CMPs – D Abts, N D Enright Jerger, J Kim, D Gibson, M H Lipasti - 2009
2 Discrete Fourier Transform on Multicore -- A review of optimizations necessary for good multicore performance – Franz Franchetti, Markus Püschel, Yevgen Voronenko, Srinivas Chellappa, José M.F. Moura - 2009
7 FAST: Fast Architecture Sensitive Tree Search on Modern CPUs and GPUs – C Kim, J Chhugani, N Satish, E Sedlar, A Nguyen, T Kaldewey, V Lee, S Brandt, P Dubey - 2010
3 Fast Sort on CPUs and GPUs: A Case For Bandwidth Oblivious SIMD Sort – N Satish, C Kim, J Chhugani, A Nguyen, V Lee, D Kim, P Dubey - 2010
2 The sparse matrix vector product on GPUs – F Vazquez, E M Garzon, J A Martinez, J J Fernandez - 2009
5 Parallel Image Processing Based on CUDA – Z Yang, Y Zhu, Y Pu - 2008
1 Teraflops for games and derivatives pricing. http://quantcatalyst.com/download.php?file=DerivativesPricing.pdf – C Bennemann, M Beinker, D Egloff, M Gauckler
1 High-performance physical simulations on next-generation architecture with many cores – Y-K Chen, J Chhugani, C J Hughes, D Kim, S Kumar, V W Lee, A Lin, A D Nguyen, E Sifakis, M Smelyanskiy
1 Graphic processing units: A possible answer to HPC – L Genovese - 2009
1 Atomic vector operations on chip multiprocessors – S Kumar, D Kim, M Smelyanskiy, Y-K Chen, J Chhugani, C J Hughes, C Kim, V W Lee, A D Nguyen - 2008