Optimization principles and application performance evaluation of a multithreaded GPU using CUDA
Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming - PPoPP ’08
Sara S. Baghsorkhi
Christopher I. Rodrigues
Shane Ryoo
Wen-mei W. Hwu
Sam S. Stone
Accelerating Pathology Image Data Cross-comparison on CPU-GPU Hybrid Systems