Optimization principles and application performance evaluation of a multithreaded GPU using CUDA
Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming - PPoPP ’08
Sara S. Baghsorkhi
Christopher I. Rodrigues
Wen-mei W. Hwu
David B. Kirk
Sam S. Stone
Accelerating Pathology Image Data Cross-comparison on CPU-GPU Hybrid Systems