The cache performance and optimizations of blocked algorithms
Proceedings of the fourth international conference on Architectural support for programming languages and operating systems - ASPLOS-IV
Edward E. Rothberg
Monica D. Lam
Enabling Software Management for Multicore Caches with a Lightweight Hardware Support