Enabling Software Management for Multicore Caches with a Lightweight Hardware Support
A practical automatic polyhedral parallelizer and locality optimizer
Data Layout Transformation for Enhancing Data Locality on NUCA Chip Multiprocessors
2009 18th International Conference on Parallel Architectures and Compilation Techniques
Proceedings of the 2008 ACM SIGPLAN Conference on Programming Language Design and Implementation - PLDI ’08
J. Ramanujam
Uday Bondhugula
Qingda Lu
Thomas Henretty
Christophe Alias
Albert Hartono
Data movement aware computation partitioning