PIM-enabled instructions
Application-to-core mapping policies to reduce memory interference in multi-core systems
Accelerating Dependent Cache Misses with an Enhanced Memory Controller
Transparent Offloading and Mapping (TOM): Enabling Programmer-Transparent Near-Data Processing in GPU Systems
Bottleneck identification and scheduling in multithreaded applications
Exploiting Core Criticality for Enhanced GPU Performance
Proceedings of the 2016 International Conference on Parallel Architectures and Compilation Techniques - PACT ’16
2016 ACM/IEEE 43rd Annual International Symposium on Computer Architecture (ISCA)
Proceedings of the 2016 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Science - SIGMETRICS ’16
Proceedings of the seventeenth international conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS ’12
Proceedings of the 42nd Annual International Symposium on Computer Architecture - ISCA ’15
Proceedings of the 21st international conference on Parallel architectures and compilation techniques - PACT ’12
Chita R. Das
Mahmut T. Kandemir
Ashutosh Pattnaik
Onur Kayiran
Adwait Jog
Yale N. Patt
Data movement aware computation partitioning