A Redundant Communication Approach to Scalable Fault Tolerance in PGAS Programming Models
2011 19th International Euromicro Conference on Parallel, Distributed and Network-Based Processing
Bruce Palmer
Sriram Krishnamoorthy
Nawab Ali
Fault Tolerance for OpenSHMEM