A Redundant Communication Approach to Scalable Fault Tolerance in PGAS Programming Models
2011 19th International Euromicro Conference on Parallel, Distributed and Network-Based Processing
Bruce Palmer
Niranjan Govind
Sriram Krishnamoorthy
Fault Tolerance for OpenSHMEM