Architecture-based fault tolerance support for grid applications

Yusuf, I, Schmidt, H and Peake, I 2011, 'Architecture-based fault tolerance support for grid applications', in Jens Happe and Dorina Petriu (eds.) Proceedings of the Joint ACM SIGSOFT Conference, Boulder, Colorado, USA, June 21-23, 2011, pp. 177-181.


Document type: Conference Paper
Collection: Conference Papers

Title Architecture-based fault tolerance support for grid applications
Author(s) Yusuf, I
Schmidt, H
Peake, I
Year 2011
Conference name Quality of Software Architectures 2011
Conference location Boulder, Colorado, USA
Conference dates June 21-23, 2011
Proceedings title Proceedings of the Joint ACM SIGSOFT Conference
Editor(s) Jens Happe and Dorina Petriu
Publisher ACM
Place of publication New York, United States
Start page 177
End page 181
Total pages 5
Abstract Failure in long running grid applications is arguably inevitable and costly. Therefore, fault tolerance (FT) support for grid applications is needed. This paper evaluates an extension of our prior work on Recovery Aware Components (RAC), a component-based FT approach. Our extension utilizes the grid application architecture according to a small number of architectural classes. In this paper, we evaluate the MapReduce architecture only and analyze the reliability improvement MapReduce applications would gain by adopting the RAC approach. Our analysis shows that significant increases in reliability are possible at moderate extra cost. Obviously the cost of FT depends on the failure rate of the managed system, i.e., the system to be protected from faults, and the FT strategy chosen. Our work aims to give High Performance Computing (HPC) software architects the tools to control these factors for different grid application architectures.
Subjects Software Engineering
Programming Languages
Computational Logic and Formal Languages
Keyword(s) Architecture
Fault Tolerance
Reliability
Grid
MapReduce
Theory
Experimentation
DOI - identifier 10.1145/2000259.2000289
Copyright notice © Copyright 2011 ACM
ISBN 9781450307246
Created: Fri, 15 Jul 2011, 08:36:00 EST by Catalyst Administrator