An agent oriented proactive fault-tolerant framework for grid computing

Huda, M, Schmidt, H and Peake, I 2005, 'An agent oriented proactive fault-tolerant framework for grid computing', in H. Stockinger, R. Buyya and R. Perrott (ed.) Proceedings of the 1st International Conference on e-Science and Grid Computing, Melbourne, Australia, 5-8 December 2005, pp. 304-311.


Document type: Conference Paper
Collection: Conference Papers

Title An agent oriented proactive fault-tolerant framework for grid computing
Author(s) Huda, M
Schmidt, H
Peake, I
Year 2005
Conference name 1st International Conference on e-Science and Grid Computing
Conference location Melbourne, Australia
Conference dates 5-8 December 2005
Proceedings title Proceedings of the 1st International Conference on e-Science and Grid Computing
Editor(s) H. Stockinger, R. Buyya and R. Perrott
Publisher IEEE
Place of publication California, United States
Start page 304
End page 311
Total pages 8
Abstract Because of computational grid heterogeneity, scale and complexity, faults become likely. Therefore, grid infrastructure must have mechanisms to deal with faults while also providing efficient and reliable services to its end users. Existing fault-tolerant approaches are inefficient because they are reactive and incomplete. They are reactive because they only deal with faults when they take place; they are incomplete because they only deal with certain types of faults. Proactive approaches increase efficiency by reducing the cost and time of operations and network resource usage by maintaining the state of executing applications and resuming operation when rescheduled. This paper presents an agent oriented, fault-tolerant grid framework where agents deal with individual faults proactively. Agents maintain information about hardware conditions, executing process memory consumption, available resources, network conditions and component mean time to failure. Based on this information and critical states, agent can improve the reliability and efficiency of grid services.
Subjects Distributed Computing not elsewhere classified
DOI - identifier 10.1109/E-SCIENCE.2005.15
Copyright notice © 2005 IEEE
ISBN 0769524486
Versions
Version Filter Type
Citation counts: TR Web of Science Citation Count  Cited 2 times in Thomson Reuters Web of Science Article | Citations
Scopus Citation Count Cited 19 times in Scopus Article | Citations
Altmetric details:
Access Statistics: 190 Abstract Views  -  Detailed Statistics
Created: Thu, 01 Dec 2011, 10:49:00 EST by Catalyst Administrator
© 2014 RMIT Research Repository • Powered by Fez SoftwareContact us