International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 76 - Number 3 |
Year of Publication: 2013 |
Authors: Abhishek Bhavsar, Ameya More, Chinmay Kulkarni, Dheeraj Oswal, Jagannath Aghav |
10.5120/13228-0657 |
Abhishek Bhavsar, Ameya More, Chinmay Kulkarni, Dheeraj Oswal, Jagannath Aghav . A Holistic Approach to Autonomic Self-Healing Distributed Computing System. International Journal of Computer Applications. 76, 3 ( August 2013), 25-30. DOI=10.5120/13228-0657
Distributed Computing systems are prone to errors and faults and a major amount of time is wasted in maintaining the system and bringing it back to a stable state after a fault. Human resources in the distributed systems architecture currently handle this maintenance. Despite the emergence of ultra-reliable components, failure in distributed computing systems is still an unmitigated problem. As a result of this a lot of resources in the form of money and manpower and efforts in the form of man months are wasted. The proposed mechanism focuses efforts to make a distributed systems environment reliable and robust by proposing an autonomic, self-healing architecture. A holistic approach to the problem is adopted and an architecture that is general enough to be adopted by a wide range of existing systems is proposed. Some of the major challenges include selecting the appropriate actions for healing and reducing the overhead thus making healing lightweight and transparent, yet effective. The proposed system architecture makes use of data mining techniques to generate rules based on gathered system data from logs. The rules are used to make decisions of corrective action and hence carry out the self-healing mechanism.