CFP last date
20 January 2025
Reseach Article

A New Proactive Fault Tolerant Approach for Scheduling in Computational Grid

Published on November 2011 by P.Keerthika, Dr.N.Kasthuri
International Conference on Web Services Computing
Foundation of Computer Science USA
ICWSC - Number 1
November 2011
Authors: P.Keerthika, Dr.N.Kasthuri
6f58223e-d54f-472c-b33d-c5f3a028b0d2

P.Keerthika, Dr.N.Kasthuri . A New Proactive Fault Tolerant Approach for Scheduling in Computational Grid. International Conference on Web Services Computing. ICWSC, 1 (November 2011), 55-59.

@article{
author = { P.Keerthika, Dr.N.Kasthuri },
title = { A New Proactive Fault Tolerant Approach for Scheduling in Computational Grid },
journal = { International Conference on Web Services Computing },
issue_date = { November 2011 },
volume = { ICWSC },
number = { 1 },
month = { November },
year = { 2011 },
issn = 0975-8887,
pages = { 55-59 },
numpages = 5,
url = { /proceedings/icwsc/number1/3978-wsc011/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 International Conference on Web Services Computing
%A P.Keerthika
%A Dr.N.Kasthuri
%T A New Proactive Fault Tolerant Approach for Scheduling in Computational Grid
%J International Conference on Web Services Computing
%@ 0975-8887
%V ICWSC
%N 1
%P 55-59
%D 2011
%I International Journal of Computer Applications
Abstract

Grid Computing provides non-trivial services to users and aggregates the power of widely distributed resources. Computational grids solve large scale scientific problems using distributed heterogeneous resources. The Grid Scheduler must select proper resources for executing the tasks with less response time and without missing the deadline. There are various reasons such as network failure, overloaded resource conditions, or non-availability of required software components for execution failure. Thus, fault-tolerant systems should be able to identify and handle failures and support reliable execution in the presence of failures. Hence the integration of fault tolerance measures with scheduling gains much importance. In this paper, a new fault tolerance based scheduling approach for scheduling statically available meta tasks is proposed wherein failure rate and the fitness value are calculated. The performance of the fault tolerant scheduling policy is compared with a non-fault tolerant scheduling policy and the results shows that the proposed policy performs better with less TTR in the presence of failures. The number of tasks successfully completed is also more when compared to the non-fault tolerant scheduling policy.

References
  1. N.Malarvizhi, Dr.V.Rhymend Uthariaraj. (2009): A Minimum Time To Release Job Scheduling Algorithm in Computational Grid Environment, IEEE Fifth International Joint Conference on INC, IMS, IDC.
  2. Benoit Anne, Cole Murray, Gilmore Stephen and Hillston Jane. (2005): Enhancing the effective utilization of Grid clusters by exploiting on-line performability analysis, IEEE International symposium on Cluster Computing and the Grid (CCGRID), pp. 317-324.
  3. Buyya. R, Murshed. M, Abramson. D. (2002): A deadline and budget constrained cost-time optimization algorithm for Scheduling task farming applications on global grids, In Proceedings of the international conference on parallel and distributed processing techniques and applications, Las Vegas, USA, pp. 24–27.
  4. Q. Zheng, B. Veeravalli, and C. Tham.(2007): Fault-tolerant Scheduling for Differentiated Classes of Tasks with Low Replication Cost in Computational Grids, ACM, HPDC’07, June 25–29, 2007, Monterey, California, USA.
  5. He X , Sun, X., Laszewski, G.V., (2003). Qos guided min-min heuristic for grid task scheduling, Journal of Computer Science and Technology 18, 442-451.
  6. H.Lee, D.Park, M.Hong, Sang-Soo Yeo, SooKyun Kim, SungHoon Kim, (2009): A Resource Management System for Fault Tolerance in Grid Computing, IEEE International Conference on Computational Science and Engineering, DOI 10.1109/CSE.2009.257.
  7. Ivan Rodero, Francesc Guim, Julita Corbalan, 2009, Evaluation of Coordinated Grid Scheduling Strategies, 11th IEEE International Conference on High Performance Computing and Communications, DOI 10.1109/HPCC.2009.28.
  8. A. Bouteiller, P.Lemarinier, G.Krawezik, F.Cappello, Coordinated checkpoint versus message log for fault tolerant MPI, IEEE International Conference on Cluster Computing (Cluster 2003). IEEE CS Press, December 2003.
  9. R. L. Graham, S.-E. Choi, D. J. Daniel, N. N. D. nd Ronald G. Minnich, C. E. Rasmussen, L. D. Risinger, and M. W. Sukalski, “A network failure-tolerant message-passing system for terascale clusters,” in International Conference on Supercomputing(ICS’02). New York City, NY, USA: ACM, June 2002, pp. 77–83.
  10. B. Schroeder, and G. Gibson, “A Large Scale Study of Failures in Highperformance-Computing Systems,” International Symposium on Dependable Systems and Networks, 2006.
  11. Gopi Kandaswamy, Anirban Mandal, and Daniel A. Reed, “Fault Tolerance and Recovery of Scientific Workflows on Computational Grids”
  12. VahidModiri, Morteza Analoui and Sam Jabbehdari, Fault tolerance in grid using Ant colony optimization and Directed acyclic graph, (2011), International Journal of Grid Computing & Applications (IJGCA) Vol.2, No.1. DOI: 10.5121/ijgca.2011.2102
  13. S.ThamaraiSelvi, Ponsy R.K.SathiaBhama, S.Architha, T.Kaarunya and K.Vinothini, (2010) “Scheduling inVirtualized Grid Environment Using Hybrid Approach” International Journal of Grid Computing & Applications (IJGCA) Vol.1, No.1.
  14. Ritu Garg, Awadhesh Kumar Singh, (2011), “Fault Tolerance in grid computing: state of the art and open issues”, International Journal of Computer Science & Engineering Survey (IJCSES) Vol.2, No.1. DOI : 10.5121/ijcses.2011.2107
  15. K Limaye, B. Leangsuksun, Z. Greenwood, S. L. Scott, C. Engelmann, R. Libby and K. Chanchio, (2005), “Job-Site Level Fault Tolerance for Cluster and Grid environments” In Proceedings of the IEEE international conference on cluster computing, , pp. 1-9.
  16. R. Medeiros, W. Cirne, F. Brasileiro, J. Sauve, (2003), “Faults in grids: why are they so bad and what can be done about it?” In proceedings of the 4th international workshop, pp 18–24.
  17. J. H. Abawajy,(2004), “Fault-tolerant scheduling policy for grid computing systems”, In Proceedings of the International Parallel and Distributed Processing Symposium, IEEE Computer Society, Los Alamitos, United States, pp.3289–3295.
  18. J. Weissman and D. Womack,(1996), Fault Tolerant Scheduling in Distributed Networks. Technical Report TR CS-96-10, Department of Computer Science, University of Texas, San Antonio.
  19. Gosia WrzesinNska, Rob V. van Nieuwpoort, Jason Maassen, Thilo Kielmann, Henri E. Bal, (2006), Fault-Tolerant Scheduling Of Fine-Grained Tasks In Grid Environments, The International Journal of High Performance Computing Applications,Volume 20, No. 1, Spring 2006, pp. 103–114,DOI: 10.1177/1094342006062528, SAGE Publications.
  20. Meenakshi Bheevgade, Manik Mujumdar, Dr. Rajendra Patrikar, Latesh Malik, (2008), Achieving Fault Tolerance in Grid Computing System, Proceedings of 2nd National Conference on Challenges & Opportunities in Information Technology (COIT-2008).
  21. Sameer Singh Chauhan, R. C. Joshi, (2010), QoS Guided Heuristic Algorithms for Grid Task Scheduling, International Journal of Computer Applications (0975 – 8887),Volume 2 – No.9
  22. Leyli Mohammad Khanli, Maryam Etminan Far, Ali Ghaffari , (2010), Reliable Job Scheduler using RFOH in Grid Computing , Journal of Emerging Trends in Computing and Information Sciences.
Index Terms

Computer Science
Information Sciences

Keywords

Fault tolerance Failure rate Grid scheduling Meta task