CFP last date
20 December 2024
Reseach Article

Article:Enhancment of Grid Scheduling using Dyanamic Error Detection and Fault Tolerance

by B.Radha, Dr.V. Sumathy
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 31 - Number 7
Year of Publication: 2011
Authors: B.Radha, Dr.V. Sumathy
10.5120/3839-5340

B.Radha, Dr.V. Sumathy . Article:Enhancment of Grid Scheduling using Dyanamic Error Detection and Fault Tolerance. International Journal of Computer Applications. 31, 7 ( October 2011), 36-45. DOI=10.5120/3839-5340

@article{ 10.5120/3839-5340,
author = { B.Radha, Dr.V. Sumathy },
title = { Article:Enhancment of Grid Scheduling using Dyanamic Error Detection and Fault Tolerance },
journal = { International Journal of Computer Applications },
issue_date = { October 2011 },
volume = { 31 },
number = { 7 },
month = { October },
year = { 2011 },
issn = { 0975-8887 },
pages = { 36-45 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume31/number7/3839-5340/ },
doi = { 10.5120/3839-5340 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:17:32.014202+05:30
%A B.Radha
%A Dr.V. Sumathy
%T Article:Enhancment of Grid Scheduling using Dyanamic Error Detection and Fault Tolerance
%J International Journal of Computer Applications
%@ 0975-8887
%V 31
%N 7
%P 36-45
%D 2011
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The computational grid solve most of the problems that arise in many scientific application with the help of the heterogeneous resources which is spread across the distributed environment .The challenges that arise in such case of utilization of the resources and scheduling of jobs can be overcome by the techniques of error detection mechanisms .The early error detection mechanism collects the entire information about the resources which are available in the heterogeneous distributed environment. The resource information can be used during the allocation of jobs to that resources so that the job gets executed successfully without any failure in the resource. But the error detection mechanism also has its own drawbacks like the remote host server may be down ,file transfer services may not supported by the host ,there may be any malfunctionality in the service protocols and the hardware failure which occurs during data transfer also cannot be tackled in error rectification .To avoid this we introduce fault tolerance mechanism to overcome the difficulty.

References
  1. Thain.D and Livny.M Error scope on a computational grid: Theory and practice. In Proceedings of the 11th IEEE Symposium on High Performance Distributed Computing (HPDC’02), pages 199–208. IEEE Computer Society, 2002
  2. Kosar.T and Livny.M Stork: Making Data Placement a First Class Citizen in the Grid. In International Conference on Distributed Computing Systems, March 2004.
  3. Kosar.T and Balman.M A new paradigm: Data-aware scheduling in grid computing. Future Generation Computer Systems, In Press, DOI: 10.1016/j.future.2008.09.006.
  4. Balman.M and Kosar.T Early error Detection and classification in Dat Transfer Scheduling .International Conference on Complex ,Intellignet and software Intensive Systems ,IEEE ,2009
  5. Condor Project.http://www.cs.wisc.edu/condor/
  6. G.kola, T.Kosar and M.Livny. Phoenix: Making Data intensive Grid Applications Fault tolerant. In 5th IEEE/ACM International Workshop on Grid Computing, 2004
  7. Radha.B and Sumathy.V Comparison of ACO and PSO in Grid Job Scheduling .CIIT International journal of networking and communication Engineering print: ISSN 0974 – 9713 & Online: ISSN 0974 – 9616 DOI: NCE102009003
  8. Cieslak, D. Chawla N., and Thain D.. Troubleshooting Thousands of Jobs on Production Grids Using Data Mining techniques. IEEE Grid Computing, September 2008.
  9. Paul Townend, Jie Xu, Fault tolerance within a grid environment, As part of the e-Demand project at the University of Durham, DH1 3LE, United Kingdom, 2003.
  10. Greg Bronevetsky, Rohit Fernandes, Daniel Marques, Keshav Pingali, Paul Stodghill, Recent advances in checkpoint/recovery systems, in: Workshop on NSF Next Generation Software held in conjunction with the 2006 IEEE International Parallel & Distributed Processing Symposium, April, 2006.
  11. D.Thain and M.Livny.Error scope on a computational grid:Theory andpractice .In proceedings of th 11th IEEE Symposium on High Performance Distributed Computing (HDPC’02) ,Pages 199-208.IEEE Computer Society,2002.
Index Terms

Computer Science
Information Sciences

Keywords

Distributed systems data aware scheduling Error Detection Fault tolerance Grid computing performance of systems Scheduling