CFP last date
20 February 2025
Reseach Article

Recovery of Failures in Transaction Oriented Composite Grid Service

Published on December 2013 by Dharmendra Prasad Mahato, Lokendra Singh Umrao, Ravi Shankar Singh
2nd International conference on Computing Communication and Sensor Network 2013
Foundation of Computer Science USA
CCSN2013 - Number 2
December 2013
Authors: Dharmendra Prasad Mahato, Lokendra Singh Umrao, Ravi Shankar Singh

Dharmendra Prasad Mahato, Lokendra Singh Umrao, Ravi Shankar Singh . Recovery of Failures in Transaction Oriented Composite Grid Service. 2nd International conference on Computing Communication and Sensor Network 2013. CCSN2013, 2 (December 2013), 38-42.

@article{
author = { Dharmendra Prasad Mahato, Lokendra Singh Umrao, Ravi Shankar Singh },
title = { Recovery of Failures in Transaction Oriented Composite Grid Service },
journal = { 2nd International conference on Computing Communication and Sensor Network 2013 },
issue_date = { December 2013 },
volume = { CCSN2013 },
number = { 2 },
month = { December },
year = { 2013 },
issn = 0975-8887,
pages = { 38-42 },
numpages = 5,
url = { /proceedings/ccsn2013/number2/14767-1335/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 2nd International conference on Computing Communication and Sensor Network 2013
%A Dharmendra Prasad Mahato
%A Lokendra Singh Umrao
%A Ravi Shankar Singh
%T Recovery of Failures in Transaction Oriented Composite Grid Service
%J 2nd International conference on Computing Communication and Sensor Network 2013
%@ 0975-8887
%V CCSN2013
%N 2
%P 38-42
%D 2013
%I International Journal of Computer Applications
Abstract

Transaction Oriented Composite Grid service is a group of sub services to be executed in Grid environment when transaction management is used. Since Grid services are loosely coupled and dynamic in nature, the transaction management becomes tough task in this environment. As the number of services increase, the chances of failures also increase due to different types of faults occurring in the system. Therefore fault tolerant execution of these tasks is required to maintain the reliability, availability, dependability of the system. In this paper we have implemented coordinated check-pointing approach to tolerate the faults so that resiliency, reliability, availability, and dependability can be enhanced. For recovery of the failed processes we have compared both local node recovery and replicated node recovery by simulating in CPN tool. Here we have considered three types of faults such as hardware faults, communication link faults, and software faults. All the faults have been modelled dynamically in the simulation. The results show that the local node recovery is better than replicated node recovery when the number of services is minimum but in the case of large number of services the replicated node recovery works better. Our results show that using local node recovery we can decrease the failures by 38. 86% and when we use replicated nodes recovery we get that results decreasing by 31. 34%.

References
  1. Guo, Suchang, Hong-Zhong Huang, and Yu Liu. "Modeling and Analysis of Grid Service Reliability Considering Fault Recovery. " New Generation Computing 29. 4 (2011): 345-364.
  2. Bohm, Matthias and Habich, Dirk and Lehner, Wolfgang and Wloka, Uwe "An advanced transaction model for recovery processing of inte-gration processes", ADBIS (local proceedings), pp. 90–105, 2008.
  3. Bolosky, W. J. , Douceur, J. R. , Ely, D. and Theimer, M. , "Feasibility of a Serverless Distributed File System Deployed on an Existing Set of Desktop PCs", in Proc. of the ACM International Conference on Measurement and Modeling of Computer Systems 2000, ACM Press, pp. 34-43, 2000. M. R. Yudith Cardinale, "Fault tolerant execution of transactional composite web services: An approach," Services Computing, IEEE Transactions on, vol. 5, pp. 158 –164, jan. -april 2011.
  4. Affaan, M. and Ansari, M. A. , Grid and Cooperative Computing, 2006. GCC 2006. Fifth International Conference, "Distributed Fault Management for Computational Grids", 2006, pp. 363–368.
  5. Dai, Y. S. , Levitin, G. and Wang, X. L. , "Optimal Task Partition and Distribution in Grid Service System with Common Cause Failures", Future Generation Computer Systems, 23, 2, pp. 209-218, 2007.
  6. Yuan-Shun Dai and Levitin, G. , Reliability, IEEE Transactions on, "Reliability and performance of tree-structured grid services", 2006, 55, 2, pp. 337-349.
  7. Dai, Y. S. , Pan, Y. and Zou, X. K. , "A Hierarchical Modeling and Analysis for Grid Service Reliability", IEEE Transactions on Coers, 56, 5, pp. 681-691, 2007.
  8. Dai, Y. S. , Xie, M. and Poh, K. L. , "Reliability of Grid Service Systems", Computers and Industrial Engineering, 50, 1, pp. 130-147, 2006.
  9. Jin Liang and Tong WeiQin and Tang JianQuan and Wang Bo, Industrial Informatics, 2003. INDIN 2003. Proceedings. IEEE International Conference on, "A fault-tolerance mechanism in grid", 2003, pp. 457–461.
  10. Foster, I. , "The Grid: a New Infrastructure for 21st Century Science", Physics Today, 55, 2, pp. 42-47, 2002.
  11. Foster, Ian, "The grid: A new infrastructure for 21st century science", Grid Computing: Making the Global Infrastructure a Reality, pp. 51–63, 2003, John Wiley & Sons, Chichester.
  12. Shi, Xuanhua and Pazat, Jean-Louis and Rodriguez, Eric and Jin, Hai and Jiang, Hongbo, "Adapting grid applications to safety using fault-tolerant methods: Design, implementation and evaluations", Future Gener. Comput. Syst. , February, 2010, 26, 2, 2010, 0167-739X, pp. 236– 244, 9, acmid:1630314, Elsevier Science Publishers B. V. , Amsterdam, The Netherlands, The Netherlands.
  13. Hwang, Soonwook and Kesselman, Carl, Journal of Grid Computing, 1, 3, "A Flexible Framework for Fault Tolerance in the Grid", Kluwer Academic Publishers, pp. 251–272, English2003.
  14. Kovcs, Jzsef and Kacsuk, Pter, Grid Computing, 3165, Lecture Notes in Computer Science, Dikaiakos, MariosD. , "A Migration Framework for Executing Parallel Programs in the Grid", Springer Berlin Heidelberg, pp. 80–89.
  15. Bubak, Marian and Funika, Wdzimierz and Bali, Bartosz and Wismller, Roland, Parallel Processing and Applied Mathematics, 2328, Lecture Notes in Computer Science, Wyrzykowski, Roman and Dongarra, Jack and Paprzycki, Marcin and Waniewski, Jerzy, "A Concept of Grid Application Monitoring", Springer Berlin Heidelberg, pp. 307–314, En-glish2002.
  16. Foster, I. and Kesselman, C. and Nick, J. M. and Tuecke, S. , Computer, "Grid services for distributed system integration", 2002, 35, 6, pp. 37–46, 0018–9162,1997.
  17. Abdelsalam Heddaya and Abdelsalam Helal, "Reliability, Availability,Dependability and Performability: A User-centered View", 1997.
  18. Mache, Jens, "Hands-on grid computing with Globus Toolkit 4", J. Com-put. Sci. Coll. , December 2006, 22, 2, pp. 99–100, 2, acmid:1181921, Consortium for Computing Sciences in Colleges, USA.
  19. An Liu and Qing Li and Liusheng Huang and Mingjun Xiao, "FACTS: A Framework for Fault-Tolerant Composition of Transactional Web Services", IEEE Transactions on Services Computing, 3, 1, 1939-1374, 2010, pp. 46-59 ,IEEE Computer Society, Los Alamitos, CA, USA.
  20. Cardinale, Yudith and Rukoz, Marta, "Fault Tolerant Execution of Transactional CompositeWeb Services: An Approach", UBICOMM 2011, The Fifth International Conference on Mobile Ubiquitous Computing, Systems, Services and Technologies, pp. 158–164, 2011.
  21. Chen, Jun and Gu, Yuesheng and Liu, Yanpei, Grid Service Concurrency Control Protocol, Journal of Networks, 7, 4, 707–714, 2012. Acmid: 1692817, Springer-Verlag, Berlin, Heidelberg.
  22. Haider, Sajjad and Ansari, Naveed Riaz and Akbar, Muhammad and Perwez, Mohammad Raza and Ghori, Khawaja Moyeez Ullah, "Fault Tolerance in Distributed Paradigms", Proc. of Fifth International Conference on Computer Communication and Management, IACSIT Press, Singapore, 2011.
  23. Wang, Dexiang and Kumar, Arvindhan and Sivakumar, Madhan and McNair, Janise Y. , "A fault-tolerant backbone network architecture targeting time-critical communication for avionic WDM LANs", Proceedings of the 2009 IEEE international conference on Communications, ICC'09, 2009, 978-1-4244-3434-3, Dresden, Germany, pp. 2596–2600, 5, acmid:1817752, IEEE Press, Piscataway, NJ, USA.
  24. Lopes, Rafael Fernandes and da Silva e Silva, Francisco Jose, "Fault tolerance in a mobile agent based computational grid", Cluster Computin and the Grid, 2006. CCGRID 06. Sixth IEEE International Symposium on, 2, pp. 8–pp, 2006, IEEE.
  25. Kov´acs, J´ozsef and Kacsuk, Peter and Januszewski, Radoslaw and Jankowski, Gracjan, "Application and middleware transparent checkpointing with TCKPT on ClusterGrids", Future Generation Computer Systems, 26, 3, pp. 498–503, 2010, Elsevier.
  26. Krpska, Elbieta and Kielmann, Thilo and Sirvent, Ral and Badia, RosaM. , Achievements in European Research on Grid Systems, Gorlatch, Sergei and Bubak, Marian and Priol, Thierry, "A Service for Reliable Execution of Grid Applications", Springer US, pp. 179–192,2008.
  27. Lai, Hong Feng. "Modeling grid workflow by coloured grid service net. " Advances in Grid and Pervasive Computing. Springer Berlin Heidelberg, 2010. 204-213.
  28. Ma, Hua. "An Approach on Grid Services Transaction Management for Grid Workflow. " Information Engineering and Computer Science, 2009. ICIECS 2009. International Conference on. IEEE, 2009.
Index Terms

Computer Science
Information Sciences

Keywords

Transaction Management Fault Tolerance Reliability Availability Resiliency Dependability Cpn (colored Petri Nets) Tool Local Recovery Replicated Recovery.