International Conference and Workshop on Emerging Trends in Technology |
Foundation of Computer Science USA |
ICWET - Number 4 |
None 2011 |
Authors: Kavita Jain, Varsha Wangikar, Seema Shah |
a4e6d9ed-31fb-4b9e-bf05-be8ca5f43241 |
Kavita Jain, Varsha Wangikar, Seema Shah . FAR: Dynamic Replication Strategy for Data Grid. International Conference and Workshop on Emerging Trends in Technology. ICWET, 4 (None 2011), 21-26.
Grid computing is gradually emerging as a new paradigm for next-generation computing. It enables sharing, selection, and aggregation of geographically distributed homogeneous and heterogeneous resources for solving large-scale problems in science, engineering, and commerce. In most of the organizations, there are large amounts of underutilized computing power and storage existing. Most desktop machineâs CPU and storage space usage is very less. On the other hand many high performance applications require large amount of computational power and storage space. Grid computing provides a framework for exploiting these underutilized resources and thus increases the efficiency of resource usage. Now a days, many commercial, business and research institutes produce huge amount of data and need to store this data on secondary storage of machines. Users of data are distributed among different geographical boundaries and they want to collaborate on the same problem. Data grids focus on providing secure access to distributed, heterogeneous pools of data. Data grids harness data, storage, and network resources located in distinct administrative domains, and provide high speed and reliable access to data. Optimization of data access can be achieved via data replication, whereby identical copies of data are generated and stored at various sites. A good replication strategy should ideally minimize mean execution time of all jobson Grid; reduce access time while optimizing resources. Hence in this paper we have focused on improving data grid performance. We have first presented a detailed analysis of various replication strategies like No replication, Always replication and the Economic model simulated by OptorSim. Next we have proposed a dynamic replication strategy which switches between No replication and Always replication based on file size and type of access. We argue that the proposed file size and type of access based replication algorithm will minimize the access latencies and execution time of jobs on Grid. In the next phase we shall proceed with implementing algorithm on simulator and observing the performance of the implemented algorithm.