CFP last date
20 December 2024
Reseach Article

Quality metrics Validation in View Maintenance Models of Data Warehouse

by Anjana Gosain, Sangeeta Sabharwal, Rolly Gupta
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 178 - Number 11
Year of Publication: 2019
Authors: Anjana Gosain, Sangeeta Sabharwal, Rolly Gupta
10.5120/ijca2019918844

Anjana Gosain, Sangeeta Sabharwal, Rolly Gupta . Quality metrics Validation in View Maintenance Models of Data Warehouse. International Journal of Computer Applications. 178, 11 ( May 2019), 36-42. DOI=10.5120/ijca2019918844

@article{ 10.5120/ijca2019918844,
author = { Anjana Gosain, Sangeeta Sabharwal, Rolly Gupta },
title = { Quality metrics Validation in View Maintenance Models of Data Warehouse },
journal = { International Journal of Computer Applications },
issue_date = { May 2019 },
volume = { 178 },
number = { 11 },
month = { May },
year = { 2019 },
issn = { 0975-8887 },
pages = { 36-42 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume178/number11/30577-2019918844/ },
doi = { 10.5120/ijca2019918844 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T00:50:07.510268+05:30
%A Anjana Gosain
%A Sangeeta Sabharwal
%A Rolly Gupta
%T Quality metrics Validation in View Maintenance Models of Data Warehouse
%J International Journal of Computer Applications
%@ 0975-8887
%V 178
%N 11
%P 36-42
%D 2019
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Data warehouses are huge repositories designed to enable the knowledge workers to make better and faster decisions. Due to its significance in strategic decision making, there is a need to assure data warehouse quality in the presence of evolution events which may be generated as result of change in schema / software or data warehouse requirements. One of the factors affecting the data warehouse quality is view maintenance models quality. Although there are some useful guidelines for designing good view maintenance models, but objective indicators, i.e., metrics are needed to help designers to develop quality view maintenance models. In our previous work, a quality metric for View maintenance models of data warehouse is proposed [25] However, the proposal overall lacks theoretical and empirical validation of the metric proposed. Hence, the metric practical utility could not be established. This paper validates the metrics both theoretically and empirically. The theoretical validation is performed using Zuse framework [7] while empirical validation is carried out using MVPP (Multiple View Processing Plan) to explore the relationship between the proposed metrics and cost efficiency of View maintenance models. The results show that all the four metrics NBR, NVM, NAMV and NFMV have significant impact on the cost efficiency of View maintenance models.

References
  1. Inmon, W.H., Building the Data Warehouse. John Wiley, 1992.
  2. Bellahsene, Z.: Schema evolution in data warehouses. Knowl. and Inf. Syst. 4(2) (2002)
  3. Kimball, R. The Data Warehouse Toolkit. John Wiley, 1996.
  4. Blaschka, M., Sapia, C., Höfling, G.: On Schema Evolution in Multidimensional Databases. In: Mohania, M., Tjoa, A.M. (eds.) DaWaK 1999. LNCS, vol. 1676. Springer, Heidelberg (1999)
  5. Fan, H., Poulovassilis, A.: Schema Evolution in Data Warehousing Environments – A Schema Transformation-Based Approach. In: Atzeni, P., Chu, W., Lu, H., Zhou, S., Ling, T.-W. (eds.) ER 2004. LNCS, vol. 3288. Springer, Heidelberg (2004)
  6. Favre, C., Bentayeb, F., Boussaid, O.: Evolution of Data Warehouses’ Optimization: A Workload Perspective. In: Song, I.-Y., Eder, J., Nguyen, T.M. (eds.) DaWaK 2007. LNCS, vol. 4654. Springer, Heidelberg (2007)
  7. Zuse, H. A.: ‘Framework of Software measurement’, Walter de Gruyter, Berlin, 1998.
  8. Gupta, A., Mumick, I.S., Rao, J., Ross, K.: Adapting materialized views after redefinitions: Techniques and a performance study. Information Systems (26) (2001)
  9. Gupta, A., I.S. Mumick, “Maintenance of Materialized Views: Problems, Techniques, and Applications.” Data Eng. Bulletin, Vol. 18, No. 2, June 1995.
  10. Nica, A., Lee, A.J., Rundensteiner, E.A.: The CSV algorithm for view synchronization in evolvable large-scale information systems. In: Schek, H.-J., Saltor, F., Ramos, I.,
  11. Alonso, G. (eds.) EDBT 1998. LNCS, vol. 1377. Springer, Heidelberg (1998)
  12. Jian Yang, Kamalakar Karlapalem, Qing Li, “A Framework for Designing Materialized Views in Data Warehousing Environment “Proceedings of the 17th International Conference on Distributed Computing Systems (ICDCS '97), IEEE 1997
  13. Yousry Taha, Arsany S. Sawiros, Noha Adly, “an efficient data warehousing framework” computing - Technology and engineering, e-Publisher: CiteSeerX , 2009.
  14. Miranda Chan, Hong Va Leong, Antonio Si, “Incremental Update to Aggregated Information for Data Warehouses over Internet” DOLAP 2000 ACM, ISBN 1-58113-323-5/00/0011
  15. José A. Rodero, José A.Toval, Mario G. Piattini, “The audit of the Data Warehouse Framework” Proceedings of the International Workshop on Design and Management of Data Warehouses (DMDW'99) Heidelberg, Germany, 14. - 15. 6. 1999
  16. M. Golfarelli, S. Rizzi, “A Methodological Framework for Data Warehouse Design”, Proceedings of First International Workshop on Data Warehousing and OLAP (DOLAP, in connection with CIKM'98), Washington, D.C., USA, November 1998.
  17. Darja Solodovnikova and Laila Niedrite,” Evolution-Oriented User-Centric Data Warehouse”, Proceedings of the 19th International Conference on Information Systems Development by Springer, 2010
  18. C. Quix. “Repository Support for Data Warehouse Evolution”. In Proc. of the Intl. Workshop DMDW, Heidelberg, Germany (1999)
  19. Anisoara Nica, Elke A. Rundensteiner,” Using Containment Information for View Evolution in Dynamic Distributed Environments” DEXA '98 Proceedings of the 9th International Workshop on Database and Expert Systems Applications ,Page 212 , IEEE Computer Society Washington, DC, USA1998
  20. Dragan Sahpaski, Goran Velinov, Boro Jakimovski, Margita Kon-Popovska,” Dynamic Evolution and Improvement of Data Warehouse Design” Fourth Balkan Conference in Informatics, IEEE,2009
  21. Claudine Bréant, Gérald Thurler, François Borst, Antoine Geissbuhler, “Design of a Multi Dimensional Database for the Archimed DataWarehouse” Connecting Medical Informatics and Bio-Informatics R. Engelbrecht et al. (Eds.) ENMI, 2005
  22. George Papastefanatos, Panos Vassiliadis, Alkis Simitsis, Yannis Vassiliou,” Design Metrics for Data Warehouse Evolution” ER 2008, LNCS 5231, pp. 440–454, 2008.
  23. Matthias Jarke, Christoph Quix, Diego Calvanese, Maurizio Lenzerini, Enrico Franconi, Spyros Ligoudistianos, Panos Vassiliadis, Yannis Vassiliou, “ Concept Based Design of Data Warehouses: The DWQ Demonstrators”. SIGMOD Conference 2000: 591
  24. David Botzer, Opher Etzion, "Optimization of Materialization Strategies for Derived Data Elements," IEEE Transactions on Knowledge and Data Engineering, vol. 8, no. 2, pp. 260-272, April, 1996.
  25. Golfarelli, M., Lechtenbörger, J., Rizzi, S., Vossen, G.: Schema versioning in data warehouses: Enabling cross-version querying via schema augmentation. Data Knowl. Eng. 59(2), 435–459 (2006).
  26. Gosain A., Sabharwal S., Gupta R., “Quality Metrics for View Maintenance Models of Data Warehouse”, ERCICA 2014, Bangalore, India.
  27. Gosain A., Sabharwal S., Gupta R., “An Efficient Feature Selection Approach for Materialized Views” IEEE, ICCCCM2013, Allahabad, India.
  28. Zuse,H.: Properties of software measures, Software Quality Journal, 1992, 1, pp. 225- 260.
  29. Darja Solodovnikova and Laila Niedrite,” Evolution-Oriented User-Centric Data Warehouse”, Proceedings of the 19th International Conference on Information Systems Development by Springer, 2010
  30. Dimitri Theodoratos, Mokrane Bouzeghoub,” A General Framework for the View Selection Problem for Data Warehouse Design and Evolution” DOLAP '00 Proceedings of the 3rd ACM international workshop on Data warehousing and OLAP, Pages 1 – 8, ACM New York, NY, USA ©2000
  31. M. Blaschka. “FIESTA: A Framework for Schema Evolution in Multidimensional Information Systems”. In 6thCAiSE Doctoral Consortium, Heidelberg, 1999.
  32. C. Quix. “Repository Support for Data Warehouse Evolution”. In Proc. of the Intl. Workshop DMDW, Heidelberg, Germany (1999)
  33. Anisoara Nica, Elke A. Rundensteiner,” Using Containment Information for View Evolution in Dynamic Distributed Environments” DEXA '98 Proceedings of the 9th International Workshop on Database and Expert Systems Applications ,Page 212 , IEEE Computer Society Washington, DC, USA1998
  34. Mahesh B. Chaudhari, Suzanne W. Dietrich, “A Distributed Event Stream Processing Framework for Materialized Views over Heterogeneous Data Sources”, VLDB 2010, Singapore.
  35. Ericka-Janet Rechy-Ram´ırez , Edgard Ben´ıtez-Guerrero,” A Model and Language for Bitemporal Schema Versioning in DataWarehouses” Proceedings of the 15th International Conference on Computing (CIC'06), IEEE2006.
  36. S. Chen, X. Zhang, E.A. Rundensteiner. “A Compensation-based Approach for Materialized View Maintenance in Distributed Environments”. In Computer Science Technical Report, Worcester Polytechnic Institute, Worcester, MA, USA (2004)
  37. Chuan Zhang, Jian Yang,” Materialized View Evolution Support in DataWarehouse Environment” Sixth International Conference on Database Systems for Advanced Applications (DASFAA’99), 1999.
  38. Dragan Sahpaski, Goran Velinov, Boro Jakimovski, Margita Kon-Popovska,” Dynamic Evolution and Improvement of Data Warehouse Design” Fourth Balkan Conference in Informatics, IEEE,2009
  39. Amy J. Lee, Anisoara Nica, Elke A. Rundensteiner,” The EVE Approach: View Synchronization in Dynamic Distributed Environments”, ieee transactions on knowledge and data engineering, vol. 14, no. 5, september/october 2002
  40. Robert M. Bruckner, Tok Wang Ling, Oscar Mangisengi, A Min Tjoa,” A Framework for a Multidimensional OLAP Model using Topic Maps” IEEE 2002.
  41. Xin Zhang, Elke A. Rundensteiner,” The SDCC Framework For Integrating Existing Algorithms for Diverse Data Warehouse Maintenance Tasks” Database Engineering and Applications, 1999. IDEAS '99. International Symposium Proceedings , Aug 1999 Page 206 - 214
  42. PAN Ding, PAN Yunshan,” Metadata Versioning for DW2.0 Architecture” Proceedings of the 29th Chinese Control Conference July 29-31, 2010, Beijing, China
  43. C´ecile Favre, Fadila Bentayeb, and Omar Boussaid, “Evolution of Data Warehouses’ Optimization: A Workload Perspective” DaWaK 2007, LNCS 4654, pp. 13–22, 2007.
  44. Bartosz Bebel, Zbyszko Królikowski, and Robert Wrembel, “Managing Evolution of Data Warehouses by Means of Nested Transactions”, ADVIS 2006, LNCS 4243, pp. 119–128, 2006
  45. J. A. Nasir, M. Khurram Shahzad, “Architecture for Virtualization in Data Warehouse” Innovations and Advanced Techniques in Computer and Information Sciences and Engineering, 243–248. 2007 Springer.
  46. M.K. Shahzad, J.A. Nasir, M.A. Pasha. “CEV-DW: Creation and Evolution of Versions in Data Warehouse”. In Asian Journal of Information Technology, 4(10) (2005) 910-917
  47. E. Ben´ıtez-Guerrero, C. Collet, and M. Adiba. “The WHES Approach to Data Warehouse Evolution”. Digital Journal e-Gnosis [online], http://www.e-gnosis.udg.mx, ISSN No. 1665-5745, 2003.
  48. Joseph M. Firestone,” Architectural Evolution in DataWarehousing and Distributed Knowledge Management Architecture” White Paper No. Eleven July 1, 1998.
  49. George Papastefanatos, Panos Vassiliadis, Alkis Simitsis, Yannis Vassiliou,” Design Metrics for Data Warehouse Evolution” ER 2008, LNCS 5231, pp. 440–454, 2008.
  50. George Papastefanatos, Panos Vassiliadis, Alkis Simitsis, Konstantinos Aggistalis, Fotini Pechlivani, Yannis Vassiliou, “language extensions for the automation of database schema evolution” iceis (1) 2008: 74-81.
  51. B. Ashadevi, Dr. P. Navaneetham,” A Framework for the View Selection Problem in Data Warehousing Environment” International Journal on Computer Science and Engineering Vol. 02, No. 09, 2010, 2820-2826
  52. B. BE˛BEL, Z. KRÓLIKOWSKI, R. WREMBEL,” Formal approach to modelling a multiversion data warehouse” bulletin of the polish academy of sciences technical sciences vol. 54, no. 1, 2006
  53. Garima Thakur, Anjana Gosain,” DWEVOLVE: A Requirement Based Framework for Data Warehouse Evolution” ACM SIGSOFT Software Engineering Notes, Page 1, November 2011 Volume 36, No.6.
  54. Resmi Nair, Campbell Wilson, Bala Srinivasan,” A Conceptual Query-Driven Design Framework for Data Warehouse” , World Academy of Science, Engineering and Technology 25 , 2007.
  55. Solodovņikova D. “The Formal Model for Multiversion Data Warehouse Evolution”, Postconference proceedings of the 8th International Baltic Conference on Databases and Information Systems, Frontiers in Artificial Intelligence and Applications, IOS Press, 2008.
  56. D. Agrawal, A. El Abbadi, A. Singh, T. Yurek., “Efficient View Maintenance in Data Warehouses”. In Proceedings of the 1997 ACM International Conference on Management of Data, pages 417-427, May 1997.
  57. Briand, L.C., Morasca, S., Basili, V.R.: ‘Property based software engineering measurement’, IEEE Trans. Softw. Eng., 1996, 22, pp. 68–86.
  58. H. Gupta, ‘Selection of views to materialize in a data warehouse’, ICDT'97, Springer 1997.
  59. George Papastefanatos, Panos Vassiliadis, Alkis Simitsis, Yannis Vassiliou, ‘Metrics for the Prediction of Evolution Impact in ETL Ecosystems: A Case Study’ Springer-Verlag 2012.
Index Terms

Computer Science
Information Sciences

Keywords

Data Warehouse Data Warehouse Evolution View maintenance models.