CFP last date
20 December 2024
Reseach Article

Parallelizing Apriori on Dual Core using OpenMP

by Anuradha.t, Satya Pasad R, S. N. Tirumalarao
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 43 - Number 24
Year of Publication: 2012
Authors: Anuradha.t, Satya Pasad R, S. N. Tirumalarao
10.5120/6440-8894

Anuradha.t, Satya Pasad R, S. N. Tirumalarao . Parallelizing Apriori on Dual Core using OpenMP. International Journal of Computer Applications. 43, 24 ( April 2012), 33-39. DOI=10.5120/6440-8894

@article{ 10.5120/6440-8894,
author = { Anuradha.t, Satya Pasad R, S. N. Tirumalarao },
title = { Parallelizing Apriori on Dual Core using OpenMP },
journal = { International Journal of Computer Applications },
issue_date = { April 2012 },
volume = { 43 },
number = { 24 },
month = { April },
year = { 2012 },
issn = { 0975-8887 },
pages = { 33-39 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume43/number24/6440-8894/ },
doi = { 10.5120/6440-8894 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:34:13.782566+05:30
%A Anuradha.t
%A Satya Pasad R
%A S. N. Tirumalarao
%T Parallelizing Apriori on Dual Core using OpenMP
%J International Journal of Computer Applications
%@ 0975-8887
%V 43
%N 24
%P 33-39
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Accumulation of abundant data from different sources of the society but a little knowledge situation has lead to Knowledge Discovery from Databases or Data Mining. Data Mining techniques use the existing data and retrieve the useful knowledge from it which is not directly visible in the original data. As Data Mining algorithms deal with huge data, the primary concerns are how to store the data in the main memory at run time and how to improve the run time performance. Sequential algorithms cannot provide scalability, in terms of the data dimension, size, or runtime performance, for such large databases. Because the data sizes are increasing to a larger quantity, we must use high-performance parallel and distributed computing to get the advantage of more than one processor to handle these large quantities of data. The recent advancements in computer hardware for parallel processing is multi core or Chip Multiprocessor (CMP) systems. In this paper we present an efficient and easy technique for parallelization of apriori on dual-core using openMP wih perfect load balancing between the two cores. We present the performance evaluation of apriori for different support counts with different sized databases on dual core compared to sequential implementation.

References
  1. Jiawei Han and Micheline Kamber 2006 "Data Mining concepts and Techniques", 2nd edition Morgan Kaufmann Publishers, San Francisco.
  2. Fayyad, Usama,Gregory Piatetsky-Shapiro, and Padhraic Smyth 1996 "From Data Mining to Knowledge Discovery in Databases". AI Magazine Volume 17 Number 3(1996)
  3. Agrawal R, Imielinski T, Swami A 1993 "Mining association rules between sets of items in large databases" In: Proceedings of the 1993ACM-SIGMODinternational conference on management of data (SIGMOD'93), Washington, DC, pp 207–216
  4. Agrawal R, Srikant R 1994 "Fast algorithms for mining association rules" In: Proceedings of the 1994 international conference on very large data bases (VLDB'94), Santiago, Chile, pp 487–499
  5. R. Agrawal and J. Shafer 1996 "Parallel mining of association rules" IEEE Trans. Knowl. Data Eng. , vol. 8, pp. 962–969, Dec. 1996.
  6. M. J. Zaki 1997 "parallel and distributed association mining:A survey"IEEEConcur,vol. 7, pp. 14–25, Dec. 1997.
  7. Herb Sutter 2005 "The Free Lunch Is Over A Fundamental Turn Toward Concurrency in Software" This article appeared in Dr. Dobb's Journal, 30(3), March 2005.
  8. N. Karmakar 2011 "The New Trend in processor Making Multi-Core Architecture" www. scribd. com 15th may2011
  9. Jiawei Han, Hong Cheng,Dong Xin, Xifeng Yan 2007 "Frequent pattern mining: current status and future directions" In the Journal of Data Min Knowl Disc (2007) 15:55–86,Springer Science+Business Media, LLC 2007.
  10. Han J, Pei J, Yin Y 2000 " Mining frequent patterns without candidate generation" In: Proceeding of the 2000 ACM-SIGMOD international conference on management of data (SIGMOD'00),Dallas, TX, pp 1–12
  11. ZakiMJ 2000 "Scalable algorithms for association mining" IEEETransKnowl Data Eng 12:372–390
  12. Park JS, Chen MS, Yu PS 1995 "Efficient parallel mining for association rules" In: Proceeding of the 4th international conference on information and knowledge management, Baltimore, MD,pp 31–36
  13. Agrawal R, Shafer JC 1996 "Parallel mining of association rules: design, implementation, and experience" IEEE Trans Knowl Data Eng 8:962–969
  14. Cheung DW, Han J, Ng V, Fu A, Fu Y 1996 "A fast distributed algorithm for mining association rules" In: Proceeding of the 1996 international conference on parallel and distributed information systems, Miami Beach, FL, pp 31–44
  15. O. R Zaiane,M. El-Hajj, and P. Lu 2001 "Fast parallel association rule mining without candidacy generation" in Proc. ICDM, 2001, [Online]. Available: citeseer. ist. psu. edu/474 621. html, pp. 665–668.
  16. Wenbin Fang, Mian Lu, Xiangye Xiao, Bingsheng He, Qiong Luo 2009 "Frequent itemset mining on graphics processors" Proceedings of the Fifth International Workshop on Data Management onNew Hardware (DaMoN 2009) June 28, 2009, Providence, Rhode-Island
  17. . Shirish Tatikonda, Srinivasan Parthasarathy "Mining TreeStructured Data on Multicore Systems", VLDB '08, August 2430, 2008, Auckland, New Zealand
  18. Li Liu2, 1, Eric Li1, Yimin Zhang1, Zhizhong Tang 2007 "Optimization of Frequent Itemset Mining on Multiple-Core Processor" VLDB '07, September 23-28, 2007, Vienna, Austria.
  19. Amdahl, Gene 1967 "Validity of the Single Processor Approach to Achieving Large-Scale Computing Capabilities". AFIPS Conference Proceedings (30): 483–485.
  20. OpenMP Architecture , "OpenMP C and C++ ApplicationProgramInterface", Copyright © 1997-2002 OpenMP Architecture Review Board. http://www. openmp. org/
  21. Kent Milfeld 2011 "Introduction to Programming with OpenMP" September 12th 2011, TACC
  22. Ruud van der pas 2009 "An Overview of OpenMP" NTU Talk January 14 2009
  23. S N sivanandam, S Sumathi 2006 "DataMining concepts,tasks and Techniques" First print 2006 by Thomson Business Information Pvt. Ltd. , India.
Index Terms

Computer Science
Information Sciences

Keywords

Multi Core Apriori Parallelization Openmp