CFP last date
20 December 2024
Reseach Article

Classification Algorithm based on MS Apriori for Rare Classes

by Devashree Rai, Kesari Verma, A. S. Thoke
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 48 - Number 22
Year of Publication: 2012
Authors: Devashree Rai, Kesari Verma, A. S. Thoke
10.5120/7516-0599

Devashree Rai, Kesari Verma, A. S. Thoke . Classification Algorithm based on MS Apriori for Rare Classes. International Journal of Computer Applications. 48, 22 ( June 2012), 52-56. DOI=10.5120/7516-0599

@article{ 10.5120/7516-0599,
author = { Devashree Rai, Kesari Verma, A. S. Thoke },
title = { Classification Algorithm based on MS Apriori for Rare Classes },
journal = { International Journal of Computer Applications },
issue_date = { June 2012 },
volume = { 48 },
number = { 22 },
month = { June },
year = { 2012 },
issn = { 0975-8887 },
pages = { 52-56 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume48/number22/7516-0599/ },
doi = { 10.5120/7516-0599 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:44:48.523182+05:30
%A Devashree Rai
%A Kesari Verma
%A A. S. Thoke
%T Classification Algorithm based on MS Apriori for Rare Classes
%J International Journal of Computer Applications
%@ 0975-8887
%V 48
%N 22
%P 52-56
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Most of the data mining algorithm focuses on frequent patterns, few algorithm emphases on rare items, but rare items [1] also have importance, for example, network intrusion detection, where among various normal connections we need to detect the rare malicious connections. Classification of such a non-uniform data set is a challenging issue. Most classifiers perform poorly in such a data set. Realizing the importance of rare class classification, in this paper we propose a classification algorithm (CBMR Algorithm) that is based on association rules mined by MSApriori approach [2] and is capable of classifying rare classes. The performance evaluation of the proposed algorithm has been done for different data sets [3] and in comparison with existing technique like [4], it is found that algorithm has efficient and superior performance for classifying rare cases.

References
  1. Weiss, G. M. "Mining With Rarity: A Unifying Framework. " SIGKDD Explorations, 2004, Vol. 6, Issue 1, pp. 7 – 19.
  2. Liu, B. , Hsu, W. , and Ma, Y. "Mining Association Rules with Multiple Minimum Supports. " SIGKDD Explorations, 1999wman, M. , Debray, S. K. , and Peterson, L. L. 1993. Reasoning about naming systems. .
  3. Coenen, F. (2003), The LUCS-KDD Discretized/normalised ARM and CARM data library http://www. csc. liv. ac. uk/~frans /KDD/Software /LUCS_KDD_DN/, Department of Computer Science, The University of Liverpool, UK.
  4. B. Liu, W. Hsu and Y. Ma, "Integrating classification and association rule mining", proceedings of the fourth international conference on knowledge discovery and data mining, 1998, pp. 80-86.
  5. M. S. Chen, J. Han, P. S. Yu, "Data mining: an overview from a database perspective", IEEE Transactions on Knowledge and Data Engineering, 1996, 8, pp. 866-883.
  6. Agrawal, R. , Imielinski, T. , and Swami, A. "Mining association rules between sets of items in large databases. " SIGMOD, 1993, pp. 207-216.
  7. Agrawal, R. , and Srikanth, R. "Fast algorithms for mining association rules. " VLDB, 1994.
  8. W. Li, J. Han and J. Pei, "CMAR: Accurate and efficient classification based on multiple class-association Rules", In ICDM'01, San Jose, CA, Nov. 2001, pp. 369-376.
  9. M. Kubat, R. C. Holte, and S. Matwin. Machine learning for the detection of oil spills in satellite radar images. Machine Learning, 30(2):195-215, 1998.
  10. P. K. Chan, and S. J. Stolfo. Toward scalable learning with non-uniform class and cost distributions: a case study in credit card fraud detection. In Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining, pages 164-168, 2001.
  11. G. M. Weiss, and H. Hirsh. Learning to predict rare events in event sequences. In Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining, pages 359-363, 1998.
  12. G. Medioni, I. Cohen, F. Brémond, S. Hongeng, and R. Nevatia. Event detection and analysis from video streams. IEEE Transactions on Pattern Analysis and Machine Intelligence, 23(8): 873-889, 2001.
  13. H. Zhong, J. Shi, and M. Visontai. Detecting unusual activity in video. Proc. CVPR'04, Washington, DC, 2004, Vol. 2, pp. 819-826.
  14. Mannila, H. "Methods and Problems in Data Mining. " ICDT, 1997.
  15. R. Kiran and P. Reddy "An Improved Multiple Minimum Support Based Approach to Mine Rare Association Rules" IEEE 2009.
  16. I. Kouris, C. Makris, A. Tsakalidis"An improved algorithm for mining association rules using multiple support values" FLAIRS 2003.
  17. Coenen, F. (2004). LUCS KDD implementation of CBA (Classification Based on associations). http://www. csc. liv. ac. uk/~frans/KDD/Software/CMAR/cba. html, Department of Computer Science, The University of Liverpool, UK.
Index Terms

Computer Science
Information Sciences

Keywords

Rare Classes Msapriori Algorithm Classification Data Mining