We apologize for a recent technical issue with our email system, which temporarily affected account activations. Accounts have now been activated. Authors may proceed with paper submissions. PhDFocusTM
CFP last date
20 December 2024
Reseach Article

A Suitability Study of Discretization Methods for Associative Classifiers

by O. P. Vyas, Kavita Das
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 5 - Number 10
Year of Publication: 2010
Authors: O. P. Vyas, Kavita Das
10.5120/944-1322

O. P. Vyas, Kavita Das . A Suitability Study of Discretization Methods for Associative Classifiers. International Journal of Computer Applications. 5, 10 ( August 2010), 46-51. DOI=10.5120/944-1322

@article{ 10.5120/944-1322,
author = { O. P. Vyas, Kavita Das },
title = { A Suitability Study of Discretization Methods for Associative Classifiers },
journal = { International Journal of Computer Applications },
issue_date = { August 2010 },
volume = { 5 },
number = { 10 },
month = { August },
year = { 2010 },
issn = { 0975-8887 },
pages = { 46-51 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume5/number10/944-1322/ },
doi = { 10.5120/944-1322 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T19:53:57.617639+05:30
%A O. P. Vyas
%A Kavita Das
%T A Suitability Study of Discretization Methods for Associative Classifiers
%J International Journal of Computer Applications
%@ 0975-8887
%V 5
%N 10
%P 46-51
%D 2010
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Discretization is a popular approach for handling numeric attributes in machine learning. The attributes in the datasets are both nominal and continuous. Most of the Classifiers are capable to be applied on discretized data. Hence, pre-processing of continuous data for converting them into discretized data is a necessary step before being used for the Classification Rule Mining approaches. Recently developed Associative Classifiers like CBA, CMAR and CPAR are almost equal in accuracy and have outperformed traditional classifiers. The distribution of continuous data into discrete ranges may affect the accuracy of classification. This work provides a comparative study of few discretization methods with these new classifiers. The target is to find some suitable discretization methods that are more suitable with these associative classifiers.

References
  1. Alcala-Fdez, J., Sanchez, L., et al. 2009. KEEL: A software tool to assess evolutionary algorithms for data mining problems. Soft Computing - A Fusion of Foundations, Methodologies and Applications: Springer Berlin / Heidelberg, pp.307-318. Available from anonymous ftp.
  2. Ching, J. Y., Wong Andrew, K. C., and Chan, Keith K. C. 1995. Inductive Learning from Continuous and Mixed-Mode Data. IEEE Transactions on Pattern Analysis and Machine Intelligence
  3. Dougherty, J., Kohavi, R. and Sahami, N. 1995 Supervised and Unsupervised Discretization of continuous Features. Machine Learning, 14th IJCAI, 1995, 108-121.
  4. Fayyad, U. M. and Irani, K. B. 1993. Multi-Interval Discretization of Continuous Valued Attributes for Classification Learning. 13th IJCAI, vol. 2., Chambery, France, 28.8.-2.9.93, Morgan Kaufmann, 1022–1027
  5. R. Giraldez, J. S., Aguilar-ruiz, et al. 2002. Discretization Oriented to Decision Rules Generation. Frontiers in Artificial Intelligence and Applications.
  6. Kerber, R. 1992. ChiMerge: Discretization of Numeric Attributes. In proceedings of tenth National Conference on Artificial Intelligence. 123-128
  7. Liu, B., Hsu, W., and Ma, Y. 1998. Integrating Classification and Association Rule Mining. In proceedings of the KDD, 1997, New York, 80-86, 1998
  8. Perner, P., Trautzsch, S. 1998. Multi-Interval Discretization Methods for Decision Tree Learning. LNCS 1451, Springer Verlag, 475-482
  9. Quinlan, J. R. 1986. Induction of Decision Trees. Machine Learning, 1: 81-106
  10. Quinlan, J. R., Cameron-Jones, R. M. 1993. FOIL: A midterm report. In proceedings of European Conference on Machine Learning, Vienna, Austria
  11. Thabtah, F., Cowling, P. and Peng, Y. 2005. A Study of Predictive Accuracy for Four Associative Classifiers. Journal Of Digital Information Management
  12. Thabtah, F., Cowling, P. and Peng, Y. 2006. Multiple Labels Associative Classification. Knowledge and Information Systems. Vol. 9, No. 1. 109-129
  13. Wenmin, L., Jiawei, H. and Pei, J. 2001. CMAR: Accurate and Efficient Classification Based on Multiple Class-Association Rules. In proceedings of ICDM. 369-376
  14. Xiaoxin, Y. and Han, J. 2003. CPAR: Classification based on Predictive Association Rules. In proceedings of SIAM International Conference on Data Mining, San Fransisco, CA. 331-335.
Index Terms

Computer Science
Information Sciences

Keywords

ARM CRM CBA CMAR CPAR CADD USD ChiMerge MDLP ID3 EWD EFD