We apologize for a recent technical issue with our email system, which temporarily affected account activations. Accounts have now been activated. Authors may proceed with paper submissions. PhDFocusTM
CFP last date
20 December 2024
Reseach Article

A Genetic Algorithm for Discovering Classification Rules in Data Mining

by Basheer M. Al-maqaleh, Hamid Shahbazkia
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 41 - Number 18
Year of Publication: 2012
Authors: Basheer M. Al-maqaleh, Hamid Shahbazkia
10.5120/5644-8072

Basheer M. Al-maqaleh, Hamid Shahbazkia . A Genetic Algorithm for Discovering Classification Rules in Data Mining. International Journal of Computer Applications. 41, 18 ( March 2012), 40-44. DOI=10.5120/5644-8072

@article{ 10.5120/5644-8072,
author = { Basheer M. Al-maqaleh, Hamid Shahbazkia },
title = { A Genetic Algorithm for Discovering Classification Rules in Data Mining },
journal = { International Journal of Computer Applications },
issue_date = { March 2012 },
volume = { 41 },
number = { 18 },
month = { March },
year = { 2012 },
issn = { 0975-8887 },
pages = { 40-44 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume41/number18/5644-8072/ },
doi = { 10.5120/5644-8072 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:29:58.178989+05:30
%A Basheer M. Al-maqaleh
%A Hamid Shahbazkia
%T A Genetic Algorithm for Discovering Classification Rules in Data Mining
%J International Journal of Computer Applications
%@ 0975-8887
%V 41
%N 18
%P 40-44
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Data mining has as goal to discover knowledge from huge volume of data. Rule mining is considered as one of the usable mining method in order to obtain valuable knowledge from stored data on database systems. In this paper, a genetic algorithm-based approach for mining classification rules from large database is presented. For emphasizing on accuracy, coverage and comprehensibility of the rules and simplifying the implementation of a genetic algorithm. The design of encoding, genetic operators and fitness function of genetic algorithm for this task are discussed. Experimental results show that genetic algorithm proposed in this paper is suitable for classification rule mining and those rules discovered by the algorithm have higher classification performance to unknown data.

References
  1. Zhu, X. and Davidson, I. 2007. Knowledge Discovery and Data Mining Challenges and Realities. IGI Global.
  2. Fayyad, U. M. , Piatetsky-Sharpio, G. and Smyth, P. 1996. From mining to knowledge discovery : An overview. In: Fayyad, U . M. , Piatetsky-Sharpio, G. Smyth. P. and Uthurusany, R. (eds. )Advances in knowledge discovery and data mining , AAAI/MIT Press, pp. 1-34.
  3. Han, J. , Kamber, M. and Pei, J. 2011. Data Mining: Concepts and Techniques. Third Edition, Morgan Kaufmann.
  4. Freitas, A. A. 2002. Data Mining and Knowledge Discovery with Evolutionary Algorithms. Springer-Verlag, Berlin Heidelberg.
  5. Yogita, Saroj and Kumar, D. 2009. Rule +Exceptions: Automated discovery of comprehensible decision Rules. IEEE International Advance Computing Conference (IACC2009), Patiala, India, pp. 1479-1483.
  6. Barros, R. C. , Basgalupp, M. P. , Ferreira, A. C. and Frietas, A. A. 2011. Towards the automatic design of decision tree induction algorithms. In: GECCO (Companion Material ), Dublin, Ireland, pp. 567-574.
  7. Bramer, M. 2007. Principles of Data Mining. Springer-Verlag London Limited.
  8. Goldberg, D. E. 1989. Genetic Algorithms in Search, Optimization and Machine Learning. Addison-Wesley.
  9. Dehuri, S. and Mall, R. 2006. Predictive and comprehensible rule discovery using a multi objective genetic algorithms. Knowledge Based Systems, vol. 19, pp. 413-421.
  10. Fidelis, M. V. , Lopes, H. S. , Freitas, A. A. and Grossa, P. 2000. Discovering comprehensible classification rules with a genetic algorithm. In Proceedings of the 2000 Congress on Evolutionary Computation, La Jolla, CA, USA, IEEE, vol. 1, pp. 805-810.
  11. Kaplia, Saroj, Kumar D. and Kanika. 2010. A genetic algorithm with entropy based initial bias for automated rule mining. In Proceeding of the IEEE International Conference on Computer & Communication Technology(ICCCT'10), pp. 491-495.
  12. Bharadwaj, K. K. and Al-Maqaleh, B. M. 2005. Evolutionary approach for automated discovery of censored production rules. In: Proceedings of the 8th International Conference on Cybernetics, Informatics and Systemics (CIS-2005). vol. 10, Krakow, Poland, pp. 147-152.
  13. Bharadwaj, K. K. and Al-Maqaleh, B. M. 2006. Evolutionary approach for automated discovery of augmented production rules. International Journal of Computational Intelligence. vol. 3, Issue 4, pp. 267-275.
  14. Goplan J. , Alhajj R. and Barker, K. 2006. Discovering accurate and interesting classification rules using genetic algorithm. In Proceedings of the International Conference on Data Mining(DMIN06), Las Vegas, Nevada, USA , pp. 389-395.
  15. Carvalho, D. R. and Freitas, A. A. 2002. A genetic-algorithm for discovering small-disjunct rules in data mining. Applied Soft Computing, vol. 2, pp. 75-88.
  16. Sarkar, B. K. , Sana, S. S. and Chaudhuri, K. 2012. A genetic algorithm-based rule extraction system. Applied Soft Computing. vol. 12, pp. 238-254.
  17. Al-Maqaleh, B. M. 2012. Genetic algorithm approach to automated discovery of comprehensible production rules. In Proceeding of the IEEE 2nd International Conference on Advanced Computing & Communication Technologies (ACCT2012), Rohtak, India, pp. 69-71.
  18. Al-Maqaleh, B. M. 2012. Mining interesting classification rules: An evolutionary approach. International Journal of Mathematical Engineering and Science. vol. 1, Issue 1, pp. 13-20.
  19. Frietas, A. A. 1999. On rule interestingness measures. Knowledge-Based System. 12(5-6), pp. 309-315.
  20. Shi, X-J. and Lei, H. 2008. A genetic algorithm-based approach for classification rule discovery. In Proceeding of the IEEE International Conference on Information Management, Innovation Management and Industrial Engineering (ICIII08), Taipei, Taiwan, pp. 175-178.
  21. UCI Repository of Machine Learning Databases, Department of Information and Computer Science University of California, 1994. [http://www. ics. uci. edu/ ~mlearn/MLRepositry. html]
  22. Quinlan. J. R. 1993. C4. 5: Programs for Machine Learning. Morgan Kaufmann.
  23. Quinlan, J. R. 1991. Improved estimates for the accuracy of small disjuncts. Journal of Machine Learning, Kluwer Academic Publishers Hingham, MA, USA, vol. 6, Issue 1, pp. 93-98.
  24. Holte, R. C. , Acker, L. E. and Porter, B. W. 1989. Concept learning and the problem of small disjuncts. In Proceedings of IJCAI – 89, pp. 813-818.
Index Terms

Computer Science
Information Sciences

Keywords

Classification Rule Genetic Operators Fitness Function Predictive Accuracy