Research Article

Breast Cancer Diagnosis by using k-Nearest Neighbor with Different Distances and Classification Rules

by Seyyid Ahmed Medjahed, Tamazouzt Ait Saadi, Abdelkader Benyettou
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 62 - Number 1
Year of Publication: 2013
DOI: 10.5120/10041-4635

Seyyid Ahmed Medjahed, Tamazouzt Ait Saadi, Abdelkader Benyettou. Breast Cancer Diagnosis by using k-Nearest Neighbor with Different Distances and Classification Rules. International Journal of Computer Applications. 62, 1 (January 2013), 1-5. DOI=10.5120/10041-4635

@article{ 10.5120/10041-4635,
author = { Seyyid Ahmed Medjahed and Tamazouzt Ait Saadi and Abdelkader Benyettou },
title = { Breast Cancer Diagnosis by using k-Nearest Neighbor with Different Distances and Classification Rules },
journal = { International Journal of Computer Applications },
issue_date = { January 2013 },
volume = { 62 },
number = { 1 },
month = { January },
year = { 2013 },
issn = { 0975-8887 },
pages = { 1-5 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume62/number1/10041-4635/ },
doi = { 10.5120/10041-4635 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:12:44.407212+05:30
%A Seyyid Ahmed Medjahed
%A Tamazouzt Ait Saadi
%A Abdelkader Benyettou
%T Breast Cancer Diagnosis by using k-Nearest Neighbor with Different Distances and Classification Rules
%J International Journal of Computer Applications
%@ 0975-8887
%V 62
%N 1
%P 1-5
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Cancer diagnosis is one of the most studied problems in the medical domain, and many researchers have worked to improve diagnostic performance and obtain satisfactory results. Breast cancer is one of the leading cancer killers in the world, and its diagnosis remains a major problem in cancer diagnosis research. In artificial intelligence, machine learning is a discipline that allows a machine to improve through a learning process. Machine learning is widely used in bioinformatics, and particularly in breast cancer diagnosis. One of the most popular methods is K-nearest neighbors (K-NN), a supervised learning method. When K-NN is used for medical diagnosis, the quality of the results depends largely on the distance used and on the value of the parameter "k", which represents the number of nearest neighbors. In this paper, we study and evaluate the performance of different distances that can be used in the K-NN algorithm. We also analyze these distances using different values of the parameter "k" and several classification rules (the rule used to decide how to classify a sample). Our work is performed on the WBCD database (Wisconsin Breast Cancer Database) obtained from the University of Wisconsin Hospital.
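
To make the three ingredients named in the abstract concrete (the distance, the parameter "k", and the classification rule), the following is a minimal sketch of a K-NN classifier in which each one can be swapped independently. It is an illustration only, not the authors' implementation: the three distances (Euclidean, Manhattan, Chebyshev), the two rules (majority vote and inverse-distance weighting), all function names, and the toy two-feature samples are assumptions chosen for the example, and none of the values come from the WBCD database.

```python
import math
from collections import Counter

# --- Candidate distances (illustrative choices, not taken from the paper) ---
def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def manhattan(a, b):
    return sum(abs(x - y) for x, y in zip(a, b))

def chebyshev(a, b):
    return max(abs(x - y) for x, y in zip(a, b))

# --- Candidate classification rules (also illustrative) ---
def majority_rule(neighbors):
    """neighbors: list of (distance, label) pairs, nearest first."""
    counts = Counter(label for _, label in neighbors)
    best = max(counts.values())
    # On a tie, return the tied label whose neighbor is nearest.
    for _, label in neighbors:
        if counts[label] == best:
            return label

def inverse_distance_rule(neighbors, eps=1e-12):
    """Each neighbor votes with weight 1 / distance."""
    weights = {}
    for dist, label in neighbors:
        weights[label] = weights.get(label, 0.0) + 1.0 / (dist + eps)
    return max(weights, key=weights.get)

def knn_predict(train_X, train_y, query, k=3,
                distance=euclidean, rule=majority_rule):
    """Classify `query` from its k nearest training samples."""
    scored = sorted(
        ((distance(x, query), y) for x, y in zip(train_X, train_y)),
        key=lambda pair: pair[0],
    )
    return rule(scored[:k])

# Hypothetical toy samples with two features each (not WBCD data).
train_X = [(1, 1), (2, 1), (1, 2), (8, 8), (9, 8), (8, 9)]
train_y = ["benign"] * 3 + ["malignant"] * 3

if __name__ == "__main__":
    for dist in (euclidean, manhattan, chebyshev):
        for k in (1, 3, 5):
            label = knn_predict(train_X, train_y, (2, 2), k=k, distance=dist)
            print(dist.__name__, k, label)
```

Passing the distance and the rule as function arguments keeps the neighbor search unchanged while the other two choices vary, which is the kind of comparison the abstract describes. The two rules can disagree: with one very close neighbor of one class and two farther neighbors of the other, the majority vote follows the two, while inverse-distance weighting follows the close one.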

Index Terms

Computer Science
Information Sciences

Keywords

Classification, Diagnosis, Breast Cancer, K-Nearest Neighbors, Distance, Classification Rule