We apologize for a recent technical issue with our email system, which temporarily affected account activations. Accounts have now been activated. Authors may proceed with paper submissions. PhDFocusTM
CFP last date
20 December 2024
Reseach Article

K-means with Three different Distance Metrics

by Archana Singh, Avantika Yadav, Ajay Rana
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 67 - Number 10
Year of Publication: 2013
Authors: Archana Singh, Avantika Yadav, Ajay Rana
10.5120/11430-6785

Archana Singh, Avantika Yadav, Ajay Rana . K-means with Three different Distance Metrics. International Journal of Computer Applications. 67, 10 ( April 2013), 13-17. DOI=10.5120/11430-6785

@article{ 10.5120/11430-6785,
author = { Archana Singh, Avantika Yadav, Ajay Rana },
title = { K-means with Three different Distance Metrics },
journal = { International Journal of Computer Applications },
issue_date = { April 2013 },
volume = { 67 },
number = { 10 },
month = { April },
year = { 2013 },
issn = { 0975-8887 },
pages = { 13-17 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume67/number10/11430-6785/ },
doi = { 10.5120/11430-6785 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:24:17.802230+05:30
%A Archana Singh
%A Avantika Yadav
%A Ajay Rana
%T K-means with Three different Distance Metrics
%J International Journal of Computer Applications
%@ 0975-8887
%V 67
%N 10
%P 13-17
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The power of k-means algorithm is due to its computational efficiency and the nature of ease at which it can be used. Distance metrics are used to find similar data objects that lead to develop robust algorithms for the data mining functionalities such as classification and clustering. In this paper, the results obtained by implementing the k-means algorithm using three different metrics Euclidean, Manhattan and Minkowski distance metrics along with the comparative study of results of basic k-means algorithm which is implemented through Euclidian distance metric for two-dimensional data, are discussed. Results are displayed with the help of histograms.

References
  1. Agrawal R. , Faloutsos C. , Swami A. Efficient similarity search in sequence databases. Proc. 4 Th Int. Conf. On Foundations of Data Organizations and Algorithms, 1993. – Chicago. pp. 69-84.
  2. Archana Singh, Jyoti Agarwal, Ajay Rana January 2013. Performance Measure of Similis and FPGrowth Algorithm. International Journal of Computer Applications (0975 – 8887) Volume 62– No. 6.
  3. Archana Singh, Megha Chaudhary, Dr (Prof. ) Ajay Rana Gaurav Dubey 2011 . Online Mining of data to Generate Association Rule Mining in Large Databases. IEEE-International Conference on Recent Trends in Information Systems.
  4. Fast Distance Metric Based Data Mining Techniques Using P-trees: k-Nearest-Neighbor classification and k-Clustering : A Thesis Submitted to the Graduate Faculty Of the North Dakota State University.
  5. Joaquin Perez Ortega, Ma. Del Rocio Boone Rojas and Maria J. Somodevilla Garcia. Research issues on K-means Algorithm: An Experimental Trial Using Matlab.
  6. Jiawei Han and Micheline Kamber, "Data Mining: Concepts and Techniques," Morgan Kaufmann Publishers, August 2000. ISBN 1-55860-489-8
  7. Kaufman L. , Rousseeuw P. J. Finding groups in data. An introduction to cluster analysis. – John Wiley & Sons, 2005.
  8. Li M. , Chen X. , Ma B. , Vitanyi P. The similarity metric. IEEE Transactions on Information Theory, 2004, vol. 50, No. 12, pp. 3250-3264.
  9. Mike Estlick, Mirian Leeser, James Theiler and John Szymanski, "Algorithmic Transformations in the Implementation of K-means Clustering on Reconfigurable Hardware," FPGA 2001, February 11-13, 2001, Monterey, CA, USA.
  10. S. J. WAN, S. K. M. WONG, and P. PRUSINKIEWICZ, "An Algorithm for Multidimensional Data Clustering. "
  11. S. Z. Selim and M. A. Ismail. "k-Means-Type Algorithms: A Generalized convergence theorem and characterization of the local optimality. " IEEE transaction, Pattern Analysis Machine Intelligence. " PAMI-6, 1, 1994, 81-87.
  12. T. V. Vijay Kumar, Archana Singh, Gaurav Dubey 2012. Mining Queries for Constructing Materialized Vies in a Data Warehouse. Advances in Intelligent Systems and Computing Volume 167, 2012, pp 149-159 Springer.
  13. Tapas Kanungo, David M. Mount,Nathan S. Netanyahu, Christine D. Piatko, Ruth Silverman and Angela Y. Wu. An Efficient k-means Clustering Algorithm: Analysis and Implementation.
  14. Tan, Steinbach, Kumar Ghosh. The k-means algorithm - Notes.
  15. Vitanyi P. Universal similarity, ITW2005, Rotorua, New Zealand, 2005.
  16. Xu R. , Wunch D. C. Clustering. – John Wiley & Sons, 20 09, 358 p.
  17. X. Wu and I. H. Witten, "A Fast k-Means Type Clustering Algorithm. " Dept of Computer Science, Univ. of Calgary, Canada, May 1985.
  18. Yair Bartal, Moses Charikary, Danny Razz, "Approximating min-sum k-Clustering in Metric Spaces," Symposium on Theory of computing, July 6-8, 2001, Hersonissos, Crete, Greece.
Index Terms

Computer Science
Information Sciences

Keywords

Centroids clustering distortion metrics similarity matrix