We apologize for a recent technical issue with our email system, which temporarily affected account activations. Accounts have now been activated. Authors may proceed with paper submissions. PhDFocusTM
CFP last date
20 December 2024
Reseach Article

A Comparative Analysis of Clustering Algorithms

by Raj Bala, Sunil Sikka, Juhi Singh
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 100 - Number 15
Year of Publication: 2014
Authors: Raj Bala, Sunil Sikka, Juhi Singh
10.5120/17603-8293

Raj Bala, Sunil Sikka, Juhi Singh . A Comparative Analysis of Clustering Algorithms. International Journal of Computer Applications. 100, 15 ( August 2014), 35-39. DOI=10.5120/17603-8293

@article{ 10.5120/17603-8293,
author = { Raj Bala, Sunil Sikka, Juhi Singh },
title = { A Comparative Analysis of Clustering Algorithms },
journal = { International Journal of Computer Applications },
issue_date = { August 2014 },
volume = { 100 },
number = { 15 },
month = { August },
year = { 2014 },
issn = { 0975-8887 },
pages = { 35-39 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume100/number15/17603-8293/ },
doi = { 10.5120/17603-8293 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:30:04.184859+05:30
%A Raj Bala
%A Sunil Sikka
%A Juhi Singh
%T A Comparative Analysis of Clustering Algorithms
%J International Journal of Computer Applications
%@ 0975-8887
%V 100
%N 15
%P 35-39
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Clustering is a process of grouping a set of similar data objects within the same group based on similarity criteria (i. e. based on a set of attributes). There are many clustering algorithms. The objective of this paper is to perform a comparative analysis of four clustering algorithms namely K-means algorithm, Hierarchical algorithm, Expectation and maximization algorithm and Density based algorithm. These algorithms are compared in terms of efficiency and accuracy, using WEKA tool. The data for clustering is used in normalized and as well as unnormalized format. In terms of efficiency and accuracy K-means produces better results as compared to other algorithms.

References
  1. Usama Fayyad, Gregory Piatetsky Shapiro and padhraic Symyh, "The KDD Process for Extracting Useful Knowledge from Volumes of Data", Communication of the ACM, Vol. 39, No. 11, pp. 27-34,1996.
  2. Chauhan R, Kaur H, Alam M A, "Data Clustering Method for Discovering Clusters in Spatial Cancer Databases", International Journal of Computer Applications , (0975 – 8887) Vol. 10– No. 6, November 2010.
  3. AmandeepKaurMann ,NavneetKaur ,"Survey Paper on Clustering Techniques "Volume 2, Issue 4, April 2013 ISSN: 2278 – 7798.
  4. Jain A. K. , Murty M. N. , and Flynn P. J. , "Data Clustering: A Review", ACM Computing Surveys, 31 (3). pp. 264-323, 1999.
  5. Data Preprocessing in WEKA, Available at: http://facweb. cs. depaul. edu/mobasher/classes/ect584/weka/preprocess. html.
  6. Jiawei Han, MichelineKamber," Data Mining: Concepts and Techniques" Second Edition.
  7. Dr. N. RajalingamK. Ranjini, "Hierarchical Clustering Algorithm - A Comparative Study" Volume 19– No. 3, April 2011, ISSN: 0975 – 8887.
  8. Sharmila, R. C Mishra "Performance Evaluation of Clustering Algorithms" International Journal of Engineering Trends and Technology (IJETT) - Volume4 Issue7- July 2013, ISSN: 2231-5381.
  9. Thomas Schön, "Machine Learning, Lecture 6 Expectation Maximization (EM) and clustering", Available at: http://www. control. isy. liu. se/student/ graduate/MachineLearning/Lectures/le6. pdf.
  10. S. Revathi, Dr. T. NalinI, "Performance Comparison of Various Clustering Algorithm" Volume 3, Issue 2, February 2013, ISSN: 2277 128X.
  11. Data Clustering Algorithms, Available at: https://sites. google. com/site/dataclusteringalgorithms/density-based-clustering-algorithmen.
  12. Introduction to Weka, Available at: http://transact. dl. sourceforge. net/sourcefor ge/weka/WekaManual-3. 6. 0. pdf
Index Terms

Computer Science
Information Sciences

Keywords

Clustering K-means algorithm Hierarchical algorithm Expectation and maximization algorithm and Density based algorithm and WEKA tool.