International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 110 - Number 1 |
Year of Publication: 2015 |
Authors: Rupali Patil, Shyam Deshmukh, K Rajeswari |
10.5120/19280-0694 |
Rupali Patil, Shyam Deshmukh, K Rajeswari . Analysis of SimpleKMeans with Multiple Dimensions using WEKA. International Journal of Computer Applications. 110, 1 ( January 2015), 14-17. DOI=10.5120/19280-0694
Clustering techniques have more importance in data mining especially when the data size is very large. It is widely used in the fields including pattern recognition system, machine learning algorithms, analysis of images, information retrieval and bio-informatics. Different clustering algorithms are available such as Expectation Maximization (EM), Cobweb, FarthestFirst, OPTICS, SimpleKMeans etc. SimpleKMeans clustering is a simple clustering algorithm. It partitions n data tuples into k groups such that each entity in the cluster has nearest mean. This paper is about the implementation of the clustering techniques using WEKA interface. This paper includes a detailed analysis of various clustering techniques with the different standard online data sets. Analysis is based on the multiple dimensions which include time to build the model, number of attributes, number of iterations, number of clusters and error rate.