Finding the Number of Clusters in Unlabeled Datasets using Extended Dark Block Extraction

Srinivasulu Asadi; Dr Ch D V Subba Rao; V Saikrishna

Call for Paper

April Edition

IJCA solicits high quality original research papers for the upcoming April edition of the journal. The last date of research paper submission is 20 March 2026

Submit your paper

Know more

The week's pick

Explainable Hybrid Deep Learning for Automated Diagnosis of Canine Mammary Tumors

Elham Shawky Salama Heba Askr Ashraf Darwish Aboul Ella Hassanien

Random Articles

Reseach Article

Finding the Number of Clusters in Unlabeled Datasets using Extended Dark Block Extraction

by Srinivasulu Asadi, Dr Ch D V Subba Rao, V Saikrishna

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 7 - Number 3

Year of Publication: 2010

Authors: Srinivasulu Asadi, Dr Ch D V Subba Rao, V Saikrishna

10.5120/1148-1503

Srinivasulu Asadi, Dr Ch D V Subba Rao, V Saikrishna . Finding the Number of Clusters in Unlabeled Datasets using Extended Dark Block Extraction. International Journal of Computer Applications. 7, 3 ( September 2010), 1-4. DOI=10.5120/1148-1503

@article{ 10.5120/1148-1503,

author = { Srinivasulu Asadi, Dr Ch D V Subba Rao, V Saikrishna },

title = { Finding the Number of Clusters in Unlabeled Datasets using Extended Dark Block Extraction },

journal = { International Journal of Computer Applications },

issue_date = { September 2010 },

volume = { 7 },

number = { 3 },

month = { September },

year = { 2010 },

issn = { 0975-8887 },

pages = { 1-4 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume7/number3/1148-1503/ },

doi = { 10.5120/1148-1503 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T19:55:55.924102+05:30

%A Srinivasulu Asadi

%A Dr Ch D V Subba Rao

%A V Saikrishna

%T Finding the Number of Clusters in Unlabeled Datasets using Extended Dark Block Extraction

%J International Journal of Computer Applications

%@ 0975-8887

%V 7

%N 3

%P 1-4

%D 2010

%I Foundation of Computer Science (FCS), NY, USA

Abstract

Clustering analysis is the problem of partitioning a set of objects O = {o1… on} into c self-similar subsets based on available data. In general, clustering of unlabeled data poses three major problems: 1) assessing cluster tendency, i.e., how many clusters to seek? 2) Partitioning the data into c meaningful groups, and 3) validating the c clusters that are discovered. We address the first problem, i.e., determining the number of clusters c prior to clustering. Many clustering algorithms require number of clusters as an input parameter, so the quality of the clusters mainly depends on this value. Most methods are post clustering measures of cluster validity i.e., they attempt to choose the best partition from a set of alternative partitions.

References

R.F. Ling, Comm. ACM, vol. 16, pp. 355-361, 1973, “A Computer Generated Aid for Cluster Analysis,”
J.C. Bezdek and R. Hathaway,” Proc. Int’l Joint Conf. Neural Networks (IJCNN ’02), pp. 2225-2230, 2002,
J. Huband, J.C. Bezdek, and R. Hathaway, Pattern Recognition, vol. 38, no. 11, pp. 1875-1886, 2005, “bigVAT: Visual Assessment of Cluster Tendency for Large Data Sets”.
R. Hathaway, J.C. Bezdek, and J. Huband, Pattern Recognition, vol. 39, pp. 1315-1324, 2006, “Scalable Visual Assessment of Cluster Tendency”.
W.S. Cleveland, Visualizing Data. Hobart Press, 1993. J.C. Bezdek, R.J. Hathaway, and J. Huband, IEEE Trans. Fuzzy Systems, vol. 15, no. 5, pp. 890-903, 2007, “Visual Assessment of Clustering Tendency for Rectangular Dissimilarity Matrices”.
R.C. Gonzalez and R.E. Woods, Prentice Hall, 2002, Digital Image Processing.
I. Dhillon, D. Modha, and W. Spangler, Proc. 30th Symp. Interface: Computing Science and Statistics, 1998, “Visualizing Class Structure of Multidimensional Data”.
R.F. Ling, Comm. ACM, vol. 16, pp. 355-361, 1973, “A Computer Generated Aid for Cluster Analysis”.
T. Tran-Luu, PhD dissertation, Univ. of Maryland, College Park, 1996, “Mathematical Concepts and Novel Heuristic Methods for Data Clustering and Visualization”.
J.C. Bezdek and R. Hathaway, Proc. Int’l Joint Conf. Neural Networks (IJCNN ’02), pp. 2225-2230, 2002, “VAT: A Tool for Visual Assessment of (Cluster) Tendency”.
J. Huband, J.C. Bezdek, and R. Hathaway, Pattern Recognition, vol. 38, no. 11, pp. 1875-1886, 2005, “bigVAT: Visual Assessment of Cluster Tendency for Large Data Sets”.
Liang Wang, Christopher Leckie, Kotagiri Ramamohanarao, and James Bezdek, Fellow, IEEE-MARCH 2009, Automatically Determining the Number of Clusters in Unlabeled Data Sets.

Index Terms

Computer Science

Information Sciences

Keywords

Clustering Cluster Tendency Reordered Dissimilarity Image VAT C-Means Clustering