2nd National Conference on Information and Communication Technology |
Foundation of Computer Science USA |
NCICT - Number 3 |
November 2011 |
Authors: Ashish Jaiswal, Prof. Nitin Janwe |
03233b8b-4cbc-4c24-afed-f0ba889ce7bd |
Ashish Jaiswal, Prof. Nitin Janwe . Hierarchical Document Clustering: A Review. 2nd National Conference on Information and Communication Technology. NCICT, 3 (November 2011), 37-41.
As text documents are largely increasing in the internet, the process of grouping similar documents for versatile applications have put the eye of researchers in this area. However most clustering methods suffer from challenges in dealing with problems of high dimensionality, scalability, accuracy and meaningful cluster labels. This paper presents a review on all these well known methods of document clustering. Hierarchical document clustering method is explained in detail. Study shows that hierarchical document clustering performs well but still there is a scope to improve above mentioned problems.