International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 72 - Number 6
Year of Publication: 2013
Authors: P. Venkateshkumar, A. Subramani
10.5120/12497-7430
P. Venkateshkumar, A. Subramani. Using Data Fusion for a Context Aware Document Clustering. International Journal of Computer Applications. 72, 6 (June 2013), 17-20. DOI=10.5120/12497-7430
The large volume of unstructured text data available from sources such as digital libraries, news, and the internet has given rise to a need to organize information according to the user's requirements. Searching for relevant information is more efficient when the context of the selected word in the document is considered. Document clustering aims to discover natural groupings and to present an overview of the classes (topics) in a document collection. Thus, documents with similar contents are related to the same query. In this paper, a new method for clustering documents is proposed. In the proposed method, the term frequency of the document collection is computed and context-based terms are fused. Agglomerative clustering and Bisecting K-Means are then used to cluster the extracted features.
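To make the pipeline in the abstract more concrete, the following is a minimal sketch of two of its steps: computing term frequencies and grouping documents with agglomerative clustering. It is not the authors' implementation. The abstract does not specify how context-based terms are fused, so that step is omitted, and Bisecting K-Means is left out for brevity. The function names, the whitespace tokenizer, the cosine similarity measure, the average-link merge rule, and the sample documents are all assumptions made for illustration.

```python
import math
from collections import Counter


def term_frequency(doc):
    """Return a term -> count mapping (assumed tokenizer: lowercase, split on whitespace)."""
    return Counter(doc.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term-frequency vectors (an assumed measure)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def agglomerative(docs, k):
    """Repeatedly merge the two most similar clusters (average link) until k remain."""
    vectors = [term_frequency(d) for d in docs]
    clusters = [[i] for i in range(len(docs))]  # start with one cluster per document

    def avg_sim(c1, c2):
        # Average pairwise similarity between the members of two clusters.
        total = sum(cosine(vectors[i], vectors[j]) for i in c1 for j in c2)
        return total / (len(c1) * len(c2))

    while len(clusters) > k:
        pairs = (
            (x, y)
            for x in range(len(clusters))
            for y in range(x + 1, len(clusters))
        )
        x, y = max(pairs, key=lambda p: avg_sim(clusters[p[0]], clusters[p[1]]))
        clusters[x] = clusters[x] + clusters[y]
        del clusters[y]  # y > x, so index x is unaffected
    return [sorted(c) for c in clusters]


# Hypothetical sample collection: two finance documents, two sports documents.
docs = [
    "stock market prices fall",
    "market prices rise on stock news",
    "football team wins match",
    "team loses football match",
]
clusters = agglomerative(docs, 2)
print(clusters)
```

On this toy collection the two finance documents and the two sports documents end up in separate clusters, since documents from different topics share no terms. A fusion step for context-based terms, as described in the abstract, would presumably change the vectors passed to the clustering stage rather than the clustering routine itself.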