International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 89 - Number 2
Year of Publication: 2014
Authors: Soheila Karbasi, Mehdi Yaghoubi
DOI: 10.5120/15475-4164
Soheila Karbasi, Mehdi Yaghoubi. Term Importance Degree Impact on Search Result Clustering. International Journal of Computer Applications. 89, 2 (March 2014), 32-34. DOI=10.5120/15475-4164
As current clustering algorithms have to deal with the explosive growth of documents of various sizes and terms of various frequencies, an appropriate term-weighting scheme has a crucial impact on the overall performance of such systems. Term weighting is one of the critical processes for document retrieval and ranking in most search result clustering systems. In this paper we introduce a new technique for clustering algorithms that addresses the problem of indexing the terms of big datasets and their characteristics, a problem that exists in most current clustering approaches. The paper focuses on the term frequency normalization step of clustering algorithms. A new factor is applied to basic term-weighting schemes for use in the clustering process. The evaluation results confirm that this factor increases the performance of clustering techniques. The experiments were carried out with standard algorithms on the ODP-239 dataset, and the results were validated by statistical tests.
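
For readers unfamiliar with the term frequency normalization step mentioned above, the following is a minimal sketch of a basic term-weighting scheme of the kind the abstract refers to. It is an illustration only, not the factor proposed in the paper: it assumes plain TF-IDF with term frequency divided by snippet length, and the function name `tf_idf_weights` and the toy snippets are hypothetical.

```python
import math
from collections import Counter


def tf_idf_weights(docs):
    """Return one {term: weight} dict per document.

    docs: list of token lists (for example, tokenized search-result snippets).
    Assumption: length-normalized TF times log-scaled IDF, a common baseline,
    NOT the new factor introduced in the paper.
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        length = len(doc) or 1  # guard against empty snippets
        weights.append({
            # Normalized term frequency times inverse document frequency.
            term: (count / length) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


# Hypothetical toy snippets, used only to exercise the function.
snippets = [["apple", "apple", "pie"], ["apple", "tart"]]
weights = tf_idf_weights(snippets)
```

In this baseline a term that occurs in every snippet ("apple" here) receives weight zero, while rarer terms are weighted by their share of the snippet. A clustering algorithm would then compare snippets using these weight vectors.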