International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 51 - Number 2 |
Year of Publication: 2012 |
Authors: Mahmud S. Alkoffash |
10.5120/8012-0675 |
Mahmud S. Alkoffash . Automatic Arabic Text Clustering using K-means and K-mediods. International Journal of Computer Applications. 51, 2 ( August 2012), 5-8. DOI=10.5120/8012-0675
In this study we have implemented the Kmeans and Kmediods algorithms in order to make a practical comparison between them. The system was tested using a manual set of clusters that consists from 242 predefined clustering documents. The results showed a good indication about using them especially for Kmediods. The average precision and recall for Kmeans compared with Kmediods are 0. 56, 0. 52, 0. 69 and 0. 60 respectively. we have also extract feature set of keywords in order to improve the performance, the result illustrates that two algorithms can be applied to Arabic text, a sufficient number of examples for each category, the selection of the feature space, the training data set used and the value of K can enormously affect the accuracy of clustering.