International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 46 - Number 8 |
Year of Publication: 2012 |
Authors: A. Alajmi, E. M. Saad, R. R. Darwish |
10.5120/6926-9341 |
A. Alajmi, E. M. Saad, R. R. Darwish . Toward an ARABIC Stop-Words List Generation. International Journal of Computer Applications. 46, 8 ( May 2012), 8-13. DOI=10.5120/6926-9341
Over the past decades systems for automatic management of electronic documents have been one of the main fields of research. Text processing is a wide area that includes many important disciplines. In the processes of organizing unstructured text in order to implement a mining technique, preprocessing has to be applied. One of the most important preprocessing techniques is the removal of functional words which affects the performance of text mining tasks. In this paper, a statistical approach is presented to extract Arabic stop-words list. The extracted list was compared to a general list. The comparison yield an improvement in an ANN based classifier using the generated stop-words list over the general list.