CFP last date
20 December 2024
Reseach Article

Classification of Hadiths using LVQ based on VSM Considering Words Order

by Mohamed Ghanem, Abdelaaziz Mouloudi, Mohammed Mourchid
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 148 - Number 4
Year of Publication: 2016
Authors: Mohamed Ghanem, Abdelaaziz Mouloudi, Mohammed Mourchid
10.5120/ijca2016911077

Mohamed Ghanem, Abdelaaziz Mouloudi, Mohammed Mourchid . Classification of Hadiths using LVQ based on VSM Considering Words Order. International Journal of Computer Applications. 148, 4 ( Aug 2016), 25-28. DOI=10.5120/ijca2016911077

@article{ 10.5120/ijca2016911077,
author = { Mohamed Ghanem, Abdelaaziz Mouloudi, Mohammed Mourchid },
title = { Classification of Hadiths using LVQ based on VSM Considering Words Order },
journal = { International Journal of Computer Applications },
issue_date = { Aug 2016 },
volume = { 148 },
number = { 4 },
month = { Aug },
year = { 2016 },
issn = { 0975-8887 },
pages = { 25-28 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume148/number4/25746-2016911077/ },
doi = { 10.5120/ijca2016911077 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:52:26.537889+05:30
%A Mohamed Ghanem
%A Abdelaaziz Mouloudi
%A Mohammed Mourchid
%T Classification of Hadiths using LVQ based on VSM Considering Words Order
%J International Journal of Computer Applications
%@ 0975-8887
%V 148
%N 4
%P 25-28
%D 2016
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The religion of Islam is based on a sacred text called Qur’an, a divine speech expressed in Arabic language. Qur’an constitutes the main root of Islam jurisprudence which has a second source of inspiration known as Hadiths. As the Muslim’s life is governed by those holy texts, need of their authenticity is required. Using VSM (Vector Space Model), we can represent Hadiths as a vector of words. The Term Weighting obtained by multiplying term frequency by the inverse document frequency does not take into account the word order, however, order of narrators is critical to classify Hadith. In this paper we propose a new method considering the words order (in our case the narrator’s order), to classify Hadiths into four categories: Sahih, Hasan, Da’if and Maudu’. We use in this purpose LVQ (Learning Vector Quantization). We got good results for classifying Sahih and Maudu’ categories.

References
  1. El Kourdi, M., Bensaid, A., Rachidi, T.E. (2004). Automatic Arabic Document Categorization Based on the Naïve Bayes Algorithm. 20th International Conference on Computational Linguistics, Geneva.
  2. Mesleh, A. (2007). Chi Square Feature Extraction Based Svms Arabic Language Text Categorization System. Journal of Computer Science, 3(6), pp. 430-435.
  3. Sawaf, H., Zaplo, J., Ney, H. (2001). Statistical Classification Methods for Arabic News Articles. the Arabic Natural Language Processing Workshop (ACL2001), Toulouse, France.
  4. Martı́n-Valdivia, M.T., Garcı́a-Vega, M., Ureña-López, L.A. (2003). LVQ for text categorization using a multilingual linguistic resource. Neurocomputing, 55(3), 665-679.
  5. Harrag, F., El-Qawasmah, E. (2009). Neural Network for Arabic text classification. Applications of Digital Information and Web Technologies, ICADIWT'09, Second International Conference on the IEEE pp. 778-783.
  6. Kashif, B., Sajjad, Mohsin. (2012). Muhadith: A Cloud Based Distributed Expert System for Classification of Ahadith. Frontiers of Information Technology (FIT), 10th International Conference, pp. 73-78.
  7. Karim, N., Hazmi, N. (2005). Assessing Islamic information quality on the Internet: A case of information about hadith. Malaysian Journal of Library and Information Science, 10.2, 51.
  8. Hernández-Reyes, E., Martínez-Trinidad, J.F., Carrasco-Ochoa, J.A., García-Hernández, R.A. (2006). Document representation based on maximal frequent sequence sets. Progress in Pattern Recognition, Image Analysis and Applications, pp. 854-863. Springer Berlin Heidelberg.
  9. Salton, G., McGill, M. (1983). Introduction to Modern Information Retrieval. New York.
  10. Al-Shalabi, R., Kanaan, G., Gharaibeh, M. (2006). Arabic Text Categorization Using kNN Algorithm. the Int. multi conf. on computer science and information technology.
  11. Kohonen, T. (1997). Learning vector quantization. In: Self-Organizing Maps, p. 203-217. Springer Berlin Heidelberg.
  12. Martín-Valdivia, M.T., Ureña-López, L.A., García-Vega, M. (2007). The Learning Vector Quantization Algorithm Applied to Automatic Text Classification Tasks. Neural Networks, 20(6), 748-756.
Index Terms

Computer Science
Information Sciences

Keywords

Arabic Natural Language Processing Learning vector quantization Term Weighting Text categorization Vector Space Model.