CFP last date
20 February 2025
Reseach Article

Hybrid Intelligence: Filtering and Deep Learning for Handwritten Text Recognition

by Arshad Iqbal, Shabbir Hassan, Maria Qamar
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 186 - Number 55
Year of Publication: 2024
Authors: Arshad Iqbal, Shabbir Hassan, Maria Qamar
10.5120/ijca2024924273

Arshad Iqbal, Shabbir Hassan, Maria Qamar . Hybrid Intelligence: Filtering and Deep Learning for Handwritten Text Recognition. International Journal of Computer Applications. 186, 55 ( Dec 2024), 20-31. DOI=10.5120/ijca2024924273

@article{ 10.5120/ijca2024924273,
author = { Arshad Iqbal, Shabbir Hassan, Maria Qamar },
title = { Hybrid Intelligence: Filtering and Deep Learning for Handwritten Text Recognition },
journal = { International Journal of Computer Applications },
issue_date = { Dec 2024 },
volume = { 186 },
number = { 55 },
month = { Dec },
year = { 2024 },
issn = { 0975-8887 },
pages = { 20-31 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume186/number55/hybrid-intelligence-filtering-and-deep-learning-for-handwritten-text-recognition/ },
doi = { 10.5120/ijca2024924273 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-12-27T02:45:45.139842+05:30
%A Arshad Iqbal
%A Shabbir Hassan
%A Maria Qamar
%T Hybrid Intelligence: Filtering and Deep Learning for Handwritten Text Recognition
%J International Journal of Computer Applications
%@ 0975-8887
%V 186
%N 55
%P 20-31
%D 2024
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Handwritten text recognition systems have gained substantial attention in pattern recognition and artificial intelligence due to their applications in digitizing historical documents and automated reading tasks. This study proposes a novel preprocessing approach for recognizing Kufic manuscripts, aiming to improve the accuracy of handwritten text segmentation. The approach consists of three stages: noise removal, thresholding, and additional noise removal after thresholding. Hybrid filtering, using Wiener and Median filters, effectively reduces noise in the input images. The u-Net deep learning model is employed for precise thresholding and segmentation of the handwritten text. The post-thresholding noise removal step refines the segmented regions and eliminates residual noise artifacts. The proposed approach contributes to enhancing the Quranic Kufic Manuscripts Recognition System (QKMRS) and advancing the digitization of historical documents and the preservation of cultural heritage. The performance measures for noise removal include Mean Square Error (MSE) and Peak Signal Noise Ratio (PSNR). For segmentation and thresholding Intersection over union (IoU), entropy and dice coefficient are used.

References
  1. Iqbal, A., Zafar, A.: Offline Handwritten Quranic Text Recognition: A Research Perspective. In: 2019 Amity International Conference on Artificial Intelligence (AICAI), pp. 125-128. IEEE (2019). https:/doi.org/10.1109/AICAI.2019.8701404
  2. Ronneberger, O., Fischer, P., & Brox, T. (2015). U-Net: Convolutional Networks for Biomedical Image Segmentation. Lecture Notes in Computer Science, 234–241. https://doi.org/10.1007/978-3-319-24574-4_28
  3. Smith, R. G., Sridharan, S., & Wang, S. (2006). Handwritten Word Recognition Using Adaptive Noise Filtering. Proceedings of the 18th International Conference on Pattern Recognition, 1050-1053.
  4. Liu, Z., Liu, Z., Luo, J., & Wang, G. (2017). Contrast Enhancement for Handwritten Document Images Based on Adaptive Histogram Equalization. Proceedings of the 3rd International Conference on Industrial Engineering, 141-147
  5. Otsu, N.: A threshold selection method from gray level histograms. IEEE Trans. on Systems, Man and Cybernetics (1979) 9:62-66.
  6. Sauvola, J., Seppanen, T., .Haapakoski, S., Pietikainen, M.: Adaptive Document Binarization. 4th Int. Conf. On Document Analysis and Recognition, Ulm, Germany (1997), pp. 147-152.
  7. Aghdam, H. H., Faez, K., Gheissari, N., & Dengel, A. (2017). A Robust Text-Line Extraction Method for Historical Handwritten Documents. International Journal on Document Analysis and Recognition, 20(3), 129-142.
  8. Nikolaos, N., Dimitrios, V.: A Binarization Algorithm for Historical Manuscripts. 12th Wseas International Conference on Communications. pp. 41-51 (2008). ISSN: 1790-5117.
  9. Yahya, S. R.,Sheikh Abdullah, S. N. H.,Omar, K., Zakaria, M. S., Liong, C. Y. .: Review on image enhancement methods of old manuscript with the damaged background. International Journal on Electrical Engineering and Informatics. 2(1), 62 - 67 (2010).
  10. Al-Harahsheh, M., Al-Saleh, N., & Abuhaiba, I. (2018). Kufic Manuscript Text Line Segmentation Using Modified Projection Profile. International Journal of Computer Science and Network Security, 18(6), 227-234.
  11. Abuhaiba, I., Al-Harahsheh, M., & Al-Saleh, N. (2020). Enhancing the Quality of Kufic Manuscripts Using Image Processing Techniques. International Journal of Computer Science Issues, 17(2), 366-375.
  12. Almazroi, I., Alquran, M., Azeemi, A., & Shafait, F. (2019). Inpainting Degraded Manuscripts for Arabic Script Recognition. Proceedings of the 8th International Conference on Pattern Recognition Systems, 1-6.
  13. Zafar, A., Iqbal, A.: Application of Soft Computing Techniques in Machine Reading of Quranic Kufic Manuscripts. Journal of King Saud University - Computer and Information Sciences (2020). https://doi.org/10.1016/j.jksuci.2020.04.017.
  14. Fred, A. L., Kumar, S.N., Kumar, A. H., Daniel, A. V., Abisha, W.: Evaluation of local thresholding techniques in Palm-leaf Manuscript images. International Journal of Computer Sciences and Engineering. 6(4), 124-131 (2018). https:/doi.org/10.26438/ijcse/v6i4.124131.
  15. Bouillon, M., Ingold, R., Liwicki, M.: Grayification: A meaningful grayscale conversion to improve handwritten historical documents analysis, Pattern Recognition Letters, 2019.,
  16. Baig, A.,Al-Ma’adeed, S. A.S., Bouridane, A.,Cheriet, M.: Automatic segmentation and reconstruction of historical manuscripts in gradient domain. IET Image Process (2018). 12(4), pp. 502-512
  17. Xiong, W.,Zhou, L., L.,Yue, L., Li, L., Wang, S.: An enhanced binarization framework for degraded historical document images. EURASIP Journal on Image and Video Processing (2021). 2021:13, pp. 1-24. https://doi.org/10.1186/s13640-021-00556-4
  18. Zafar, A, Iqbal, A.: Machine Reading of Arabic Manuscripts using KNN and SVM Classifiers. In: Proceedings of the 14th INDIACom, 2020 7th International Conference on Computing for Sustainable Global Development (2020). pp. 83-87. https://doi.org/10.23919/INDIACom49435.2020. 9083696.
  19. Isa, I. S., Sulaiman, S. N., Mustapha, M., Darus, S.: Evaluating Denoising Performances of Fundamental Filters for T2- Weighted MRI Images. 19th International Conference on Knowledge Based and Intelligent Information and Engineering Systems (2015), pp. 760-768.
  20. Michalak, H., Okarma, K.: Improvement of Image Binarization Methods Using Image Preprocessing with Local Entropy Filtering for Alphanumerical Character Recognition Purposes. Entropy 2019, 21, 562. https://doi.org/10.3390/e21060562
  21. Niblack, W.: An Introduction to Digital Image Processing, Prentice Hall (1986), pp. 115-116.
  22. Michalak, H., Okarma, K.: Improvement of Image Binarization Methods Using Image Preprocessing with Local Entropy Filtering for Alphanumerical Character Recognition Purposes. Entropy 2019, 21, 562. https://doi.org/10.3390/e21060562
  23. Zhang, Y., Cheng, H. D., Huang, J. Tang, X..: An effective and objective criterion for evaluating the performance of denoising filters. Pattern Recognit (2012). 45(7): 2743–2757
  24. Westpha, F., Grahn, H., Lavesson, N.: Efficient document image binarization using heterogeneous computing and parameter tuning. International Journal on Document Analysis and Recognition (IJDAR) (2018), 21:41–58. https://doi.org/10.1007/s10032-017-0293-7
  25. Karim, A., Kadhm, M S.: An Efficient Image Thresholding Method for Arabic Handwriting Recognition System. Eng & Tech. Journal (2016). 34(1):26-34.
  26. He, K., Gkioxari, G., Dollár, P., & Girshick, R. (2017). Mask R-CNN. In Proceedings of the IEEE International Conference on Computer Vision (pp. 2961-2969). https://doi.org/10.1109/ICCV.2017.322.
Index Terms

Computer Science
Information Sciences

Keywords

Filtering Kufic Manuscript Median Filter Thresholding U-net Wiener Filter.