CFP last date
20 January 2025
Reseach Article

A Novel Approach to Recognition of the Isolated Persian Characters using Decision Tree

by Mir Mohammad Alipour
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 66 - Number 12
Year of Publication: 2013
Authors: Mir Mohammad Alipour
10.5120/11134-6211

Mir Mohammad Alipour . A Novel Approach to Recognition of the Isolated Persian Characters using Decision Tree. International Journal of Computer Applications. 66, 12 ( March 2013), 14-20. DOI=10.5120/11134-6211

@article{ 10.5120/11134-6211,
author = { Mir Mohammad Alipour },
title = { A Novel Approach to Recognition of the Isolated Persian Characters using Decision Tree },
journal = { International Journal of Computer Applications },
issue_date = { March 2013 },
volume = { 66 },
number = { 12 },
month = { March },
year = { 2013 },
issn = { 0975-8887 },
pages = { 14-20 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume66/number12/11134-6211/ },
doi = { 10.5120/11134-6211 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:22:10.936649+05:30
%A Mir Mohammad Alipour
%T A Novel Approach to Recognition of the Isolated Persian Characters using Decision Tree
%J International Journal of Computer Applications
%@ 0975-8887
%V 66
%N 12
%P 14-20
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Optical Character Recognition (OCR) is an area of research that has attracted the interest of researchers for the past forty years. Although the subject has been the center topic for many researchers for years, it remains one of the most challenging and exciting areas in pattern recognition. Because of the cursive nature of Persian language, recognition of its characters is more difficult than Latin or Chinese language. In this paper we propose a novel method to recognize the isolated characters of Persian language using decision tree based on structural features of characters. The system has been tested on a database including all letters of Persian language and a recognition rate of 90. 56% has been achieved. Our experimental recognition results are encouraging and confirm our expectation that the use of structural features is an interesting issue of Persian character recognition.

References
  1. J. Mantas, "An Overview of Character Recognition Methodologies", Pattern Recognition 19, 1986, pp. 425-430.
  2. R. M. Bozinovic and S. N. Shihari, "Off Line Cursive Script Word Recognition", IEEE Trans. Pattern Anal. Mach. Intell. PAMI 11, 1989, pp. 68-83.
  3. R. Casey and G. Nagy, "Automatic Reading Machine", IEE Trans. Comput. 17, 1968, pp. 492-503.
  4. Amin, A. : Off-line Arabic character recognition: the state of the art. Pattern Recognition. 1998, 31(5), 517–530
  5. Gouda, A. M. , Rashwan, M. A. : Segmentation of connected Arabic characters using hidden Markov models. IEEE International Conference on Computational Intelligence for Measurement Systems and Applications, USA 2004, pp. 115–119
  6. Kurdy, B. , AlSabbagh, M. : Omnifont Arabic optical character recognition system. In: Proceedings of International Conference on Information and Communication Technologies: From Theory to Applications, pp. 2004, 469–470
  7. Khosravi, H. , Kabir, E. : Introducing a very large dataset of handwritten Farsi digits and a study on their varieties. Pattern Recognit. Lett. 2007, 28(10), 1133–1141
  8. Mansoory, S. , Hassibi, H. , Rajabi, F. : A heuristic Persian handwritten digit recognition with neural network. In: The 6th Iranian Conference on Electrical Engineering, 1998, pp. 131–135
  9. Soltanzadeh, H. , Rahmati, M. : Recognition of Persian handwritten digits using image profiles of multiple orientations. Pattern Recognit. Lett. 2004, 25(14), 1569–1576
  10. Mozaffari, S. and H. Soltanizadeh, 2009. ICDAR 2009. handwritten Farsi/Arabic character recognition competition. Proceedings of the 10th International Conference on Document Analysis and Recognition, July 26-29, IEEE Xplore, Barcelona, 2009, pp: 1413-1417. DOI: 10. 1109/ICDAR. 283
  11. M. Alipour, "A New Approach to Segmentation of Persian Cursive Script based on Adjustment the Fragments," International Journal of Computer Applications 2013, Vol. 64, No 11, pp. 21–26.
  12. Azmi, R. , Kabir, E. : A new segmentation technique for omnifont Farsi text. Pattern Recognit. Lett. 2001, 22, 97–104
  13. Ebrahimi, A. , Kabir, E. : A pictorial dictionary for printed Farsi subwords. Pattern Recognit. Lett. 2008, 29(5), 656–663
  14. Mehran, R. , Pirsiavash, H. , Razzaziy, F. : A front-end OCR for omni-font Persian/Arabic cursive printed documents. Digital Imaging Computing: Techniques and Applications, 2005, pp. 385–392
  15. Parhami, B. , Taraghi, M. : Automatic recognition of printed Farsi texts. Pattern Recognit. Lett. 1981, 14, 395–403
  16. N. Otsu, A threshold selection method from Gray-level histogram, IEEE Trans. Systems Man Cybernet. 9 (1) 1979, 62-66.
  17. W. H. Tsai, Moment-preserving thresholding: a new approach, Comput. Vision Graphics Image Process. 29, 1985, 377-393.
  18. C. Gonzales. Rafael and E. Richard,Woods. , Digital Image Processing. 2nd ed. Englewood Cliffs, NJ: Prentice-Hall, 2002.
  19. A. Amin, W. H. Wilson, Hand-printed character recognition system using artificial neural network, Proceeding of second International Conference on Document Analysis and Recognition, 1993, pp. 943-946.
  20. B. K. Jang and R. T. Chin, "Analysis of thinning algorithms using mathematical morphology,"IEEE Trans. Patt. Anal. Machine Intell. , 1990, vol. PAMI-12, no. 6, pp. 541-551.
  21. B. Timsari, Character recognition in typed Persian words: a morphological approach, M. S. thesis, Isfahan Univ. of Tech. 1992, Iran.
  22. R. Safabakhsh, and P. Adibi. Nastaaligh Handwritten Word Recognition Using a Continuous-Density Variable-Duration HMM. The Arabian Journal for Science and Engineering. 2005, 30: 95-118. April.
  23. H. Goraine, M. Usher, and S. Al-Emami. Off-Line Arabic Character Recognition,? Computer, 1992, vol. 25, pp. 71-74.
  24. B. AL -Badr and S. Mahmoud. Survey and bibliography of Arabic optical text recognition. Signal Processing, 1995, 41(1): 49-77.
  25. F. Zaki, S. Elkonyaly, A. Elfattah, and Y. Enab. A new technique for arabic handwriting recognition. Proceedings of the 11th International Conference for Statistics and Computer Science, Cairo, Egypt, 1986, pp; 171–180.
  26. A. Rosenfeld and A. Kak, Digital Picture Processing, Academic Press, New York, 1976.
  27. R. El-Hajj , L. Likforman-Sulem, C. Mokbel, "Arabic Handwriting Recognition Using Baseline Dependant Features and Hidden Markov Modeling", Proceedings of the 8th International Conference on Document Analysis and Recognition (ICDAR'05), Seoul, Korea, 2005
  28. Surhone, L. M. , M. T. Tennoe and S. F. Henssonow. Randomized Hough Transform. 1st Edn. , VDM Verlag Dr. Mueller AG and Co. Kg, Germany, ISBN-10: 6134695823, 2010, pp: 92.
  29. A. Dehghani, F . Shabani and P. Nava. Off-Line Recognition of Isolated Persian Handwritten Characters Using Multiple HiddenMarkov Models, Proc. Int'l Conf. Information Technology: Coding and Computing, 2001, pp. 506-510.
Index Terms

Computer Science
Information Sciences

Keywords

Cursive Script Persian Isolated Character Recognition Classification Decision Tree