Research Article

Robust Printed Devanagari Document Recognition using Hybrid Approach of Shirorekha Chopping, Fuzzy Directional Features and Support Vector Machine

by Nitin Mishra, Ankur Agrawal
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 57 - Number 1
Year of Publication: 2012
Authors: Nitin Mishra, Ankur Agrawal
10.5120/9076-8727

Nitin Mishra, Ankur Agrawal. Robust Printed Devanagari Document Recognition using Hybrid Approach of Shirorekha Chopping, Fuzzy Directional Features and Support Vector Machine. International Journal of Computer Applications. 57, 1 (November 2012), 11-16. DOI=10.5120/9076-8727

@article{ 10.5120/9076-8727,
author = { Nitin Mishra and Ankur Agrawal },
title = { Robust Printed Devanagari Document Recognition using Hybrid Approach of Shirorekha Chopping, Fuzzy Directional Features and Support Vector Machine },
journal = { International Journal of Computer Applications },
issue_date = { November 2012 },
volume = { 57 },
number = { 1 },
month = { November },
year = { 2012 },
issn = { 0975-8887 },
pages = { 11-16 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume57/number1/9076-8727/ },
doi = { 10.5120/9076-8727 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:01:19.217705+05:30
%A Nitin Mishra
%A Ankur Agrawal
%T Robust Printed Devanagari Document Recognition using Hybrid Approach of Shirorekha Chopping, Fuzzy Directional Features and Support Vector Machine
%J International Journal of Computer Applications
%@ 0975-8887
%V 57
%N 1
%P 11-16
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

This paper presents a novel methodology for recognizing machine-printed documents in Devanagari script. Shirorekha-chopping-based preprocessing is used to segment the printed text into individual characters. Fuzzy Directional Features (FDF) have shown improvement over the commonly used directional features. A set of eight directional FDF is extracted for each character, which is then classified into the appropriate character class. A Support Vector Machine (SVM) model with a Radial Basis Function (RBF) kernel is trained on characters from multiple fonts and tested on the Devanagari documents to be recognized. Experiments are conducted on multi-font Devanagari document recognition. The recognition rate of the proposed OCR system on document images in Devanagari script is found to be 97.9% for two fonts, Mangal and Krutidev.
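
The abstract does not spell out how the Shirorekha chopping step is carried out. The following is a minimal sketch of one common way to do it, assuming a binarized word image stored as a list of rows (1 = ink, 0 = background): the header line is taken to be the rows whose ink count is close to the densest row of the horizontal projection, and characters are then separated at empty columns. The function names, the `frac` threshold, and the toy image are illustrative assumptions, not details taken from the paper.

```python
def find_shirorekha_rows(image, frac=0.8):
    """Return indices of rows whose ink count is at least `frac` of the densest row.

    `frac` is a hypothetical threshold chosen for this sketch.
    """
    profile = [sum(row) for row in image]  # horizontal projection: ink pixels per row
    peak = max(profile, default=0)
    if peak == 0:
        return []
    return [r for r, count in enumerate(profile) if count >= frac * peak]


def chop_shirorekha(image, frac=0.8):
    """Return a copy of the image with the detected header-line rows blanked."""
    header = set(find_shirorekha_rows(image, frac))
    return [[0] * len(row) if r in header else list(row)
            for r, row in enumerate(image)]


def split_columns(image):
    """Return (start, end) column spans of ink separated by empty columns."""
    if not image:
        return []
    width = len(image[0])
    has_ink = [any(row[c] for row in image) for c in range(width)]
    spans, start = [], None
    for col, ink in enumerate(has_ink):
        if ink and start is None:
            start = col                    # a new character span begins
        elif not ink and start is not None:
            spans.append((start, col))     # span ends at the first empty column
            start = None
    if start is not None:
        spans.append((start, width))
    return spans


# Toy "word": a header line across the top joins two glyph bodies.
word = [[0] * 9 for _ in range(6)]
word[1] = [1] * 9
for r in range(2, 6):
    for c in (1, 2, 5, 6, 7):
        word[r][c] = 1

print(split_columns(word))                   # [(0, 9)]
print(split_columns(chop_shirorekha(word)))  # [(1, 3), (5, 8)]
```

Before chopping, the header line makes the whole word a single connected span; after the header rows are blanked, the two glyph bodies separate at the empty columns between them. Real printed text would need more care than this sketch shows, for example with modifiers above or below the header line and with touching characters.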

Index Terms

Computer Science
Information Sciences

Keywords

Devanagari OCR, Shirorekha Chopping, Fuzzy Directional Features, Support Vector Machine