CFP last date
20 December 2024
Reseach Article

Recognition of Similar appearing Gujarati Characters using Fuzzy-KNN Algorithm

by Amit H. Choksi, Shital P. Thakkar
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 55 - Number 6
Year of Publication: 2012
Authors: Amit H. Choksi, Shital P. Thakkar
10.5120/8758-2666

Amit H. Choksi, Shital P. Thakkar . Recognition of Similar appearing Gujarati Characters using Fuzzy-KNN Algorithm. International Journal of Computer Applications. 55, 6 ( October 2012), 12-17. DOI=10.5120/8758-2666

@article{ 10.5120/8758-2666,
author = { Amit H. Choksi, Shital P. Thakkar },
title = { Recognition of Similar appearing Gujarati Characters using Fuzzy-KNN Algorithm },
journal = { International Journal of Computer Applications },
issue_date = { October 2012 },
volume = { 55 },
number = { 6 },
month = { October },
year = { 2012 },
issn = { 0975-8887 },
pages = { 12-17 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume55/number6/8758-2666/ },
doi = { 10.5120/8758-2666 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:56:33.961365+05:30
%A Amit H. Choksi
%A Shital P. Thakkar
%T Recognition of Similar appearing Gujarati Characters using Fuzzy-KNN Algorithm
%J International Journal of Computer Applications
%@ 0975-8887
%V 55
%N 6
%P 12-17
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

This paper describes the Optical Character Recognition of similar appearing characters of Gujarati language. Gujarati language is a type of Indian language. Recognition accuracy of Gujarati Script is affected by characters very similar in shape. Here, Fuzzy KNN classifier in pair with two different features Geometric and Wavelet features are used to handle this problem. Fuzzy KNN not only label the class of pattern to be identified, it also decides strength of that pattern for that class. This makes use of Fuzzy KNN for imprecise class boundary. The test data for similar appearing characters are collected from various sources like scanned pages of text books of Gujarati language, newspapers etc. Train data set is prepared by typing Gujarati characters in different font types and size and then scanned.

References
  1. R. O. Duda and P. E. Hart, Pattern Classification and scene Analysis. New York : Wiley,1973.
  2. T. M. Cover and P. E. Hart," Nearest Neighbour pattern classification," IEEE Trans. Inform. Theory, vol. 17,no. 1, pp. 15-28,1978.
  3. L. A. Zadeh, " Fuzzy Sets," Inf. Control, vol. 8, pp. 338-353,1965.
  4. R C Gonzalez and R E Woods. "Digital Image Processing". Publication Addison-Wesley, 1993.
  5. http://en. wikipedia. org/wiki/KNN algorithm.
  6. Dinesh Dilip, "A Feature Extraction Technique Based on Character Geometry for Character Recognition", Department of Electronics and Communication Engineering, Amrita School of Engineering, Kollam, Kerala.
  7. A. Yajnik, S. Rama Mohan, "Identification of Gujarati Characters Using Wavelets and Neural Network", Proc. Of 10th IASTED International Conference on Artificial Intelligence and Soft Computing, Acta press , 2006, pp. 150-155.
  8. Atul Negi, Chakravarthy Bhagvati, and B. Krishna. "An OCR System for Telugu". Proc. of 6th ICDAR, IEEE Computer Society, 2001, pp. 1110-1114.
  9. Jignesh Dholakia, Atul Negi, S. Rama Mohan, "Zone Identification in the Printed Gujarati Text", Proc. of 8th ICDAR, IEEE Computer Society, 2005, pp. 272-276.
  10. Sameer Antani, Lalitha Agnihotri, "Gujarati Character Recognition", Proc. 5th ICDAR, IEEE Computer Society, 1999, pp. 418-422.
  11. S. Rama mohan, A. Yajnik, "Gujarati Numeral Recognition Using Wavelets and Neural Network", Proc. Of 2ndIICAI, Pune, 2005, pp. 397-406.
  12. Lie Huang, Xiao Huang, "Multiresolution Recognition Of Offline Handwritten Chinese Characters With Wavelet Transform", Proc. 6th ICDAR, IEEE Computer Society, 2001, pp. 631-634.
  13. Atul Negi, Jignesh Dholakia, A. Yajnik, "Wavelet Feature Based Confusion Character Sets for Gujarati Script" International Conference on Computational Intelligence and Multimedia Applications, 2007.
  14. "Design and Implementation of Optical Character Recognition System to Recognize Gujarati Script using Template Matching"' by Prof S K Shah, A Sharma.
  15. J. M. Keller, M. R. Gray, and J. A. Givens, Jr. , "A Fuzzy K-Nearest Neighbor Algorithm", IEEE Transactions on Systems, Man, and Cybernetics, Vol. 15, No. 4, pp. 580-585.
  16. O D Trier, A K Jain and T Taxt. 'Feature Extraction Methods for Character Recognition – A Survey'. Pattern Recognition, vol 29, no 4, 1996,pp 641-662.
  17. D M Gavrila, D Benze. 'Multi Feature Hierarchical Template Matching using Distance Transforms'. Proceedings of ICDAR, 2001
  18. A. Hashizume, P. S. Yeh, A. Rosenfeld, "A method of detecting the orientation of aligned components", Pattern Recognition Letters, 1996, pp. 125-132.
  19. B. V. Dasarathy. Nearest neighbor (NN) norms, NN pattern classification techniques. 1991.
  20. U Pal, B B Choudhuri: Indian Script Character Recognition: A Survey of Pattern Recognition, Vol. 37,pp. 1887-1899, 2004.
  21. N. Sharma, U. Pal, and F. Kimura, "Recognition of Handwritten Kannada Numerals", Proc, of IEEE-ICIT 2006.
  22. W. K. Pratt. Digital Image Processing. Wiley Interscience, 1991.
  23. S. Tsujimoto and H. Asada. Major component of a complete text reading system. In L. O'Gorman and R. Kasturi, editors, Document Image Analysis, pages 298–314, 1995.
  24. H. S. Baird. Document Image Defect Models. In L. O'Gorman and R. Kasturi, editors, Document Image Analysis, pages 315–325, 1995.
  25. Arun K. Pujari, C. Dhananjay Naidu, M. Sreenivasa Rao, B. C. Jingara, "An Adaptive Character Recognizer for Telugu Scripts using Multiresolution Analysis and Associative Memory", Image Vision Computing 22(14), 2004 , pp. 1221-1227.
Index Terms

Computer Science
Information Sciences

Keywords

Optical Character Recognition Fuzzy KNN Wavelets