Research Article

Extraction of Characters and Modifiers from Handwritten Gujarati Words

by Chhaya Patel, Apurva Desai
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 73 - Number 3
Year of Publication: 2013
Authors: Chhaya Patel, Apurva Desai
10.5120/12719-9541

Chhaya Patel, Apurva Desai. Extraction of Characters and Modifiers from Handwritten Gujarati Words. International Journal of Computer Applications. 73, 3 (July 2013), 7-12. DOI=10.5120/12719-9541

@article{ 10.5120/12719-9541,
author = { Chhaya Patel and Apurva Desai },
title = { Extraction of Characters and Modifiers from Handwritten Gujarati Words },
journal = { International Journal of Computer Applications },
issue_date = { July 2013 },
volume = { 73 },
number = { 3 },
month = { July },
year = { 2013 },
issn = { 0975-8887 },
pages = { 7-12 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume73/number3/12719-9541/ },
doi = { 10.5120/12719-9541 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:39:02.794538+05:30
%A Chhaya Patel
%A Apurva Desai
%T Extraction of Characters and Modifiers from Handwritten Gujarati Words
%J International Journal of Computer Applications
%@ 0975-8887
%V 73
%N 3
%P 7-12
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Research on Optical Character Recognition (OCR) for most Indian languages remains limited, and Gujarati is one of the scripts for which very little OCR literature is available. This paper describes one important phase of OCR: the segmentation of handwritten words into their basic components, namely basic characters, conjunct characters, and modifiers, which are essential for recognizing a word. The paper describes methods for identifying the zone boundaries of a word and for using those boundaries to segment the word into its subcomponents. Connected component labeling is applied to detect the subcomponents of a word, which can be dissected further if needed to obtain other subcomponents. This is the first attempt to dissect handwritten Gujarati words into their subcomponents.
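
The connected component labeling step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the 8-connectivity choice, the function names (`label_components`, `bounding_boxes`, `zone_of`), the toy image, and the zone boundary rows are all assumptions made for illustration. The sketch labels connected foreground pixels in a binarized word image and then assigns each component to a zone based on where its bounding box falls relative to two given boundary rows.

```python
from collections import deque


def label_components(image):
    """Label 8-connected foreground pixels (value 1). Returns (labels, count)."""
    rows, cols = len(image), len(image[0])
    labels = [[0] * cols for _ in range(rows)]
    count = 0
    for r in range(rows):
        for c in range(cols):
            if image[r][c] != 1 or labels[r][c] != 0:
                continue
            # Start a new component and flood-fill it breadth-first.
            count += 1
            labels[r][c] = count
            queue = deque([(r, c)])
            while queue:
                y, x = queue.popleft()
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and image[ny][nx] == 1
                                and labels[ny][nx] == 0):
                            labels[ny][nx] = count
                            queue.append((ny, nx))
    return labels, count


def bounding_boxes(labels):
    """Return {label: (top, left, bottom, right)}, coordinates inclusive."""
    boxes = {}
    for r, row in enumerate(labels):
        for c, lab in enumerate(row):
            if lab == 0:
                continue
            if lab not in boxes:
                boxes[lab] = (r, c, r, c)
            else:
                top, left, bottom, right = boxes[lab]
                boxes[lab] = (min(top, r), min(left, c),
                              max(bottom, r), max(right, c))
    return boxes


def zone_of(box, upper_boundary, lower_boundary):
    """Assign a zone from a bounding box (hypothetical rule, for illustration).

    upper_boundary and lower_boundary are the first and last rows of the
    middle zone; they are assumed to be known from a prior zone
    identification step.
    """
    top, _, bottom, _ = box
    if bottom < upper_boundary:
        return "upper"
    if top > lower_boundary:
        return "lower"
    return "middle"


# Toy binarized word: one mark above, two glyph bodies, one mark below.
WORD = [
    [0, 1, 1, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0],
    [1, 1, 0, 0, 1, 1, 0],
    [1, 0, 0, 0, 1, 0, 0],
    [1, 1, 0, 0, 1, 1, 0],
    [0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 1, 0],
]

labels, count = label_components(WORD)
boxes = bounding_boxes(labels)
zones = {lab: zone_of(box, upper_boundary=2, lower_boundary=4)
         for lab, box in boxes.items()}
```

In real handwriting a modifier often touches the base character, so a single component can span more than one zone. Such components would need the further dissection the abstract mentions, which this sketch does not attempt.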

Index Terms

Computer Science
Information Sciences

Keywords

zone identification, upper zone, lower zone, middle zone, distance transform, shirolekha, character extraction, modifier extraction, connected component labeling