International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 73 - Number 3 |
Year of Publication: 2013 |
Authors: Chhaya Patel, Apurva Desai |
10.5120/12719-9541 |
Chhaya Patel, Apurva Desai . Extraction of Characters and Modifiers from Handwritten Gujarati Words. International Journal of Computer Applications. 73, 3 ( July 2013), 7-12. DOI=10.5120/12719-9541
The research activity related to Optical Character Recognition (OCR) for almost all Indian languages is very less. Gujarati script is one of the scripts for which very less literature is available, as far as OCR activities are concerned. This paper describes one of the important phase of OCR, segmentation of handwritten words into its basic components namely basic characters, conjunct characters and modifiers, which are essential for recognition of a word. The paper describes methods for identification of zone boundaries for a word and usage of zone boundaries details for segmenting the word into its subcomponents. Connected component labeling is applied to detect subcomponents of a word, which can be further dissected if needed to obtain other subcomponents of word. It is the first attempt to dissect handwritten Gujarati words into its subcomponents.