International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 177 - Number 37 |
Year of Publication: 2020 |
Authors: Pradyumna Mukunda, Niraj S. Prasad, Mamatha H. R. |
10.5120/ijca2020919885 |
Pradyumna Mukunda, Niraj S. Prasad, Mamatha H. R. . KanOCR: Conversion of Printed Kannada Document to Editable form using Convolutional Neural Networks. International Journal of Computer Applications. 177, 37 ( Feb 2020), 51-58. DOI=10.5120/ijca2020919885
Optical Character Recognition (OCR) technology in converting an image containing text to an editable text format is of high sense in document image processing. Input to OCR could be a scanned document, or a simple newspaper cut-out. Supervised Learning using Neural Networks yield the output with greater accuracy. Unlike English, Kannada Language has a huge set of characters as it includes kaagunithas, vattaksharas, etc. This makes recognition of the characters much more complex. The paper mainly concentrates on OCR for the Kannada Text which goes through a threshold as a first step converting input image into binary image, making segmentation easier. Characters can be extracted from the documents using various Segmentation methods. The vattaksharas are extracted/differentiated from the words by using base-line technique. When the characters are recognized, they are compared with Unicodes available on the system and then printed. In the above method, CNN plays a pivotal role in reading the character and comparing it with the Unicode look up table values to print the output. This system has been tested with varying fonts. A total number of 37 sample documents are used for experimentation. The system has been developed for only printed Kannada Text.