CFP last date
20 January 2025
Reseach Article

Word Spotting and Character Recognition using Quadrant Density and Aspect Ratio

by Nivas K S, Preethi V
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 66 - Number 15
Year of Publication: 2013
Authors: Nivas K S, Preethi V
10.5120/11161-6278

Nivas K S, Preethi V . Word Spotting and Character Recognition using Quadrant Density and Aspect Ratio. International Journal of Computer Applications. 66, 15 ( March 2013), 24-28. DOI=10.5120/11161-6278

@article{ 10.5120/11161-6278,
author = { Nivas K S, Preethi V },
title = { Word Spotting and Character Recognition using Quadrant Density and Aspect Ratio },
journal = { International Journal of Computer Applications },
issue_date = { March 2013 },
volume = { 66 },
number = { 15 },
month = { March },
year = { 2013 },
issn = { 0975-8887 },
pages = { 24-28 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume66/number15/11161-6278/ },
doi = { 10.5120/11161-6278 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:23:27.037265+05:30
%A Nivas K S
%A Preethi V
%T Word Spotting and Character Recognition using Quadrant Density and Aspect Ratio
%J International Journal of Computer Applications
%@ 0975-8887
%V 66
%N 15
%P 24-28
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The desire to access, search and explore large amount of documents had paved the way for digitizing and storing the document in computer for easy access. But doing a word search (word spotting) in a scanned document is difficult, since the entire document is saved as an image. Different methods are already proposed which recognizes the characters from the scanned document and converts it into a text document in which the word spotting is done. In proposed method, a set of features are extracted from each character and the feature vector is converted to a floating point value. This floating point is a combination of quadrant densities obtained from the character and its aspect ratio with their respective importance. Now using this floating point recognizing the character with reference to a trained set of floating point values can be done. Now when the user searches for a certain word, spatial adjacency algorithm is used to spot the searched keyword directly in the image. Character recognition is one of the most dynamic part of today's artificial intelligence systems. Here the proposing system analysis the similarity between various characters in a language and trains itself to identify and understand similar characters using the previously learned data.

References
  1. Stefan Klink, German Research centre for the Artificial Intelligence (DFKI,GmbH),Germany
  2. Andreas Dengel, Thomas Kieninger German Research canter for the Artificial Intelligence (DFKI, GmbH), Germany
  3. Sami Lais is a freelance writer in Takoma Park, Md.
  4. T. M. Rath, R. Manmatha et al. [trath,manmatha] @ cs. umass. edu
  5. T. M. Rath and R. Manmatha, "Word spotting for historical documents", International Journal on Document Analysis and Recognition (IJDAR), Vol. 9, No 2 – 4, pp. 139– 152 , 2006.
  6. V. Lavrenko, T. M. Rath, R. Manmatha: "Holistic Word Recognition for Handwritten Historical Documents", Proceedings of the First International Workshop on Document Image Analysis for Libraries (DIAL'04),pp 278- 287, 2004.
  7. Niblack, W. , "An Introduction to Digital Image Processing", pp. 115–116. Prentice Hall, Englewood Cliffs, NJ, (1986).
  8. A. Bhardwaj, D. Jose, and V. Govindaraju. Script inde- pendent word spotting in multilingual documents. 2nd Intl Workshop on Cross Lingual Information Access, pages 48–54, 2008
  9. A. Fischer, A. Keller, V. Frinken, and H. Bunke. Lexicon-free handwritten word spotting using character hmms. Pattern Recogn. Lett. , 33(7):934–942, May 2012.
  10. R. Jayadevan, S. R. Kolhe, P. M. Patil, and U. Pal. Database development and recognition of handwritten devanagari legal amount words. Document Analysis and Recognition, International Conference on, 0:304–308, 2011.
  11. R. Saabni and J. El-Sana. Keyword searching for Arabic handwritten documents. 11th International Conference on Frontiers in Handwriting recognition (ICFHR2008), pages 716–722, 2008.
  12. S. N. Srihari, H. Srinivasan, C. Huang, and S. Shetty. Spotting words in latin, devanagari and arabic scripts. Artificial Intelligence, page 2006
Index Terms

Computer Science
Information Sciences

Keywords

Word Spotting