CFP last date
20 January 2025
Reseach Article

Article:Handwritten Document Retrieval System for Tamil Language

by AN. Sigappi, S. Palanivel, V. Ramalingam
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 31 - Number 4
Year of Publication: 2011
Authors: AN. Sigappi, S. Palanivel, V. Ramalingam
10.5120/3814-5268

AN. Sigappi, S. Palanivel, V. Ramalingam . Article:Handwritten Document Retrieval System for Tamil Language. International Journal of Computer Applications. 31, 4 ( October 2011), 42-47. DOI=10.5120/3814-5268

@article{ 10.5120/3814-5268,
author = { AN. Sigappi, S. Palanivel, V. Ramalingam },
title = { Article:Handwritten Document Retrieval System for Tamil Language },
journal = { International Journal of Computer Applications },
issue_date = { October 2011 },
volume = { 31 },
number = { 4 },
month = { October },
year = { 2011 },
issn = { 0975-8887 },
pages = { 42-47 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume31/number4/3814-5268/ },
doi = { 10.5120/3814-5268 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:17:17.346366+05:30
%A AN. Sigappi
%A S. Palanivel
%A V. Ramalingam
%T Article:Handwritten Document Retrieval System for Tamil Language
%J International Journal of Computer Applications
%@ 0975-8887
%V 31
%N 4
%P 42-47
%D 2011
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The paper attempts to create a handwritten document retrieval system suitable for Tamil language, with a view to record traditional literature content for future reference. It projects a search mechanism to access the query word images using a statistical model based methodology. The scheme revolves around a well defined procedure which results in word models from where the search word can be recognised and the relevant documents retrieved. The approach involves the use of hidden Markov models (HMM) to characterize the features of the dynamically varying strokes of handwritten characters. The strategy is investigated for a sample document set over a commonly used literature. The results reveal that the system yields a reasonable performance with considerable accuracy. The highlight of this procedure is that it can effectively segment differently written words from text lines in a document and imbibes in it a flexibility to cover a wide range of tilts in the strokes that are attached to the different words.

References
  1. Konstantinos Zagoris, Kavallieratou Ergina, Nikos Papamarkos. 2010. A document image retrieval system. Engineering Applications of Artificial Intelligence. Vol. 23, No.6, 872-879 .
  2. Huaigu Cao, Anurag Bhardwaj, Venu Govindaraju. 2009. A probabilistic method for keyword retrieval in handwritten document images. Pattern Recognition. Vol. 42, No.12, 3374-3382.
  3. G. Louloudis, B. Gatos, I. Pratikakis, C. Halatsis. 2009. Text line and word segmentation of handwritten documents. Pattern Recognition. Vol. 42, No.12, 3169-3183.
  4. Kaustubh Bhattacharyya, Kandarpa Kumar Sarma. 2009. ANN-based innovative segmentation method for handwritten test in Assamese. Intl. Journal of Computer Science Issues. Vol. 5, 9-16.
  5. Million Meshesha, C. V. Jawahar. 2008. Matching word images for content based retrieval from printed document images. International Journal on Document Analysis and Recognition. Vol. 11, No.1, 29-38.
  6. Jose A. Rodriguez, Florent Perronnin, Gemma Sanchez, Josep Llados. 2008. Unsupervised writer style adaptation for hand written word spotting. 19th International Conference on Pattern Recognition. 1-4.
  7. Rafael C. Gonzalez, Richard E. Woods. 2008. Digital Image Processing. Prentice Hall.
  8. Richard O. Duda, Peter E. Hart, David G. Stork. 2007. Pattern Classification. Wiley-India.
  9. Gregory R. Ball, Sargur N. Srihari, Harish Srinivasan. 2006. Segmentation-based and Segmentation-free methods for spotting handwritten Arabic words. 10th International Workshop on Frontiers in Handwriting Recognition. 53-58.
  10. Sargur Srihari, Chen Huang, Harish Srinivasan. 2005. A search engine for handwritten documents. In Proc. of Document Recognition and Retrieval. 66-75.
  11. Toni M. Rath, R. Manmatha. 2003. Features for word spotting in historical manuscripts. ICDAR. 218-222.
  12. O Due Trier, Anil K. Jain, Torfin Taxt. 1996. Feature extraction methods for character recognition: A Survey. Pattern Recognition. Vol. 29, No.4, 641-662.
  13. Lawrence R. Rabiner. 1989. A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE. Vol. 77, No.2, 257-285.
Index Terms

Computer Science
Information Sciences

Keywords

Handwritten document retrieval Profile features Segmentation Hidden Markov models