CFP last date
20 January 2025
Reseach Article

Automatic Processing of Structured Handwritten Documents: An Application for Indian Railway Reservation System

by Sandip Rakshit, Soumya Sona Das, Kalyan S Sengupta, Subhadip Basu
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 6 - Number 11
Year of Publication: 2010
Authors: Sandip Rakshit, Soumya Sona Das, Kalyan S Sengupta, Subhadip Basu
10.5120/1115-1460

Sandip Rakshit, Soumya Sona Das, Kalyan S Sengupta, Subhadip Basu . Automatic Processing of Structured Handwritten Documents: An Application for Indian Railway Reservation System. International Journal of Computer Applications. 6, 11 ( September 2010), 26-30. DOI=10.5120/1115-1460

@article{ 10.5120/1115-1460,
author = { Sandip Rakshit, Soumya Sona Das, Kalyan S Sengupta, Subhadip Basu },
title = { Automatic Processing of Structured Handwritten Documents: An Application for Indian Railway Reservation System },
journal = { International Journal of Computer Applications },
issue_date = { September 2010 },
volume = { 6 },
number = { 11 },
month = { September },
year = { 2010 },
issn = { 0975-8887 },
pages = { 26-30 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume6/number11/1115-1460/ },
doi = { 10.5120/1115-1460 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T19:55:08.658541+05:30
%A Sandip Rakshit
%A Soumya Sona Das
%A Kalyan S Sengupta
%A Subhadip Basu
%T Automatic Processing of Structured Handwritten Documents: An Application for Indian Railway Reservation System
%J International Journal of Computer Applications
%@ 0975-8887
%V 6
%N 11
%P 26-30
%D 2010
%I Foundation of Computer Science (FCS), NY, USA
Abstract

An effective document processing system must be able to recognize structured and semi structured forms that is written by different persons’ handwriting. In this work we have developed a method and system that can process structured form document layout and recognize its contents. Our approach has been applied here in the context of Indian railway reservation/cancellation requisition system with encouraging results. In reality, handwritten data usually touch or cross the preprinted form frames and texts, creating complex problems for the recognition routines. In this paper, we address these issues and attempted to solve the problem for Indian Railway Reservation system using our custom built form processing software and Tesseract open source character recognition engine.

References
  1. http://www.indianrail.gov.in/..
  2. Yaakov Navon, Ella Barkan, Boaz Ophir, "A Generic Form Processing Approach for Large Variant Templates," icdar, pp.311-315, 2009 10th International Conference on Document Analysis and Recognition, 2009.
  3. B. Yu and A. K. Jain, "A Generic System for Form Dropout", IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol. 18, No. 11, Nov. 1996, pp. 1127-1134.
  4. Y. Belaid, et al., “Item Searching in Forms: Application to French Tax Form”, Int. Conf. on Document Analysis and Recognition, Aug. 1995, pp. 744-747.
  5. C.D. Yan, Y.Y. Tang, and C.Y. Suen, “Form Understanding System Based on Form Description Language”, Int. Conf. on Document Analysis andRecognition, Oct. 1991, pp. 283-293.
  6. K. Fan and M. Chang, “Form document identification using line structure based features”, Proc. Int. Conf. on Pattern Recognition, Vol. 2, Aug. 1998, pp. 1098 – 1100
  7. H. Fujisawa, Y. Nakano, and K. Kurino, “Segmentation Methods for Character Recognition: From Segmentation to Document Structure Analysis”, Proc. of the IEEE, Vol. 80, No. 7, 1992, pp. 1079-1092.
  8. Hiroshi Sako et al., “Form Reading based on Form-type Identification and Form-data Recognition”, Int. Conf. on Doc. Ana. and Recognition, Aug. 2003, Vol. 2, pp. 926-930.
  9. X. Ye, M. Cheriet and C.Y. Suen, “A generic method of cleaning and enhancing handwritten data from business forms,” International Journal on Document Analysis and Recognition, vol. 4, pp. 84-96, 2001.
  10. http://www.smartform.com/.
  11. http://code.google.com/p/tesseract-ocr
  12. R. Smith. “An overview of the Tesseract OCR engine”. In ICDAR’2007, International Conference on Document Analysis and Recognition, Curitiba, Brazil, Sept. 2007
  13. S.Rakshit, A. Kundu, M. Maity,S. Mandal, S. Sarkar, S. Basu, “Recognition of handwritten Roman Numerals using Tesseract open source OCR engine” Second International Conference on Advances in Computer Vision and Information Technology (ACVIT 2009) pp572-577.
  14. S. Rakshit,S. Basu, Hisashi Ikeda “ Recognition of Handwritten Textual Annotations Using Tesseract Open Source OCR Engine FOR information Just in Time (iJIT) In Proc.of International Conference on Information Technology and business Intelligence(ITBI-09).
  15. S.Rakshit, S. Basu, “Development of a Multiuser Handwritten Recognition System Using Tesseract Open source OCR” in proc. of C3IT-2009 An International conference, pp.240-247 .Proceedings published by Macmillan advanced Research Series, ISBN NO: 023-063-759-0
  16. S. Rakshit, S. Basu, “Recognition of Handwritten Roman Script Using Tesseract Open source OCR Engine,” in proc. of National Conference on NAQC-2008, pp. 141-145, Kolkata.
  17. S. Rakshit, D. Ghosal, T. Das, S. Dutta, S. Basu, “Development Of A Multi-User Recognition Engine For Handwritten Bangla Basic Characters And Digits” In Proc.(CD)of International Conference on Information Technology and business Intelligence(ITBI-09).
  18. S. Basu, K. Konishi, N. Furukawa, H, Ikeda, “A novel scheme for retrieval of handwritten textual annotations for information Just In Time (iJIT)”, proceedings (CD) of IEEE Region 10 Conference (TENCON) -2008
Index Terms

Computer Science
Information Sciences

Keywords

Optical Character Recognition Handwritten Document Analysis Form Processing Tesseract OCR