CFP last date
20 December 2024
Reseach Article

Segmentation of Touching, Overlapping, Skewed and Short Handwritten Text Lines

by Rohini.s, Uma Devi.r.s, Mohanavel.s
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 49 - Number 19
Year of Publication: 2012
Authors: Rohini.s, Uma Devi.r.s, Mohanavel.s
10.5120/7877-1163

Rohini.s, Uma Devi.r.s, Mohanavel.s . Segmentation of Touching, Overlapping, Skewed and Short Handwritten Text Lines. International Journal of Computer Applications. 49, 19 ( July 2012), 24-27. DOI=10.5120/7877-1163

@article{ 10.5120/7877-1163,
author = { Rohini.s, Uma Devi.r.s, Mohanavel.s },
title = { Segmentation of Touching, Overlapping, Skewed and Short Handwritten Text Lines },
journal = { International Journal of Computer Applications },
issue_date = { July 2012 },
volume = { 49 },
number = { 19 },
month = { July },
year = { 2012 },
issn = { 0975-8887 },
pages = { 24-27 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume49/number19/7877-1163/ },
doi = { 10.5120/7877-1163 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:46:38.032812+05:30
%A Rohini.s
%A Uma Devi.r.s
%A Mohanavel.s
%T Segmentation of Touching, Overlapping, Skewed and Short Handwritten Text Lines
%J International Journal of Computer Applications
%@ 0975-8887
%V 49
%N 19
%P 24-27
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Text line segmentation is an inherent part of document recognition system and important preprocessing step for word and character segmentation. Presence of touching or overlapping text lines, short-lines, curvilinear or skewed lines and small or variant gaps between the text lines make the segmentation challenging. These variations cause errors in recognition phase. This paper describes the top-down approach of handwritten text line segmentation. The proposed method begins with core detection. To segment the overlapping components, run-length is used for obtaining the structural knowledge which classifies the components into upper and lower text lines. To segment the short lines and skewed lines, distance metrics and connected component are used recursively. The system was evaluated using 200 images from the IAM database and 100 documents collected from different writers. From the experiments conducted, it was observed that the system has 91. 92% accuracy and imbibes in its reliability.

References
  1. Abo Samra. K, et al. , (2011), A Comprehensive Algorithm for Segmenting Handwritten Arabic Scripts in Off-Line Systems, Document Recognition and Retrieval XVIII, Electronic Imaging, United States.
  2. Brodic. D, (2011), Methodology for the Evaluation of the Algorithms for Text Line Segmentation Based on Extended Binary Classification, Measurement Science Review, Volume 11, No. 3.
  3. Jayant Kumar, et al. , Segmentation of Handwritten Text lines in Presence of Touching Components.
  4. Laurence Likforman-Sulem, et al. , Text Line Segmentation of Historical Documents: A Survey.
  5. Likforman Sulem L. and Faure C. , (1994), Extracting Lines on Handwritten Documents by Perceptual Grouping, Advances in Handwriting and Drawing: A Multidisciplinary Approach, pp. 21-38, Europia, Paris.
  6. Louloudisa, et. al. , (2009), Text line and Word Segmentation of Handwritten Documents, Science Direct Pattern Recognition Journal, Vol. 42, pp 3169-3183
  7. Manmatha R. and Srimal N, (1999), Scale Space Technique for Word Segmentation in Handwritten Manuscripts, Proceedings of 2nd International Conference on Scale Space Theories in Computer Vision, pp. 22-33.
  8. Marti U. and Bunke H. , (2001), The Influence of Vocabulary Size and Language Models in Unconstrained Handwritten Text Recognition, Proceedings of ICDAR'01, Seattle, pp. 260-265
  9. Nicolaou. A and Gatos. B, (2009), Handwritten Text Line Segmentation by Shredding Text into its Lines, 10th International Conference on Document Analysis and Recognition.
  10. Oztop E. , et al. , (1999), Repulsive Attractive Network for Baseline Extraction on Document Images, Signal Processing, Vol. 75, pp. 1-10.
  11. Rodolfo P. , et al. , Text Line Segmentation Based on Morphology and Histogram Projection.
  12. Shi Z. and Govindaraju V. , (2004), Line Separation for Complex Document Images Using Fuzzy Run length, Proceedings of the International Workshop on Document Image Analysis for Libraries, Palo, Alto, CA.
  13. Syed Saqib Bukhari, et al. , Script-Independent Handwritten Text line Segmentation Using Active Contours.
  14. Tseng Y. H. and Lee H. J. , (1999), Recognition-based Handwritten Chinese Character Segmentation Using a Probabilistic Viterbi Algorithm, Pattern Recognition Letters, Vol. 20, No. 8, pp. 791-806.
  15. Vassilis Papavassilioua, et al. , (2010), Handwritten Document Image Segmentation into Text lines and Words, Science Direct Pattern Recognition Journal, Vol. 43, pp 369-377.
  16. Wong K. , R. Casey and F. Wahl, (1982), Document Analysis Systems, IBM Journal of research and development, Vol. 26, No. 6.
  17. Yangdong Gao, Xiaoqing Ding and Changsong Liu, (2011), A Multi-scale Text Line Segmentation Method in Freestyle Handwritten Documents, International Conference on Document Analysis and Recognition.
Index Terms

Computer Science
Information Sciences

Keywords

Line Segmentation Connected Component Distance Metrics Run Length