We apologize for a recent technical issue with our email system, which temporarily affected account activations. Accounts have now been activated. Authors may proceed with paper submissions. PhDFocusTM
CFP last date
20 November 2024
Reseach Article

Novel Approach for Baseline Detection and Text Line Segmentation

by Mahdi Keshavarz Bahaghighat, Javad Mohammadi
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 51 - Number 2
Year of Publication: 2012
Authors: Mahdi Keshavarz Bahaghighat, Javad Mohammadi
10.5120/8013-1039

Mahdi Keshavarz Bahaghighat, Javad Mohammadi . Novel Approach for Baseline Detection and Text Line Segmentation. International Journal of Computer Applications. 51, 2 ( August 2012), 9-16. DOI=10.5120/8013-1039

@article{ 10.5120/8013-1039,
author = { Mahdi Keshavarz Bahaghighat, Javad Mohammadi },
title = { Novel Approach for Baseline Detection and Text Line Segmentation },
journal = { International Journal of Computer Applications },
issue_date = { August 2012 },
volume = { 51 },
number = { 2 },
month = { August },
year = { 2012 },
issn = { 0975-8887 },
pages = { 9-16 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume51/number2/8013-1039/ },
doi = { 10.5120/8013-1039 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:49:21.863387+05:30
%A Mahdi Keshavarz Bahaghighat
%A Javad Mohammadi
%T Novel Approach for Baseline Detection and Text Line Segmentation
%J International Journal of Computer Applications
%@ 0975-8887
%V 51
%N 2
%P 9-16
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Baseline detection and line segmentation are essential preprocessing steps of any OCR system. In this paper we have proposed a robust and fast method for base lines detection based on projected pattern analysis of Radon Transform. The algorithm have been tested on more than 350 samples including both printed and handwriting of Persian/Arabic, English and also multilingual documents. Obtained results indicate that in spite of narrow interline spaces and noisy components our method is capable to extract baseline in documents precisely. In addition, in the case of multi-frequencies pattern, it has been shown that proposed method can reach its performance to accurate detection of base lines.

References
  1. Z.A. Khan, W. Sohn,.” Real Time Human Activity Recognition System basedon Radon Transform”, IJCA Special Issue on “Artificial Intelligence Techniques - Novel Approaches & Practical Applications” AIT, 2011.
  2. H.K.Chethan, G.Hemantha Kumar,2010.Graphics separation and skew correction for mobile captured documents and comparative analysis with existing methods, International Journal of Computer Applications(IJCA) (0975 – 8887), Volume 7– No.3.
  3. N. Priyanka, S.Pal, R. Mandal,”Line and Word Segmentation Approach for PrintedDocuments” IJCA Special Issue on “Recent Trends in Image Processing and Pattern Recognition” , RTIPPR, 2010.
  4. S.Akram, M.U.Din Dar, A.Quyoum, “Document Image Processing - A Review”, International Journal of Computer Applications(IJCA) (0975 – 8887)Volume 10– No.5, November 2010.
  5. A.Bouchemha, A.Nait-Ali,N. Doghmane,”A Robust Technique to Characterize the Palmprint usingRadon Transform and Delaunay Triangulation” , International Journal of Computer Applications (0975 – 8887)Volume 10- No.10, November 2010.
  6. S.Prasad,V.K.Singh,A. Sapre, “Handwriting Analysis based on Segmentation Method for Prediction of Human Personality using Support Vector Machine”,International Journal of Computer Applications (0975 – 8887)Volume 8– No.12, October 2010
  7. M.Hangarge,B.V.Dhandra,”Offline Handwritten Script Identification in Document Images”, International Journal of Computer Applications (0975 – 8887) Volume 4 – No.6, July 2010.
  8. P. Nagabhushan, A.Alaei, “Tracing and Straightening the Baseline inHandwritten Persian/Arabic Text-line: A NewApproach Based on Painting-technique”, (IJCSE) International Journal on Computer Science and EngineeringVol. 02, No. 04, 2010, 907-916.
  9. A.Alaei, P. Nagabhushan, U.Pal, “A New Text-line Alignment Approach Based on Piece-wise Painting Algorithm for Handwritten Documents”, 2011 International Conference on Document Analysis and Recognition, IEEE DOI 10.1109/ICDAR.2011.73.
  10. VahidKiani, Reza Pourreza& Hamid Reza Pourreza, Offline Signature Verification Using Local Radon Transform and Support Vector Machines,International Journal of Image Processing (IJIP) Volume(3), Issue(5).
  11. J.Mohammadi, M.KeshavarzBahaghighat ,R.Akbari , ,Vehicle Speed Estimation Based On The Image Motion Blur Using RADON Transform , 2010 2nd International Conference on Signal Processing Systems (ICSPS).
  12. F. Hjouj, D.W. Kammler. “Identification of Reflected, Scaled, Translated, and Rotated Objects From Their Radon Projections”. IEEE Transactions on Image Processing, 17(3):301-310,2008.
  13. M. R. Hejazi, G. Shevlyakov, Y-S Ho. “Modified Discrete Radon Transforms and Their Application to Rotation-Invariant Image Analysis”. IEEE 8th Workshop on Multimedia Signal Processing, 2006.
  14. M.I.Razzak, M.Sher ,S. A.Hussain,” Locally baseline detection for online Arabic scriptbased languages character recognition”, International Journal of the Physical Sciences Vol. 5(7), pp. 955-959, July 2010.
  15. A. Zahour, B. Taconet, P. Mercy, and S. Ramdane, “ArabicHand-written Text-line Extraction”, in Proceedings of theSixth International. Conference on Document Analysis andRecognition, ICDAR 2001, Seattle, USA, September 10-132001, pp. 281–285.
  16. M. Arivazhagan, H. Srinivasan, S. N. Srihari, "A StatisticalApproach to Handwritten Line Segmentation", inProceedings of SPIE Document Recognition and RetrievalXIV , San Jose, CA, February 2007.
  17. G. Tímár, K. Karacs, Cs. Rekeczky, "Analogic Preprocessing and Segmentation Algorithms ForOfflineHandwriting Recognition", Proceedings of IEEE CNNA’02,World Scientific 2002, pp.407-414.
  18. Y. Li, Y. Zheng, D. Doermann, and S. Jaeger, “A newalgorithm for detecting text line in handwritten documents,”in International Workshop on Frontiers in HandwritingRecognition, 2006, pp. 35–40.
  19. L. Likforman-Sulem, C. Faure, "Extracting text lines inhandwritten documents by perceptual grouping", Advancesin handwriting and drawing : a multidisciplinary approach,C. Faure, P. Keuss, G. Lorette and A. Winter Eds, Europia,Paris, 1994, pp. 117-135.
  20. I.S.I. Abuhaiba, S. Datta, M.J.J. Holt, ''Line Extraction andStroke Ordering of Text Pages'', Proceedings of the ThirdInternational Conference on Document Analysis andRecognition, Montreal, Canada, 1995, pp. 390-393.
  21. L. Likforman-Sulem, A. Hanimyan, C. Faure, “A Houghbased algorithm for extracting text lines in handwrittendocuments", Third International Conference on DocumentAnalysis and Recognition, Vol. 2, August 1995, pp. 774-777.
  22. M. Ziaratban and K. Faez, “A Novel Two-Stage Algorithm forBaseline Estimation and Correction in Farsi and Arabic HandwrittenText line,” Proc. of International Conference on Pattern Recognition(ICPR’08) , 2008, pp. 1-5.
  23. Waibel, A., T. Hanazawa, G. Hilton, K. Shikano, and K. J. Lang, "Phoneme recognition using time-delay neural networks," IEEE Transactions on Acoustics, Speech, and Signal Processing, Vol. 37, 1989, pp. 328–339.
  24. J.Luo, K.Ying, P.He, J.Bai ,”Properties of Savitzky–Golay digital differentiators”, Elsevier Inc. Digital Signal Processing 15 (2005) 122–136.
  25. Sharat S. Chikkerur, Alexander N. Cartwright and VenuGovindaraju, "Fingerprint Image Enhancement Using STFT Analysis", PHD thesis , Center for Unified Biometrics and Sensors University at Buffalo, NY, USA (2003).
Index Terms

Computer Science
Information Sciences

Keywords

Optical Character Recognition Document Analysis Multilingual Documents Radon Transform Neural Networks