CFP last date
20 December 2024
Reseach Article

Denoising of Document Images using Discrete Curvelet Transform for OCR Applications

by C. Patvardhan, A. K. Verma, C. V. Lakshmi
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 55 - Number 10
Year of Publication: 2012
Authors: C. Patvardhan, A. K. Verma, C. V. Lakshmi
10.5120/8790-2775

C. Patvardhan, A. K. Verma, C. V. Lakshmi . Denoising of Document Images using Discrete Curvelet Transform for OCR Applications. International Journal of Computer Applications. 55, 10 ( October 2012), 20-27. DOI=10.5120/8790-2775

@article{ 10.5120/8790-2775,
author = { C. Patvardhan, A. K. Verma, C. V. Lakshmi },
title = { Denoising of Document Images using Discrete Curvelet Transform for OCR Applications },
journal = { International Journal of Computer Applications },
issue_date = { October 2012 },
volume = { 55 },
number = { 10 },
month = { October },
year = { 2012 },
issn = { 0975-8887 },
pages = { 20-27 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume55/number10/8790-2775/ },
doi = { 10.5120/8790-2775 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:56:53.101644+05:30
%A C. Patvardhan
%A A. K. Verma
%A C. V. Lakshmi
%T Denoising of Document Images using Discrete Curvelet Transform for OCR Applications
%J International Journal of Computer Applications
%@ 0975-8887
%V 55
%N 10
%P 20-27
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

In this paper, a denoising and binarization scheme of document images corrupted by white Gaussian noise and Impulse noise is presented using Curvelet Transform. The ability of sparse representation and edge preservation of Curvelet transform is utilized. Impulse noise gets added during document scanning or after binarization of scanned document images. White Gaussian noise corrupts the document images during transmission. The presence of either type of noise or a combination of them can severely degrade the performance of any OCR system. In the proposed denoising scheme, the curvelet transform is used with level dependent threshold calculated by modified sqtwolog method (universal threshold) at each scale with estimation of noise standard deviation. The noisy curvelet coefficients are thresholded by Hard Thresholding method. After curvelet based denoising, the image is binarized using global Otsu method and post processed to smoothen the text boundaries and remove isolated pixels for better OCR performance. The curvelet based scheme is compared with a wavelet transform based scheme and a modified wavelet based scheme with edge preservation. The results show that curvelet based scheme performs better in case of images containing Gaussian, Impulse and a combination of both the noises.

References
  1. Gonzalez Rafael C. , Richard E. Woods, "Digital Image Processing", 2e, PHI, ISBN 978-81-203-2758-0, 2002.
  2. Fu S. , Ruan Q. , Wang W. and Li Y. , "Feature Preserving Nonlinear Diffusion for Ultrasonic Image Denoising and Edge Enhancement", World Academy of Science, Engineering and Technology, Vol. 2, pp. 148-151. , 2005.
  3. Lei Zhang, Weisheng Dong, David Zhang, Guangming Shi, "Two-stage image denoising by principal component analysis with local pixel grouping", Elsevier Pattern Recognition, Vol. 43, Issue 4, pp. 1531-1549, 2010.
  4. Antonin Chambolle, "An Algorithm for Total Variation Minimization and Applications", Journal of Mathematical Imaging and Vision, Kluwer Academic Publishers, Vol. 20, pp. 89–97, 2004.
  5. Leonid I. Rudin, Stanley Osher and Emad Fatemi, "Nonlinear total variation based noise removal algorithms", Elsevier Science Publishers, Physica D: Nonlinear Phenomena, Vol. 60, pp. 259-268, 1992.
  6. Ricardo D. da Silva, R. Minetto, W. R. Schwartz, H. Pedrini, "Adaptive edge-preserving image denoising using wavelet Transforms", Springer, Pattern Analysis and Applications (PAA), DOI: 10. 1007/s10044-012-0266-x, pp. 1-14, 2012.
  7. M. Dai,C. Peng, A. K. Chan and D. Loguinov, "Bayesian Wavelet Shrinkage with Edge Detection for SAR Image Despeckling", IEEE transactions on Geoscience and Remote Sensing, Vol. 42, No. 8, 2004.
  8. D. Gnanadurai, V. Sadasivam, "An Efficient Adaptive Thresholding Technique for Wavelet Based Image Denoising", World Academy of Science, Engineering and Technology, Vol. 1(2), pp. 114-119, 2006.
  9. J. Starck, E. J. Candes and D. L. Donoho, "The Curvelet Transform for Image Denoising", IEEE transactions on image processing, Vol. 11, No. 6, June 2002.
  10. Al-Dahoud Ali, P. D. Swami and J. Singhai, "Modified Curvelet Thresholding Algorithm for Image Denoising", Journal of Computer Science, Science Publications, Vol. 6 (1), pp. 18-23, 2010.
  11. Thai V. Hoang, Elisa H. Barney Smith and Salvatore Tabbone, "Edge noise removal in bilevel graphical document images using sparse representation", Proceedings of 18th IEEE International Conference on Image Processing (ICIP), pp. 3549-3552, Brussels, Belgium, 2011.
  12. Jianwei Ma, Gerlind Plonka, "Combined Curvelet Shrinkage and Nonlinear Anisotropic Diffusion", IEEE transactions on image processing, Vol. 16, No. 9, September 2007.
  13. Linda Tessens, Aleksandra Piz?urica, Alin Alecu, Adrian Munteanu and Wilfried Philips, "Context adaptive image denoising through modeling of curvelet domain statistics", SPIE Journal of Electronic Imaging, Vol. 17(3), pp. 33017-33021. 2008.
  14. D. L. Donoho, "De-noising by soft thresholding", IEEE Transaction on Information Theory, Vol. 41, pp. 613-627, 1995.
  15. I. Pratikakis, B. Gatos, K. Ntirogiannis, "Handwritten Document ImageBinarizationCompetition (HDIBCO), 12thInternationalConferenceon frontiers in handwriting recognition, IEEE, 2010.
  16. ABBYY Fine Reader: http://finereader. abbyy. com/
Index Terms

Computer Science
Information Sciences

Keywords

Curvelets Wavelets Edge Preservation Gaussian Noise Impulse Noise Thresholding Denoising Document Images