CFP last date
20 January 2025
Reseach Article

Analysis of an Automatic Text Content Extraction Approach in Noisy Video Images

by C.p. Sumathi, N. Priya
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 69 - Number 4
Year of Publication: 2013
Authors: C.p. Sumathi, N. Priya
10.5120/11828-7529

C.p. Sumathi, N. Priya . Analysis of an Automatic Text Content Extraction Approach in Noisy Video Images. International Journal of Computer Applications. 69, 4 ( May 2013), 6-13. DOI=10.5120/11828-7529

@article{ 10.5120/11828-7529,
author = { C.p. Sumathi, N. Priya },
title = { Analysis of an Automatic Text Content Extraction Approach in Noisy Video Images },
journal = { International Journal of Computer Applications },
issue_date = { May 2013 },
volume = { 69 },
number = { 4 },
month = { May },
year = { 2013 },
issn = { 0975-8887 },
pages = { 6-13 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume69/number4/11828-7529/ },
doi = { 10.5120/11828-7529 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:29:18.442009+05:30
%A C.p. Sumathi
%A N. Priya
%T Analysis of an Automatic Text Content Extraction Approach in Noisy Video Images
%J International Journal of Computer Applications
%@ 0975-8887
%V 69
%N 4
%P 6-13
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Text either embedded or superimposed within video frames is very useful for describing the contents of the frames, as it enables both keyword and free-text based search, automatic video logging, and video cataloging. Low contrast, noise and poor quality are the main problems of text extraction in video images. This article explores a novel approach for text extraction from video frames, which can handle complex image backgrounds with different font sizes, font styles, and font appearances such as normal and noisy video. The pre processing is done to de-noise the images through wavelet based approach by removing noise in the frequency field and reducing by the soft-threshold method. Then, the enhanced image is obtained through the inverse wavelet transform. The Morphological operators are applied to sharpen the image for clear edges and to detect the connected components accurately. Lastly, features are extracted and fed into an artificial neural network to classify the text pixel from that of the background of the image. A quantitative measure of comparison and analysis is provided by the different parameters with different noises.

References
  1. Ohya. J, Shio. A, and Akamatsu. S," Recognizing Characters in Scene Images," IEEE Transactions on Pattern Analysis and Machine Intelligence, 16:214–224, 1994.
  2. Zhong. Y, Karu. K, and Jain. A. K, "Locating Text in Complex Color Images," Pattern Recognition,28(10):1523–1536,October 1995.
  3. Lopresti. D and Zhou. J, " Document Analysis and the World Wide Web," In International Workshop on Document Analysis Systems, Malvern, PA, USA, pages 651–669, 1996.
  4. Smith. M. A and Kanade. T, "Video skimming for quick browsing based on audio and image characterization", CMU-CS-95-186, Technical report, Carnegie Mellon University, 1995.
  5. Jung. K, "Neural network-based text location in color images", Pattern Recognition Letters ,22:1503-1515, 2001.
  6. JiangWu, Shao-Lin Qu, Qing Zhuo,Wen-YuanWang, "Automatic text detection in complex color images", Proc. of Intl. Conf. on Machine Learning and Cybernetics, 2002. Text Localization and Extraction from Complex Gray Images 785
  7. Yuan. Q and Tan. C. L, "Text Extraction from Gray Scale Document Images Using Edge Information", Proc. of Sixth Intl. Conf. on Document Analysis and Recognition, 2001
  8. Ariki. Y and Teranishi. T, " Indexing and Classification of TV News Articles based on Telop Recognition," International Conference on Document Analysis and Recognition, pages 422–427, 1997.
  9. Yeo. B. L," Visual Content Highlighting Via Automatic Extraction of Embedded Captions on MPEG Compressed Video", SPIE/IS&T Symposium on Electronic Imaging Science and Technology: Digital Video Compression: Algorithms and Technologies, volume 2668, 1996.
  10. Hauptmann. A and Smith. M, "Text, Speech, and Vision for Video Segmentation: The Informedia Project," AAAI Fall 1995 Symposium on Computational Models for Integrating Language and Vision, 1995.
  11. Kazubek. M. "Wavelet domain image de-noising by thresholding and Wiener filtering". Signal Processing Letters IEEE, Volume: 10, Issue: 11, Nov. 2003 265 Vol. 3.
  12. Donoho, D. L "Wavelet Shrinkage and W. V. D. : A 10-minute Tour" (David L. Donoho's website)
Index Terms

Computer Science
Information Sciences

Keywords

Morphological Operators Neural Network Text Extraction Wavelet