Research Article

Handwritten Text Image Compression for Indic Script Document

by Smita V. Khangar, Latesh G. Malik
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 47 - Number 5
Year of Publication: 2012
Authors: Smita V. Khangar, Latesh G. Malik
10.5120/7183-9888

Smita V. Khangar, Latesh G. Malik. Handwritten Text Image Compression for Indic Script Document. International Journal of Computer Applications. 47, 5 (June 2012), 11-16. DOI=10.5120/7183-9888

@article{ 10.5120/7183-9888,
author = { Smita V. Khangar and Latesh G. Malik },
title = { Handwritten Text Image Compression for Indic Script Document },
journal = { International Journal of Computer Applications },
issue_date = { June 2012 },
volume = { 47 },
number = { 5 },
month = { June },
year = { 2012 },
issn = { 0975-8887 },
pages = { 11-16 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume47/number5/7183-9888/ },
doi = { 10.5120/7183-9888 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:41:05.399573+05:30
%A Smita V. Khangar
%A Latesh G. Malik
%T Handwritten Text Image Compression for Indic Script Document
%J International Journal of Computer Applications
%@ 0975-8887
%V 47
%N 5
%P 11-16
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

This paper presents a compression scheme for handwritten text document images in Indian languages. Document image compression is an active area of research, and current OCR technology is not effective at handling handwritten text images. The proposed scheme targets handwritten gray-level documents in Devnagri script. The method is based on separating the foreground of an image from its background and on connected component labeling. Experiments were carried out on handwritten images in Devnagri (Hindi and Marathi). Compression schemes exist for printed text in Indian languages, but little work has been reported on compression standards for handwritten text images. The modules of the proposed scheme achieve a good compression ratio in these experiments. Compression of handwritten text images in Indian languages is therefore an important problem.
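The two building blocks named in the abstract, foreground-background separation and connected component labeling, can be illustrated with a minimal Python sketch. This page does not give the paper's actual algorithm, so everything here is an assumption made for illustration: a fixed global threshold stands in for the separation step, 4-connectivity is used for labeling, and the function names `binarize` and `label_components` are hypothetical.

```python
from typing import List, Tuple


def binarize(gray: List[List[int]], threshold: int = 128) -> List[List[int]]:
    """Assumed separation step: pixels darker than `threshold` become
    foreground (1), all others background (0)."""
    return [[1 if px < threshold else 0 for px in row] for row in gray]


def label_components(binary: List[List[int]]) -> Tuple[List[List[int]], int]:
    """Two-pass, 4-connected component labeling with union-find.

    Returns the label image (0 = background) and the component count.
    """
    parent = [0]  # index 0 is reserved for the background

    def find(x: int) -> int:
        # Follow parent links to the root, halving the path on the way.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    def union(a: int, b: int) -> None:
        ra, rb = find(a), find(b)
        if ra != rb:
            parent[max(ra, rb)] = min(ra, rb)

    rows = len(binary)
    cols = len(binary[0]) if rows else 0
    labels = [[0] * cols for _ in range(rows)]

    # First pass: assign provisional labels and record equivalences.
    for r in range(rows):
        for c in range(cols):
            if not binary[r][c]:
                continue
            up = labels[r - 1][c] if r > 0 else 0
            left = labels[r][c - 1] if c > 0 else 0
            if up == 0 and left == 0:
                parent.append(len(parent))  # new label is its own root
                labels[r][c] = len(parent) - 1
            else:
                labels[r][c] = min(l for l in (up, left) if l)
                if up and left:
                    union(up, left)

    # Second pass: replace each label by a consecutive id of its root.
    remap = {}
    for r in range(rows):
        for c in range(cols):
            if labels[r][c]:
                root = find(labels[r][c])
                labels[r][c] = remap.setdefault(root, len(remap) + 1)
    return labels, len(remap)


# Tiny made-up gray-level "page": dark strokes on a light background.
page = [
    [255, 20, 255, 255, 30],
    [255, 25, 255, 255, 35],
    [255, 255, 255, 255, 255],
    [10, 15, 255, 40, 255],
]
labels, count = label_components(binarize(page))
```

In a scheme of this kind, each labeled component (roughly a stroke or character fragment) could then be encoded separately from the background, which is where the compression gain would come from. How the paper encodes the components is not stated on this page.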

Conclusion

This paper presents a compression strategy for handwritten text in gray-level Indian-language documents. To the best of our knowledge, this is the first effort towards the compression of handwritten text for Indian-language documents. As mentioned earlier, most existing work addresses handwritten documents in foreign languages.

References
  1. I. Witten, T. Bell, H. Emberson, A. Moffat. 1994. Textual Image Compression: Two Stage Lossy/Lossless Encoding of Textual Images. Proceedings of the IEEE.
  2. Y. Ye and P. Cosman. 2001. Dictionary design for text image compression with JBIG2. IEEE Transactions on Image Processing.
  3. P. G. Howard. 1997. Text image compression using soft pattern matching. The Computer Journal.
  4. X. Danhua, B. Xudong. 2009. High efficient compression strategy for scanned receipts and handwritten documents. IEEE International Conference on Information and Engineering.
  5. U. Garain, T. Paquet, L. Heutte. 2006. On foreground-background separation in low quality document images. International Journal of Document Analysis.
  6. B. Gatos, I. Pratikakis. 2004. An Adaptive technique for low quality historical documents. 6th International Workshop on Document Analysis Systems, vol. 3163, pp. 102-113.
  7. J. Kittler, J. Illingworth. 1985. Threshold Selection based on Simple Image Statistics. Computer Vision, Graphics, and Image Processing.
  8. L. Bottou, P. Howard. 1998. High quality document image compression with DjVu. International Journal of Electronic Imaging.
  9. U. Garain, S. Debnath, A. Mandal, B. Chaudhuri. 2003. Compression of scan digitized printed Text: A soft pattern matching technique. ACM Symposium on Document Engineering.
  10. Java Sun Documentation. [Online]. Available: https://docs.oracle.com/javase/1.3/docs/api
  11. Michael B. Dillencourt, Hanan Samet, Markku Tamminen. 1992. A General approach to connected component labeling for arbitrary image representations. J. ACM.
  12. Kesheng Wu, Ekow Otoo, Kenji Suzuki. 2009. Optimizing two pass connected component labeling algorithms. Journal of Pattern Analysis and Application.
  13. S. Naoi. 1995. High speed labeling method using adaptive variable window size for character shape feature. IEEE Asian Conference on Computer Vision.
  14. Christophe Fiorio and Jens Gustedt. 1996. Two linear time union-find strategies for image processing. Theoretical Computer Science.
  15. Bernard A. Galler and Michael J. Fischer. 1964. An improved equivalence algorithm. Communications of the ACM.
Index Terms

Computer Science
Information Sciences

Keywords

Handwritten Text, Connected Component Labeling, Compression, Indian Language, Devnagri Script, Gray Level Document