Research Article

Myanmar Printed Portable Document Format Recognition and Transformation with Formatting

by Dr. Yadana Thein, Cherry Maung
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 9 - Number 6
Year of Publication: 2010
Authors: Dr. Yadana Thein, Cherry Maung
10.5120/1389-1872

Dr. Yadana Thein, Cherry Maung. Myanmar Printed Portable Document Format Recognition and Transformation with Formatting. International Journal of Computer Applications. 9, 6 (November 2010), 23-29. DOI=10.5120/1389-1872

@article{ 10.5120/1389-1872,
author = { Dr. Yadana Thein, Cherry Maung },
title = { Article:Myanmar Printed Portable Document Format Recognition and Transformation with Formatting },
journal = { International Journal of Computer Applications },
issue_date = { November 2010 },
volume = { 9 },
number = { 6 },
month = { November },
year = { 2010 },
issn = { 0975-8887 },
pages = { 23-29 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume9/number6/1389-1872/ },
doi = { 10.5120/1389-1872 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T19:57:56.123982+05:30
%A Dr. Yadana Thein
%A Cherry Maung
%T Article:Myanmar Printed Portable Document Format Recognition and Transformation with Formatting
%J International Journal of Computer Applications
%@ 0975-8887
%V 9
%N 6
%P 23-29
%D 2010
%I Foundation of Computer Science (FCS), NY, USA
Abstract

This paper proposes a system for recognizing printed Myanmar characters together with their formatting. The system consists of two parts: recognition and formatting. It takes a Myanmar Portable Document Format (.pdf) file as input, recognizes formatting attributes such as font size, font style, alignment and tables, and converts the document to a machine-editable Word document (.doc). Table classification is also performed to support table recognition and formatting. Text format, paragraph format and table format can be extracted efficiently. The system is based on MICR (Myanmar Intelligent Character Recognition), which is one kind of ICR (Intelligent Character Recognition). MICR uses statistical and semantic information, including width-to-height ratio, black stroke counts, number of loops, open directions and histogram values. MICR has been applied successfully to character recognition in recent years and can recognize characters with a high accuracy rate and at high speed. The final decision is made by a voting system. The system is implemented using image processing techniques and Matlab programming.
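
As an illustration only, the kind of statistical features named in the abstract, followed by a voting step, can be sketched as below. This is not the authors' MICR implementation, which the abstract says is written in Matlab. The sketch uses Python, and the function names, the toy `RING` glyph, and the candidate labels passed to the vote are all hypothetical. It assumes a glyph is a binary grid in which 1 marks a black pixel.

```python
from collections import Counter, deque

def bounding_box(glyph):
    """Return (top, bottom, left, right) of the black pixels (value 1)."""
    rows = [r for r, row in enumerate(glyph) if any(row)]
    cols = [c for c in range(len(glyph[0])) if any(row[c] for row in glyph)]
    return rows[0], rows[-1], cols[0], cols[-1]

def width_height_ratio(glyph):
    """Width of the black-pixel bounding box divided by its height."""
    top, bottom, left, right = bounding_box(glyph)
    return (right - left + 1) / (bottom - top + 1)

def stroke_count(row):
    """Count runs of consecutive black pixels along one scan line."""
    return sum(1 for i, p in enumerate(row) if p and (i == 0 or not row[i - 1]))

def row_histogram(glyph):
    """Number of black pixels in each row (horizontal projection)."""
    return [sum(row) for row in glyph]

def count_loops(glyph):
    """Count enclosed white regions: white components minus the outer background."""
    h, w = len(glyph), len(glyph[0])
    # Pad with a white border so the outside background forms one component.
    padded = [[0] * (w + 2)] + [[0] + list(row) + [0] for row in glyph] + [[0] * (w + 2)]
    seen = [[False] * (w + 2) for _ in range(h + 2)]
    components = 0
    for r in range(h + 2):
        for c in range(w + 2):
            if padded[r][c] or seen[r][c]:
                continue
            components += 1
            seen[r][c] = True
            queue = deque([(r, c)])
            while queue:  # breadth-first flood fill over 4-connected white pixels
                y, x = queue.popleft()
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    inside = 0 <= ny < h + 2 and 0 <= nx < w + 2
                    if inside and not padded[ny][nx] and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((ny, nx))
    return components - 1

def majority_vote(candidates):
    """Return the label proposed by the most feature-based classifiers."""
    return Counter(candidates).most_common(1)[0][0]

# Hypothetical toy glyph: a closed ring, roughly like a circular character.
RING = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 1, 0, 1, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0],
]
```

In a sketch like this, each feature (ratio, stroke count, loop count, histogram) would be compared against stored reference values to propose a candidate character, and `majority_vote` would pick the label that most features agree on. How the actual system weights or combines its features is not stated in the abstract.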

Index Terms

Computer Science
Information Sciences

Keywords

Hough Transformation, Statistical and Semantic, Table and paragraph formatting, Pali character recognition