We apologize for a recent technical issue with our email system, which temporarily affected account activations. Accounts have now been activated. Authors may proceed with paper submissions. PhDFocusTM
CFP last date
20 December 2024
Reseach Article

Offline Handwritten Script Identification in Document Images

by B.V.Dhandra, Mallikarjun Hangarge
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 4 - Number 6
Year of Publication: 2010
Authors: B.V.Dhandra, Mallikarjun Hangarge
10.5120/834-1170

B.V.Dhandra, Mallikarjun Hangarge . Offline Handwritten Script Identification in Document Images. International Journal of Computer Applications. 4, 6 ( July 2010), 1-5. DOI=10.5120/834-1170

@article{ 10.5120/834-1170,
author = { B.V.Dhandra, Mallikarjun Hangarge },
title = { Offline Handwritten Script Identification in Document Images },
journal = { International Journal of Computer Applications },
issue_date = { July 2010 },
volume = { 4 },
number = { 6 },
month = { July },
year = { 2010 },
issn = { 0975-8887 },
pages = { 1-5 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume4/number6/834-1170/ },
doi = { 10.5120/834-1170 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T19:52:21.207030+05:30
%A B.V.Dhandra
%A Mallikarjun Hangarge
%T Offline Handwritten Script Identification in Document Images
%J International Journal of Computer Applications
%@ 0975-8887
%V 4
%N 6
%P 1-5
%D 2010
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Automatic handwritten script identification from document images facilitates many important applications such as sorting, transcription of multilingual documents and indexing of large collection of such images, or as a precursor to optical character recognition (OCR). In this paper, we investigate a texture as a tool for determining the script of handwritten document image, based on the observation that text has a distinct visual texture. Further, K nearest neighbour algorithm is used to classify 300 text blocks as well as 400 text lines into one of the three major Indian scripts: English, Devnagari and Urdu, based on 13 spatial spread features extracted using morphological filters. The proposed algorithm attains average classification accuracy as high as 99.2% for bi-script and 88.6% for tri-script separation at text line and text block level respectively with five fold cross validation test.

References
Index Terms

Computer Science
Information Sciences

Keywords

Script Identification offline handwritten documents Optical character reader cross validation