Offline Handwritten Script Identification in Document Images

B.V.Dhandra; Mallikarjun Hangarge

Call for Paper

September Edition

IJCA solicits high quality original research papers for the upcoming September edition of the journal. The last date of research paper submission is 20 August 2026

Submit your paper

Know more

The week's pick

Structured and Compact: A Novel Encoding and Enhancement Paradigm for ML-based SAT Solving

Ziqi Zhang Lan Zhang

Random Articles

Identifying Overloaded Servers and Managing Dynamic Placement of Virtual machines in Cloud

April

2016

A Survey on various Machine Learning Approaches for ECG Analysis

Apr

2017

Sentiment Analysis Approach based N-gram and KNN Classifier

Jul

2018

A Novel Technique for Data Extraction from Hidden Web Databases

February

2011

Reseach Article

Offline Handwritten Script Identification in Document Images

by B.V.Dhandra, Mallikarjun Hangarge

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 4 - Number 6

Year of Publication: 2010

Authors: B.V.Dhandra, Mallikarjun Hangarge

10.5120/834-1170

B.V.Dhandra, Mallikarjun Hangarge . Offline Handwritten Script Identification in Document Images. International Journal of Computer Applications. 4, 6 ( July 2010), 1-5. DOI=10.5120/834-1170

@article{ 10.5120/834-1170,

author = { B.V.Dhandra, Mallikarjun Hangarge },

title = { Offline Handwritten Script Identification in Document Images },

journal = { International Journal of Computer Applications },

issue_date = { July 2010 },

volume = { 4 },

number = { 6 },

month = { July },

year = { 2010 },

issn = { 0975-8887 },

pages = { 1-5 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume4/number6/834-1170/ },

doi = { 10.5120/834-1170 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T19:52:21.207030+05:30

%A B.V.Dhandra

%A Mallikarjun Hangarge

%T Offline Handwritten Script Identification in Document Images

%J International Journal of Computer Applications

%@ 0975-8887

%V 4

%N 6

%P 1-5

%D 2010

%I Foundation of Computer Science (FCS), NY, USA

Abstract

Automatic handwritten script identification from document images facilitates many important applications such as sorting, transcription of multilingual documents and indexing of large collection of such images, or as a precursor to optical character recognition (OCR). In this paper, we investigate a texture as a tool for determining the script of handwritten document image, based on the observation that text has a distinct visual texture. Further, K nearest neighbour algorithm is used to classify 300 text blocks as well as 400 text lines into one of the three major Indian scripts: English, Devnagari and Urdu, based on 13 spatial spread features extracted using morphological filters. The proposed algorithm attains average classification accuracy as high as 99.2% for bi-script and 88.6% for tri-script separation at text line and text block level respectively with five fold cross validation test.

References

Index Terms

Computer Science

Information Sciences

Keywords

Script Identification offline handwritten documents Optical character reader cross validation