Guide to OCR for Indic Scripts: Document Recognition and Retrieval (Google eBook)

Front Cover
Springer Science & Business Media, Sep 25, 2009 - Computers - 346 pages
0 Reviews
This unique guide/reference is the very first comprehensive book on the subject of OCR (Optical Character Recognition) for Indic scripts. Features: contains contributions from the leading researchers in the field; discusses data set creation for OCR development; describes OCR systems that cover 8 different scripts Bangla, Devanagari, Gurmukhi, Gujarati, Kannada, Malayalam, Tamil, and Urdu (Perso-Arabic); explores the challenges of Indic script handwriting recognition in the online domain; examines the development of handwriting-based text input systems; describes ongoing work to increase access to Indian cultural heritage materials; provides a section on the enhancement of text and images obtained from historical Indic palm leaf manuscripts; investigates different techniques for word spotting in Indic scripts; reviews mono-lingual and cross-lingual information retrieval in Indic languages. This is an excellent reference for researchers and graduate students studying OCR technology and methodologies.
  

What people are saying - Write a review

We haven't found any reviews in the usual places.

Contents

Building Data Sets for Indian Language OCR Research
3
Bangla and Devanagari
27
A Complete MachinePrinted Gurmukhi OCR System
43
Progress in Gujarati Document Processing and Character Recognition
73
Design of a Bilingual KannadaEnglish OCR
97
Recognition of Malayalam Documents
125
A Complete OCR System for Tamil Magazine Documents
147
Experiments on Urdu Text Recognition
163
Online Handwriting Recognition for Indic Scripts
209
Part II Retrieval of Indic Documents
235
Enhancing Access to Primary Cultural Heritage Materials of India
237
Digital Image Enhancement of Indic Historical Manuscripts
249
GFGBased Compression and Retrieval of Document Images in Indian Scripts
269
Word Spotting for Indic Documents to Facilitate Retrieval
285
Indian Language Information Retrieval
301
Colour Plates
315

The BBN Byblos Hindi OCR System
173
Generalization of Hindi OCR Using Adaptive Segmentation and Font Files
181

Common terms and phrases

About the author (2009)

Dr Venu Govindaraju is a UB Distinguished Professor of Computer Science and Engineering at the University at Buffalo (SUNY Buffalo) and the founder of the Center for Unified Biometrics and Sensors (CUBS). He has coauthored more than 300 reviewed technical papers, four U.S. patents and two books.

Bibliographic information