Gabor filters for Document analysis in Indian Bilingual Documents

Bibliographic Details
Main Authors: Pati, Peeta Basa, Raju, Sabari S, Pati, Nishikanta, Ramakrishnan, AG
Format: Conference Object
Language: unknown
Published: IEEE 2004
Online Access: http://eprints.iisc.ernet.in/386/
http://eprints.iisc.ernet.in/386/1/gabor.pdf
Description
Summary: Reasonable success has been achieved in developing monolingual OCR systems for Indian scripts. Scientists have, optimistically, started to look beyond. Bilingual OCR systems and OCR systems capable of identifying text areas are some of the pointers to future activities in the Indian scenario. Separating text from non-text regions before a document image is passed to OCR is an important task. In this paper, we present a biologically inspired, multi-channel filtering scheme for page layout analysis. The same scheme has also been used for script recognition. Parameter tuning is mostly done heuristically. The scheme has also been found to be computationally viable for commercial OCR system development.
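
To illustrate what a multi-channel filtering scheme of this kind might look like, here is a minimal pure-Python sketch of a small Gabor filter bank that returns one energy value per (wavelength, orientation) channel. This record does not give the paper's filter parameters or feature definitions, so everything here is an assumption made for illustration: the function names (`gabor_kernel`, `filter_energy`, `channel_energies`), the kernel size, the choice `sigma = 0.5 * wavelength`, and the use of mean squared response as the channel feature.

```python
import math

def gabor_kernel(size, wavelength, theta, sigma):
    """Real (even-symmetric) Gabor kernel: Gaussian envelope times a cosine carrier.

    `size` is assumed to be odd so the kernel has a well-defined centre.
    """
    half = size // 2
    kernel = []
    for y in range(-half, half + 1):
        row = []
        for x in range(-half, half + 1):
            # Rotate coordinates so the carrier oscillates along direction theta.
            xr = x * math.cos(theta) + y * math.sin(theta)
            envelope = math.exp(-(x * x + y * y) / (2.0 * sigma * sigma))
            row.append(envelope * math.cos(2.0 * math.pi * xr / wavelength))
        kernel.append(row)
    # Subtract the mean so that flat (textureless) regions give zero response.
    mean = sum(map(sum, kernel)) / (size * size)
    return [[v - mean for v in row] for row in kernel]

def filter_energy(image, kernel):
    """Mean squared filter response over the valid (fully overlapping) region."""
    k = len(kernel)
    h, w = len(image), len(image[0])
    total, count = 0.0, 0
    for i in range(h - k + 1):
        for j in range(w - k + 1):
            r = sum(kernel[a][b] * image[i + a][j + b]
                    for a in range(k) for b in range(k))
            total += r * r
            count += 1
    return total / count

def channel_energies(image, wavelengths, n_orient, size=9):
    """One energy feature per (wavelength, orientation) channel.

    The sigma-to-wavelength ratio is an illustrative assumption, not a value
    taken from the paper.
    """
    feats = []
    for lam in wavelengths:
        for n in range(n_orient):
            theta = n * math.pi / n_orient
            kernel = gabor_kernel(size, lam, theta, sigma=0.5 * lam)
            feats.append(filter_energy(image, kernel))
    return feats

# Vertical stripes with a period of 4 pixels, a crude stand-in for text strokes.
stripes = [[1.0 if (j % 4) < 2 else 0.0 for j in range(16)] for _ in range(16)]
flat = [[1.0] * 16 for _ in range(16)]

stripe_feats = channel_energies(stripes, wavelengths=[4], n_orient=2)
flat_feats = channel_energies(flat, wavelengths=[4], n_orient=2)
```

In this toy setup the channel tuned to the stripe period and orientation responds far more strongly than the orthogonal channel, while the flat patch yields essentially zero energy in every channel. Feature vectors of this kind could then feed a text/non-text or script classifier, although which classifier and which thresholds the authors used is not stated in this record.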