You are here
A multi-stage document processing approach To Arabic text recognition.
We approach the analysis of electronic documents as a multi-stage process, which we implement via a multi-filter document processing framework that provides (a) flexibility for research prototyping, (b) efficiency for development, and (c) reliability for deployment. In the context of this framework, we present our multi-stage solutions to multi-engine Arabic OCR (MEMOE) and Arabic handwriting recognition (AHWR). We also describe our adaptive pre-OCR document image cleanup system called ImageRefiner. Experimental results are reported for all mentioned systems.