Abstract
This paper investigates the problem of the effectiveness of input quality on the performance evaluation of Arabic OCR systems. The experimental results show that performance for Arabic OCR systems gives accepted error rate for low noisy images and gives high error rate for images with high noises and Arabic OCR accuracy can be increased by filtering the noise images. Robust word-based and character-based accuracy metrics are used to show the performance evaluation of different Arabic OCR engines using different samples such as newspapers, books, regular text are used.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Chen, Q.: Evaluation of OCR algorithms for images with different spatial resolutions and noises. Ph.D. thesis, University of Ottawa, Canada 2003. http://bohr.wlu.ca/hfan/cp467/08/nodes/master_thesis.pdf. Accessed Oct 2015
Soudi, A.: Arabic computational morphology knowledge-based and empirical methods. Knowledge-Based and Empirical Methods, pp. 3–14. Springer, Amsterdam (2007)
AbdelRaouf, A., Higgins, C.A., Pridmore, T., Khalil, M.: Building a multi-modal Arabic corpus (MMAC). IJDAR 13.4, 285–302 (2010), (Springer)
AbdelRaouf, A.: Offline printed Arabic character recognition. Ph.D. thesis, The University of Nottingham for the Degree of Doctor of Philosophy, May 2012
Jing, H., Lopresti, D., Shih, C.: Summarizing noisy documents. In: Proceedings of the Symposium on Document Image Understanding Technology, pp. 111–119. Apr 2003
Kanungo, T., Marton, G. A., Bulbul, O.: Performance evaluation of two Arabic OCR products. In: The 27th AIPR workshop: Advances in computer-assisted recognition (pp. 76–83). International Society for Optics and Photonics (1999).
Albidewi, A.: The use of object-oriented approach for Arabic documents recognition. Int. J. Comput. Sci. Netw. Secur. (IJCSNS) 8.4, 341 (2008)
Nartker, T.A., Rice, S., Nagy, G.: Performance metrics for document understanding systems. In: Proceedings of the Second International Conference on Document Analysis and Recognition, pp. 424–427 (1993)
Kjersten, B.: Arabic optical character recognition (OCR) evaluation in order to develop a post-OCR module. Computational and Information Sciences Directorate, ARL, Sept 2011
Kamboj, P., Rani, V.: A brief study of various noise model and filtering techniques. J. Glob. Res. Comput. Sci. 4.4, 166–171 (2013)
Saber, S., Ahmed, A., Hadhoud, M.: Robust metrics for evaluating Arabic OCR systems. In: IEEE IPAS’14, pp. 1–6. University of sfax, Tunisia, Nov 2014
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Saber, S., Ahmed, A., Elsisi, A., Hadhoud, M. (2016). Performance Evaluation of Arabic Optical Character Recognition Engines for Noisy Inputs. In: Gaber, T., Hassanien, A., El-Bendary, N., Dey, N. (eds) The 1st International Conference on Advanced Intelligent System and Informatics (AISI2015), November 28-30, 2015, Beni Suef, Egypt. Advances in Intelligent Systems and Computing, vol 407. Springer, Cham. https://doi.org/10.1007/978-3-319-26690-9_40
Download citation
DOI: https://doi.org/10.1007/978-3-319-26690-9_40
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-26688-6
Online ISBN: 978-3-319-26690-9
eBook Packages: Computer ScienceComputer Science (R0)