4. Electronics - IJECEIERD - OCR - Patange V

Page 1

International Journal of Electronics, Communication & Instrumentation Engineering Research and Development (IJECIERD) ISSN 2249-684X Vol.2, Issue 3 Sep 2012 38-44 © TJPRC Pvt. Ltd.,

O.C.R. – A METHOD OF COMPUTES & WORKS ON THE VISION OF SCANNED IMAGE 1 1

PATANGE V.V. & 2DESHMUKH B.T.

P.G. student & Lecturer, Electronics & Telecommunication Engineering, S.T.B. College of Engineering, Tuljapur. Maharashtra, India 2

Professor, Electrical Engineering Department, J.N.E.C. Aurangabad, Maharashtra, India

ABSTRACT A simple optical character recognition system involves four step procedures of pre-processing, feature extraction, classification and post-processing. In pre-processing stage, binarization, boundary detection and segmentation of image is performed. Thinning and resizing of the segmented image is done for template formation. Image templates acts as feature vectors which are stored as reference database. In classification stage, the test features of vector are compared with all the reference feature vectors and feature vector with maximum match is declared as the best matched template. In post-processing stage, the character associated with the best match template is put in suitable word processor file.

KEYWORDS: OCR, Image Processing, Image, Scanning. INTRODUCTION Optical character recognition is one of the most successful applications in automatic pattern recognition. Optical character recognition has been very active field for research and development. Since mid 1950’s techniques developed were able to recognize high quality printed text documents or neatly written hand written text. The current research in optical character recognition is addressing documents that are not well handled by the available systems. Also efforts are being made to achieve lower substitution error rates and rejection rates even on good quality machine printed text. Selection of a feature extraction method is probably the single most important factor in achieving high recognition performance. Our interest in character recognition is to recognize machine printed digits or characters. From available feature extraction methods reported in literature, a newcomer to the field is faced by the following question. Which feature extraction method is best for the given application? This question leads to characterize the given feature extraction methods, so that the most promising method will be sorted out. One could argue that there is only a limited number of independent numbers of features that can be extracted from a character image, so that which set of features is used is not so important. However, the extracted features must be invariant to the expected distortions and variations that the characters must


Turn static files into dynamic content formats.

Create a flipbook
Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.