This paper presents a simple, efficient, and less costly approach to construct OCR for reading any document that has fix font size and style or handwritten style and uses database to recognize English characters which makes this OCR very simple to manage.
Optical Character Recognition or OCR is the electronic translation of handwritten, typewritten or printed text into machine translated images. It is widely used to recognize and search text from electronic documents or to publish the text on a website. OCR is the machine replication of human reading and has been the subject of intensive research for more than three decades. OCR can be described as mechanical or electronic conversion of scanned images where images can be handwritten, typewritten or printed text. It is a method of digitizing printed texts so that they can be electronically searched and used in machine processes. It converts the images into machine-encoded text that can be used in machine translation, text-to-speech and text mining. This paper presents a simple, efficient, and less costly approach to construct OCR for reading any document that has fix font size and style or handwritten style. To achieve efficiency and less computational cost, OCR in this paper uses database to recognize English characters which makes this OCR very simple to manage. So this research paper is based on the construction, working and applications of OCR. Paper will also discuss different stages of OCR like optical scanning , location segmentation ,preprocessing ,feature extraction and recognition post processing.