In the name of God
Ocr(Optical Character Recognition)
Optical character recognition, usually abbreviated to OCR, is the mechanical or electronic translation of images of handwritten, typewritten or printed text (usually captured by a scanner) into machine-editable text. It is used to convert paper books and documents into electronic files, for instance, to computerize an old record-keeping system in an office, or to serve on a website such as Project Gutenberg. By replacing each block of pixels that resembles a particular character (such as a letter, digit or punctuation mark) or word with that character or word, OCR makes it possible to edit printed text, search it for a given word or phrase, store it more compactly, display or print a copy free of scanning artifacts, and apply such techniques as machine translation, text-to-speech and text mining to it. OCR is a field of research in pattern recognition, artificial intelligence and computer vision. Though academic research in the field continues, the focus on OCR has shifted to implementation of proven techniques. Optical character recognition (using optical techniques such as mirrors and lenses) and digital character recognition (using scanners and computer algorithms) were originally considered separate fields. Because very few applications survive that use true optical techniques, the OCR term has now been broadened to include digital image processing as well.
Early systems required training to read a specific font; they needed to be programmed with images of each character, and it only worked on one font at a time. "Intelligent" systems with a high degree of recognition accuracy for most fonts are now common. Some systems are even capable of reproducing formatted output that closely approximates the original scanned page including images, columns and other non-textual components.
OCR software
OCR Software and ICR Software technology are analytical artificial intelligence systems that consider only sequences of characters rather than whole words or phrases and do not cross-validate data during the recognition process. For these systems to effectively recognize hand printed or machine printed forms, words must be separated into individual characters.
Reference
http://en.wikipedia.org