Optical character recognition, usually abbreviated to OCR, is the mechanical or electronic conversion of scanned images of handwritten, typewritten or printed text into machine-encoded text. It is widely used as a form of data entry from some sort of original paper data source, whether documents, sales receipts, mail, or any number of printed records. It is a common method of digitizing printed texts so that they can be electronically searched, stored more compactly, displayed on-line, and used in machine processes such as machine translation, text-to-speech and text mining. OCR is a field of research in pattern recognition, artificial intelligence and computer vision.
Early versions needed to be programmed with images of each character, and worked on one font at a time. "Intelligent" systems with a high degree of recognition accuracy for most fonts are now common. Some systems are capable of reproducing formatted output that closely approximates the original scanned page including images, columns and other non-textual components.
Read more about Optical Character Recognition: History, Importance of OCR To The Blind, OCR Software, Current State of OCR Technology
Famous quotes containing the words optical, character and/or recognition:
“The convent, which belongs to the West as it does to the East, to antiquity as it does to the present time, to Buddhism and Muhammadanism as it does to Christianity, is one of the optical devices whereby man gains a glimpse of infinity.”
—Victor Hugo (18021885)
“Gross and obscure natures, however decorated, seem impure shambles; but character gives splendor to youth, and awe to wrinkled skin and gray hairs.”
—Ralph Waldo Emerson (18031882)
“In a cabinet of natural history, we become sensible of a certain occult recognition and sympathy in regard to the most unwieldy and eccentric forms of beast, fish, and insect.”
—Ralph Waldo Emerson (18031882)