At microMEDIA, we utilize a number of different optical character recognition (OCR) and automated forms processing software packages. Our primary forms processing engine is ReadSoft software. We can offer document scanning and conversions for both structured and unstructured forms. The performance of an automatic machine print (OCR) or handprint (ICR) recognition system depends heavily on the quality of the image.
If a document contains many broken, touching or noisy character images, the processing accuracy of the OCR or automated forms' recognition will degrade significantly, diminishing the chances of finding the right data from the document. Our 28 years of experience converting documents assures the highest possible image quality from the documents we scan and thus the most effective OCR and automated forms processing.
Machine-print
Machine-print text is much easier to recognize than handwriting, primarily because of its regularity. Each time a letter appears, it appears the exact same way throughout the text. Isolating words or characters from the given text can be done reliably from a few heuristic rules, and variations among different fonts are very limited.
Isolated Handprint
Besides machine-print, microMEDIA can read isolated handprint, touching handprint, cursive and mixed-writing (part cursive, part print). Right now, the most widely used ICR system is the isolated handprint recognition system. This system is popular because characters are written in pre-designed boxes and combs, eliminating the difficult word or character isolation problem.
Data Field Objects
We may have to recognize a phrase, a sentence or even a paragraph of text instead of just a single word. Having a higher-level understanding of such phrases or sentences is important in guiding and checking the recognition processes and their results. In a business form processing environment, data is usually arranged into fields. A data field usually contains one or more words with a fixed set of vocabulary and grammar rules. For example, a date field containing "March 21, 1997" could be written in any of the following styles: 3/21/97; 3-21-97; 03-21-97; 3-21-1997; 3.21.97; Mar. 21, 97; March 21, 1997 and others.
Correction Procedures
Extremely accurate forms conversion is accomplished by comprehensive correction procedures and additional quality control steps; all done by microMEDIA trained technicians.
Onsite Conversion
Onsite document scanning and conversion is available for very sensitive or valuable documents that can't be removed from their current locations. Our onsite Client Services Department conducts large backfile conversions of work as well as ongoing daily processing on a facilities management program.
More than Paper
We don't just scan paper documents and capture their information; we can also convert materials that were recorded in microfilm, microfiche, aperture cards, engineering drawings, books, slides and transparencies. We are especially adept at media beyond paper since our beginnings were in the microfilm industry.
Deliverables
Output is available in word processor format, spreadsheets, ASCII files and direct formatting for client databases.
