Screen Scraping is a commonly used method for transferring data from one application to another by using OCR to read text from the application window.
The Syntax or Type of Regular Expression/RegEx that SimpleIndex uses is .NET
Is there a way to just use part of a bar code or OCR value? For example, extract “50” from the value “124450”
To do this example, create a barcode field (Field 1 for example) and a 2nd field with type “Fixed”. In the template for the 2nd field, enter %FIELD1[5,2]% to get “50” from “124450”.
%FIELD1% would get the entire value for Field #1, the barcode field. By adding the [5,2] you tell SimpleIndex to start at the 5th character (5) and take 2 characters from the value (50).
Training has been removed with version 7 due to the addition of the ABBYY FineReader OCR engine.
There are several things you can do to improve accuracy for OCR. -Scan at 300dpi, black & white for best results. -Adjust the scan settings to remove background noise and improve the definition of characters. -For Zone OCR, field recognition can often vary based on the surrounding white space and text in the zone. Try varying the size of the zone to achieve optimal results. -For template matching, make sure all variations of the field format are included in the template list. -For dictionary matching, add common variations and OCR mistakes to the “thesaurus” list. -On the Zones & OCR tab (accessed from the Job Options) you can adjust the Max Errors setting to allow for more mistakes in the dictionary matching process. -Use the Strip Spaces, Strip Characters, Replace Characters and Case Fixing options to standardize the field format prior to matching. Please refer to the manual for details on how to configure these options. Find out more about Optical Character Recog