OCR Form Processing

Capture data from scanned forms or PDFs with OCR and save it to CSV, XML or any SQL database. Automate PDF forms by capturing data from filled-in forms or filling in blank PDF forms from any data source.

Change the Dictionary Separator Value

Monday, 29 July 2019 by Simple Software

This setting changes the dictionary separator used for thesaurus matching from the default pipe character (|) to any character(s) that you want. It is useful when the values in your list or dictionary might themselves contain the pipe character (“|”, typed with Shift+Backslash).

This setting is also used as the delimiter when parsing multiple index field values from bar codes (e.g. field1|field2|field3).
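
The delimiter behavior described above can be sketched in a few lines of Python. This is only an illustration of the idea, not SimpleIndex code: the function names are hypothetical, and treating %TAB% as a literal tab character is an assumption based on the TAB instruction in this article.

```python
DEFAULT_SEPARATOR = "|"


def resolve_separator(configured: str) -> str:
    """Translate the %TAB% token into a real tab; other values pass through.

    Assumption: %TAB% stands for a single tab character.
    """
    return "\t" if configured == "%TAB%" else configured


def split_barcode_value(raw: str, configured: str = DEFAULT_SEPARATOR) -> list:
    """Split one bar code value into per-field values on the separator."""
    return raw.split(resolve_separator(configured))


# Default separator: three index field values from one bar code.
print(split_barcode_value("field1|field2|field3"))

# A different separator keeps pipes inside the values intact.
print(split_barcode_value("A|1;B|2", ";"))
```

The second call shows why a different separator helps: with ";" as the delimiter, a pipe inside a value is no longer treated as a field boundary.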

Instructions for changing the dictionary separator value:

  1. Right-click the Job Configuration file whose separator you would like to change and select Open With > Notepad.
  2. Search the XML settings text open in Notepad for this term:
    <OCR_DICT_SEPARATOR>
  3. Change the value between the tags from “|” to any other single character that you want.
  4. For TAB separation, use %TAB%.
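
For anyone who needs to make this change across many configuration files, the manual steps above could be scripted. The following is a minimal sketch under stated assumptions: the setting is stored as a single <OCR_DICT_SEPARATOR>…</OCR_DICT_SEPARATOR> element (the closing tag is assumed, by analogy with other settings in the file), and the function name is hypothetical. It works on the file's text, so reading and writing the file is left to the caller.

```python
import re
from xml.sax.saxutils import escape

# Assumption: the value sits between an opening and a closing tag of the same name.
SEPARATOR_ELEMENT = re.compile(
    r"(<OCR_DICT_SEPARATOR>).*?(</OCR_DICT_SEPARATOR>)", re.DOTALL
)


def set_dict_separator(config_text: str, new_separator: str) -> str:
    """Return the configuration text with the separator value replaced."""
    if not SEPARATOR_ELEMENT.search(config_text):
        raise ValueError("<OCR_DICT_SEPARATOR> not found in configuration text")
    # Escape characters such as & or < so the XML stays well-formed.
    safe_value = escape(new_separator)
    return SEPARATOR_ELEMENT.sub(
        lambda m: m.group(1) + safe_value + m.group(2), config_text, count=1
    )


sample = "<OCR_DICT_SEPARATOR>|</OCR_DICT_SEPARATOR>"
print(set_dict_separator(sample, "%TAB%"))
```

Keeping a copy of the original configuration file before overwriting it is a sensible precaution when scripting edits like this.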

Change the OCR Font or Type

Monday, 29 July 2019 by Simple Software

This setting changes the OCR recognition font or type from the default, which is “To Be Detected”. It can be used to look for a specific type of OCR font and is especially useful for recognizing fonts such as Dotmatrix, OCR A and OCR B.

Instructions for setting OCR Font:

1.  Right-click the .sic file, select Open With and choose a text editor (Notepad, WordPad, etc.).

2.  Find <OCR_TEXT_TYPE>. If you can’t find <OCR_TEXT_TYPE>, add the following as the last row in the text file:

<OCR_TEXT_TYPE>#</OCR_TEXT_TYPE>

3.  Replace the # between the tags with the number of the desired font:

  • 0  Normal
  • 1  Typewriter
  • 2  Dotmatrix
  • 3  Index
  • 5  OCR A
  • 6  OCR B
  • 7  MICR E13B
  • 8  MICR CMC7
  • 9  Gothic
  • 10  To Be Detected

4.  Save and close the file.
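
The same edit could be scripted. The sketch below is an illustration, not a SimpleIndex tool: the function name is hypothetical, it works on the text of the .sic file, and it follows the instructions above literally by appending the element as the last row when it is missing. The code table is copied from the list in this article.

```python
import re

# Font codes as listed in this article (4 is not listed).
OCR_TEXT_TYPES = {
    0: "Normal", 1: "Typewriter", 2: "Dotmatrix", 3: "Index",
    5: "OCR A", 6: "OCR B", 7: "MICR E13B", 8: "MICR CMC7",
    9: "Gothic", 10: "To Be Detected",
}

TEXT_TYPE_ELEMENT = re.compile(r"<OCR_TEXT_TYPE>.*?</OCR_TEXT_TYPE>", re.DOTALL)


def set_ocr_text_type(sic_text: str, code: int) -> str:
    """Return the .sic text with OCR_TEXT_TYPE set to the given code."""
    if code not in OCR_TEXT_TYPES:
        raise ValueError(f"Unknown OCR text type code: {code}")
    element = f"<OCR_TEXT_TYPE>{code}</OCR_TEXT_TYPE>"
    if TEXT_TYPE_ELEMENT.search(sic_text):
        # Setting already present: replace its value.
        return TEXT_TYPE_ELEMENT.sub(element, sic_text, count=1)
    # Setting missing: add it as the last row, as the instructions describe.
    if sic_text and not sic_text.endswith("\n"):
        sic_text += "\n"
    return sic_text + element + "\n"


print(set_ocr_text_type("<OCR_TEXT_TYPE>10</OCR_TEXT_TYPE>", 5))
```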


Regular Expression (RegEx) – Syntax or Type

Monday, 29 July 2019 by Simple Software

SimpleIndex uses the .NET syntax for regular expressions (RegEx).
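
As a rough illustration of a pattern that should carry over, the sketch below uses Python's re module rather than .NET itself. The invoice-number format is hypothetical, and the pattern uses only constructs that both syntaxes share (anchors, \d, counted repetition and numbered groups). One known difference to watch for: .NET writes named groups as (?<name>...), while Python writes them as (?P<name>...).

```python
import re

# Hypothetical format: "INV-" followed by exactly six digits.
# Only constructs common to Python's re and .NET regular expressions are used.
INVOICE_PATTERN = r"^INV-(\d{6})$"

match = re.search(INVOICE_PATTERN, "INV-004521")
print(match.group(1) if match else None)

# A value that does not fit the format produces no match.
print(re.search(INVOICE_PATTERN, "INV-45"))
```

Patterns that rely on .NET-only features should be tested in SimpleIndex itself rather than in another regex engine.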


I’m using full page OCR. The information is all appearing in the txt file but it is losing format about half way through. Data to the right is ending up at the end of the txt doc. Can this be fixed?

Wednesday, 28 February 2018 by dwilder

SimpleIndex version 7 solves this problem with the incorporation of the FineReader OCR engine. Full text in PDFs will now flow with the formatting of the PDF.

Legacy Versions: SimpleIndex can also be used with other OCR applications and servers to improve accuracy, formatting and performance. Use the OCR applications to convert the scanned images to text or searchable PDF, and SimpleIndex can extract index values from the text and automatically sort and organize the files.


Is there a way to just use part of a bar code or OCR value? For example, extract “50” from the value “124450”

Wednesday, 28 February 2018 by dwilder

To do this, create a barcode field (Field 1, for example) and a second field of type “Fixed”. In the template for the second field, enter %FIELD1[5,2]% to get “50” from “124450”.

%FIELD1% on its own returns the entire value of Field 1, the barcode field. Adding [5,2] tells SimpleIndex to start at the 5th character (“5”) and take 2 characters from the value (“50”).
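
The [start,length] rule can be checked with a small Python sketch. This is an emulation written for illustration only, not SimpleIndex code: the function name and the way field values are passed in are hypothetical. The key point is that the start position counts from 1, not 0.

```python
import re

# Matches %FIELD1% as well as %FIELD1[5,2]%.
FIELD_TOKEN = re.compile(r"%FIELD(\d+)(?:\[(\d+),(\d+)\])?%")


def expand_template(template: str, fields: dict) -> str:
    """Replace each %FIELDn% or %FIELDn[start,length]% token with its value."""

    def substitute(match):
        value = fields[int(match.group(1))]
        if match.group(2) is None:
            return value  # no [start,length]: use the whole value
        start, length = int(match.group(2)), int(match.group(3))
        if start < 1:
            raise ValueError("start position counts from 1")
        # Convert the 1-based start into Python's 0-based slice.
        return value[start - 1:start - 1 + length]

    return FIELD_TOKEN.sub(substitute, template)


print(expand_template("%FIELD1[5,2]%", {1: "124450"}))  # 50
print(expand_template("%FIELD1%", {1: "124450"}))       # 124450
```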

Find out more about barcode scanning on our Barcode Scanning Guide and read up on Optical Character Recognition on the SimpleOCR scanning solutions guide.


If I have a form which is filled manually by hand, can SimpleIndex read the data from it?

Wednesday, 28 February 2018 by dwilder

No, SimpleIndex cannot read handwriting. You would have to type this information in manually.
Find out more about ICR (Handprint Recognition) software on the SimpleOCR ICR Guide.


How do you train the OCR engine for better accuracy?

Wednesday, 28 February 2018 by dwilder

Training has been removed with version 7 due to the addition of the ABBYY FineReader OCR engine.


How do you configure full text searching in Retrieval mode?

Wednesday, 28 February 2018 by dwilder

On the Database tab, there is a dropdown in the lower portion of the panel for Full Text OCR Field. Enter the name of the field that will store the full-text data there. This must be configured in both the Insert and Retrieval mode configurations. The database field needs to be of sufficient length to store the entire text of your document.

Of course, the Insert Mode configuration must have “Enable Full Page OCR” checked to generate full text data from images. Text from MS Office documents, PDF files and existing OCR text files can be used without setting this option.

When designing your Retrieval Mode configuration, create a Text field to use for full text search queries. On the Database tab, set the corresponding “Database Field Name” to the full text database field.

When searching on your full text field, SimpleIndex finds the text you enter no matter where it appears in the document. It can match partial words, but it does not perform Boolean or natural language searches: the text entered must match the document text exactly.
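
The matching rule described above behaves like a plain substring test. The sketch below illustrates that rule only and does not show how SimpleIndex queries the database. The function name and sample text are hypothetical, and because the article does not say how letter case is handled, the case-sensitive comparison here is an assumption.

```python
def full_text_match(query: str, document_text: str) -> bool:
    """True when the query appears anywhere in the text, character for character."""
    return query in document_text


text = "Invoice 1042 was paid in full on receipt."

# Partial words match because the query may start or end mid-word.
print(full_text_match("voice 10", text))          # True

# Boolean operators are not interpreted; the literal string is searched for.
print(full_text_match("Invoice AND paid", text))  # False
```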


How can I improve recognition rates for my OCR fields?

Wednesday, 28 February 2018 by dwilder

There are several things you can do to improve accuracy for OCR.

- Scan at 300 dpi, black & white for best results.

- Adjust the scan settings to remove background noise and improve the definition of characters.

- For Zone OCR, field recognition can often vary based on the surrounding white space and text in the zone. Try varying the size of the zone to achieve optimal results.

- For template matching, make sure all variations of the field format are included in the template list.

- For dictionary matching, add common variations and OCR mistakes to the “thesaurus” list.

- On the Zones & OCR tab (accessed from the Job Options), you can adjust the Max Errors setting to allow for more mistakes in the dictionary matching process.

- Use the Strip Spaces, Strip Characters, Replace Characters and Case Fixing options to standardize the field format prior to matching.

Please refer to the manual for details on how to configure these options.
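
To show how a thesaurus list, a Max Errors limit and normalization can work together, here is a minimal Python sketch. It does not reproduce the SimpleIndex matching engine. Several things are assumptions: each thesaurus line is taken to be a preferred value followed by its variations, separated by “|”; errors are counted as edit distance; and normalization is reduced to stripping spaces and upper-casing. All function names are hypothetical.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: single-character edits needed to turn a into b."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def normalize(value: str) -> str:
    """Stand-in for Strip Spaces and Case Fixing: drop spaces, upper-case."""
    return value.replace(" ", "").upper()


def match_dictionary(ocr_value, thesaurus_lines, max_errors=1, separator="|"):
    """Return the preferred value of the closest entry, or None if none is close enough."""
    target = normalize(ocr_value)
    best = None  # (distance, preferred value)
    for line in thesaurus_lines:
        variants = line.split(separator)
        for variant in variants:
            distance = edit_distance(target, normalize(variant))
            if distance <= max_errors and (best is None or distance < best[0]):
                best = (distance, variants[0])
    return best[1] if best else None


thesaurus = ["Invoice|lnvoice|Inv0ice", "Receipt|Reciept"]
print(match_dictionary("INVO1CE", thesaurus, max_errors=1))    # Invoice
print(match_dictionary("Statement", thesaurus, max_errors=1))  # None
```

Raising max_errors accepts noisier OCR results but also increases the chance of matching the wrong entry, which is why adding known OCR mistakes to the list is often the safer first step.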

Find out more about Optical Character Recognition on the SimpleOCR Guide. You may also check out our Advanced OCR Guide to find out how to use third-party OCR applications with SimpleIndex.


Can OCR text be saved to MS Word or HTML formats?

Wednesday, 28 February 2018 by dwilder

Yes. On the Zones & OCR tab of the Job Options, there is a dropdown list for “Full-page OCR file type”. By default it is set to TEXT, but can be changed to WORD, HTML or PDF.

If the output file type is set to PDF, OCR text will be embedded as hidden text in the PDF file.

Find out more about Optical Character Recognition on the SimpleOCR Guide.


Can SimpleIndex create searchable PDF Image+Text files with hidden text?

Wednesday, 28 February 2018 by dwilder

If you enable full-page OCR and output to PDF, the full-page OCR text will be inserted as invisible text on each page.

With the addition of the FineReader Engine in version 7, SimpleIndex now creates PDF files with fully searchable text formatted to flow with the image of the document.

Find out more about Optical Character Recognition on the SimpleOCR Guide.

