PDF OCR Text ParsingBack to Videos


This demonstrates the PDF OCR text processing capabilities of SimpleIndex by extracting the Document Number, Date, Document Type, Customer and Total from a number of Estimates and Invoices.

All of this information is read automatically using the existing text layer of a computer generated PDF, such as those created using PDF printer drivers. Template and dictionary matching algorithms are used to locate and extract the correct data values from the text.

Since the existing text is being used, OCR is not performed. This makes processing much faster and 100% accurate. OCR can be used to get text from scanned PDF files with no existing text.

Document Imaging Suite

SimpleIndex
- document capture solution
SimpleView
- document management
SimpleCoversheet
- barcode printing
SimpleSend
- document distribution
SimpleExport
- automatic data conversion
SimpleQB
- Quickbooks data import
SimpleOCR
- OCR engine & freeware
All Simple Software Products
Free Document Scanning Web Demo
Get a free online demo with a scanning specialist who can configure SimpleIndex on your computer remotely.
Sign up for a demo now!
Scanning Software Download
Fully functional 30-day demos are available for all Simple Software applications. Download Now!
ScanStore® and SimpleIndex® are registered trademarks of Meta Enterprises, LLC. All rights reserved.