SIGN IN YOUR ACCOUNT TO HAVE ACCESS TO DIFFERENT FEATURES

CREATE AN ACCOUNT FORGOT YOUR PASSWORD?

FORGOT YOUR DETAILS?

AAH, WAIT, I REMEMBER NOW!

CREATE ACCOUNT

ALREADY HAVE AN ACCOUNT?
QUESTIONS? CALL: 865-637-8986
  • SIGN UP
  • LOGIN

SimpleIndex

SimpleIndex

T (865) 637-8986
Email: info@simpleindex.com

SimpleIndex by SimpleSoftware
500 W Summit Hill Dr SW # 302, Knoxville, TN 37902

  • LEARN MORE
    • GENERAL INFO
      • Getting Started
      • How To Scan Documents
      • Barcode Scanning Guide
      • Searching & Viewing
      • Schedule a Web Demo
      • News & Updates
    • FEATURES
      • Streamlined Interface
      • TWAIN and ISIS Scanning
      • Dynamic OCR
      • Database Integration
      • Required Documents Check
      • Integrated & Unattended Processing
      • SharePoint Document Scanning
    • –
      • Document Classification
      • PDF & MS Office Text Parsing
      • Barcode Recognition
      • Optical Mark Recognition
      • Match Documents to Existing Data
      • Imprinting & Watermarking
      • Screenshot OCR
  • SOLUTIONS
    • General
      • All-In-One Scanning & Sorting Tool
      • Affordable Document Management
      • Instant Integration
      • Network Scanners & Copiers
      • Remote Document Capture
      • Reduce Click Charges for Data Capture
    • Specific
      • Mortgage & Loan Documents
      • Material Safety Data Sheets (MSDS)
      • Patent ID and Title Extraction
      • Federal Tax Returns
      • Invoice Processing
  • SUITE
    • SimpleCoversheet – Print Bar Codes
    • SimpleExport – Data File Converter
    • SimpleView – Search, View & Edit
    • SimpleQB – QuickBooks Integrator
    • SimpleOCR – Freeware OCR
    • Buy Suite Apps
    • Buy Suite Bundles
  • ORDER NOW
    • COMPARE VERSIONS
      • Versions & Feature
      • Price List (PDF)
    • SIMPLEINDEX WORKSTATION
      • Machine License
      • Concurrent User
      • Subscription License
    • SIMPLEINDEX SERVER
    • SUITE APPLICATIONS
    • SUITE BUNDLES
    • SUPPORT & MAINTENANCE
      • Annual Maintenance Renewals
    • FIND A DEALER
      • Dealer Locator
      • Become a Dealer
  • DEMOS
    • TRIAL DOWNLOADS
    • SCHEDULE A DEMO
    • COMPARE SOLUTIONS
    • VIDEO DEMOS
      • Zone OCR with Template Matching
      • Invoice Processing with Full Page OCR
      • PDF Invoice OCR Demo
      • Sort and Index MS Office Documents
    • SIMPLEINDEX UNIVERSITY
      • SimpleIndex University – 100 Series
      • SimpleIndex University – 200 Series
      • SimpleIndex University – 300 Series
  • CONTACT
    • Contact Us
    • Support
    • FAQ
    • Privacy Policy
  • My Account
    • Downloads
  • MY CART
    No products in cart.
  • Home
  • Page

Process text with RegEx (Regular Expressions) to perform complex pattern matching and extract data from the document text or OCR results.

Zone OCR and Dynamic OCR

Friday, 12 January 2018 by Simple Software

Many document scanning solutions use Zone OCR to obtain index data from the page, while SimpleIndex improves upon this time-tested but ultimately limited model with its unique Dynamic OCR feature. Let’s look at the difference between the two methods:

Zone OCR

Zone OCR is used to read document indexes or tags from text on the page. It is a great way to automate the data entry associated with scanning documents.

However, there are several limitations to zone OCR that must be overcome:

  • Index information must be in the exact same place on every page
  • Documents shift and skew during scanning, causing the zones to not line up
  • If surrounding lines or text on the document are too close, they can encroach on the zone

Dynamic OCR

SimpleIndex overcomes these limitations by using Dynamic OCR technology to locate the desired text even when it moves around on the page. Our simplified version of Dynamic OCR works great for many types of documents at a fraction of the cost of other solutions.

  • Index information can appear anywhere on any page
  • Unwanted characters are automatically ignored
  • Find unique patterns of letters and numbers using Template Matching
    (Social Security #, Date, etc.)
  • Use Dictionary Matching to find a value from a list of possible values
    (Vendor Name, Document Type, etc.)

Download document scanning and OCR software.

Dynamic OCR Examples

In the video we see how SimpleIndex approaches a typical Zone OCR example. With SimpleIndex you can use large zones that give a wide margin for error. Template and Dictionary matching are then used to extract the 7-digit Account Number, 6-digit Order Number and Company Name. SimpleIndex discards the surrounding text and keeps the correct value.

Another common example is finding a unique identifier, for example a social security number, that could appear anywhere on the page. Simply enter the template ###-##-#### and SimpleIndex will search the full OCR text until it finds a match. Since only one social security number is likely to appear on the page, a match on this pattern is almost certainly the required value.

With dictionary matching, you can give SimpleIndex a list of possible values and it will automatically search the zone or page for each possible value until it finds a match.

Many dynamic forms processing applications can be implemented using these simple algorithms. This makes SimpleIndex far more versatile than other zone OCR solutions that require the index value to be in the exact same location on every page. Yet SimpleIndex costs only a fraction of the price!

SimpleIndex‘s dynamic forms processing can greatly speed up data entry by eliminating a good percentage of indexing work. For many this can put the labor cost of scanning within their reach.

MS Office Document OCR Text Parsing Video

Dynamic OCR can also be applied to MS Office and PDF files, creating a fully automated process for intelligently indexing and reorganizing electronic documents.

Support for Regular Expressions

Use Regular Expressions to extract index data from OCR text, PDF and Office documents.

SimpleIndex OCR has a simple built-in template format, as well as support for Regular Expressions. Regular Expressions (RegEx for short) let you define complex search patterns to extract matching values from the text.  This greatly enhances the functionality of the dynamic OCR in SimpleIndex, making it capable of finding variable-length fields with no distinct pattern.

Regular Expressions are a commonly used in text parsing applications. The Perl programming language makes extensive use of RegEx, as do UNIX utilities like “grep”. Many programmers and IT personnel are already familiar with RegEx and can create complex expressions without specific training.

Click here for a reference guide to Regular Expressions

Download document scanning and OCR software.

New OCR Features in Version 9

  • OCR language pack now includes all available Tesseract languages including Hindi, Tamil, Arabic, Chinese, Thai, Vietnamese, Japanese, Korean, Indonesian, Hebrew and many more.

New OCR Features in Version 8

Versions 7 and above include the industry leading ABBYY FineReader® OCR engine for dramatically improved OCR accuracy and speed. Other OCR enhancements in version 8+ include:

  • Point & Click OCR
  • Tesseract OCR Engine now included with all license versions
  • Match OCR index fields against other index fields
  • Skip OCR processing on imported files that already include text, such as PDF + Text files for faster batch times

How to Configure SimpleIndex OCR

Watch this Simple Software University training video to see how to configure and run an OCR job with SimpleIndex.

Download document scanning and OCR software.

 

KB Articles for Optical Character Recognition (OCR)

  • Languages Supported in SimpleSoftware OCR Engines
  • What is Document Imaging?
  • Change the Dictionary Separator Value
  • Change the OCR Font or Type
  • Regular Expression (RegEx) - Syntax or Type
  • Autonumber Increment Value
  • I'm using full page OCR. The information is all appearing in the txt file but it is losing format about half way through. Data to the right is ending up at the end of the txt doc. Can this be fixed?
  • Is there a way to just use part of a bar code or OCR value? For example, extract "50" from the value "124450"
  • If I have a form which is filled manually by hand, can SimpleIndex read the data from it?
  • How do you train the OCR engine for better accuracy?
Automatic Data CaptureBatch ScanningDocument ClassificationDocument ImagingIndexingInvoice OCROCROffice PDF Text ProcessingOptical Character RecognitionRegExScreenshot OCRSearchSeparationServerText ProcessingZone OCR
Read more
No Comments

Contact Us Today!

Search Knowledge Base

Recent KB Articles

  • Command Line Arguments
  • Stop/Turn Off Image Deletion when Blank is Recognized
  • Error in Scanning Batch 743
  • How do I download and utilize TaxStacker after purchasing?
  • How to emulate Server behavior in the client
  • Oracle database is slow to respond
  • SharePoint Login Issues
  • Reset SharePoint Login Information

Feature Cloud

Workstation Barcode OCR XML Invoice Scanning Software Document Classification Bar Codes Text Processing Indexing RegEx ODBC Automatic Indexing Software PDF Archive Scanning Software MySQL Invoice OCR PDF Forms Image Scanning SAGE Wilder Workflow Oracle Licensing & Installation OMR TIFF Retrieval XSLT General Configuration Solution Barcode Recognition Software Document Scanning Metadata Licensing ISIS Driver Screen Scraping OCR Screenshot OCR E-Mail PDF417 Server Patch Code Automatic Data Capture Fast Scanning SimpleView SimpleCoversheet Barcode Printing InstaDocs XSLT Data Conversion Software

Online Support Options

Simple Software provides an interactive Frequently Asked Questions database and Live Support chat system, as well as free Training Videos.

How to Buy

Solutions start at just $500! Buy SimpleIndex online or from an Authorized Dealer in your area. Price List (PDF).

Authorized Dealers

Authorized DealersSimpleIndex is a great addition to any system integrator's product line. Become an Authorized Dealer.

Get a Web Demo

Get a free online demo with a scanning specialist who can configure SimpleIndex on your computer remotely.
Sign up for a demo now!

Download a Trial

SimpleIndex TrialFully functional 30-day demos are available for all Simple Software applications.
Download Now!

SimpleIndex Applications

SimpleIndex Applications See how SimpleIndex can be used in your business.
"Out-of-the-Box" Solutions
Case Studies
Common Applications
Industry-Specific Applications

© 2021 Meta Enterprises, LLC | Knoxville, Tennessee | A Family Owned Company

TOP
});