SIGN IN YOUR ACCOUNT TO HAVE ACCESS TO DIFFERENT FEATURES

Login with Google
CREATE AN ACCOUNT FORGOT YOUR PASSWORD?

FORGOT YOUR DETAILS?

AAH, WAIT, I REMEMBER NOW!

CREATE ACCOUNT

ALREADY HAVE AN ACCOUNT?

Login with Google

QUESTIONS? CALL: 865-637-8986
  • SIGN UP
  • LOGIN

SimpleIndex

  • LEARN MORE
    • GENERAL INFO
      • Getting Started
      • How To Scan Documents
      • Barcode Scanning Guide
      • Searching & Viewing
      • News & Updates
      • Schedule a Web Demo
    • FEATURES
      • Streamlined Interface
      • TWAIN and ISIS Scanning
      • Zone OCR and Dynamic OCR
      • Database Integration
      • Required Documents Check
      • Automated Processing & 1-Click Interface
      • SharePoint Document Scanning
    • –
      • Document Classification
      • PDF & MS Office Text Parsing
      • Barcode Recognition
      • Optical Mark Recognition
      • Match Documents to Existing Data
      • Imprinting & Watermarking
      • Screenshot OCR
  • SOLUTIONS
    • General
      • All-In-One Scanning & Sorting Tool
      • Affordable Document Management
      • Instant Integration
      • Network Scanners & Copiers
      • Remote Document Capture
      • Reduce Click Charges for Data Capture
    • Specific
      • Sales Tax Exemption Forms
      • Federal Tax Returns
      • Invoice Processing
      • Material Safety Data Sheets (MSDS)
      • Patent ID and Title Extraction
      • Mortgage & Loan Documents
    • Feature Demos
      • Zone OCR with Template Matching
      • Full-Page OCR & Multi-User Workflow
      • PDF Text Processing
      • Organize Office Documents
      • Integration with RPA Bots
      • Compare with Other Solutions
  • SUITE
    • SimpleCoversheet – Print Bar Codes
    • SimpleExport – Data File Converter
    • SimpleView – Search, View & Edit
    • SimpleQB – QuickBooks Integrator
    • SimpleOCR – Freeware OCR
    • Buy Suite Apps
    • Buy Suite Bundles
  • DOWNLOAD
  • SHOP
    • COMPARE VERSIONS
    • SIMPLEINDEX WORKSTATION
      • Machine License
      • Concurrent User
      • Subscription License
    • SIMPLEINDEX SERVER
    • SUITE APPLICATIONS
    • SUITE BUNDLES
    • MAINTENANCE & RENEWALS
    • FIND A DEALER
      • Dealer Locator
      • Become a Dealer
    • CONTACT SALES
  • SUPPORT
    • WIKI HELP
    • KNOWLEDGE BASE
    • SIMPLEINDEX UNIVERSITY
      • SimpleIndex University – 100 Series
      • SimpleIndex University – 200 Series
      • SimpleIndex University – 300 Series
    • PRIVACY POLICY
    • CONTACT SUPPORT
  • My Account
    • Downloads
  • MY CART
    No products in cart.
  • Home
  • Page

Automate the process of capturing data from documents with SimpleIndex. Dynamic OCR uses complex pattern matching to capture data no matter where it appears on a document. Bar code recognition with database matching can completely automate document indexing and filing processes. SimpleIndex provides the most powerful automatic data capture features of any solution in its price range.

Reduce Click Charges for Data Capture

Monday, 14 November 2022 by Simple Software

If you operate a high-volume scanning department or service bureau, chances are you use software like Kofax to scan and index documents for your clients. If you do then you are well aware of the high cost of click charges and the inevitable mad rushes to purchase additional clicks at the end of a peak volume month.

There are some scanning jobs that need the multi-user batching and indexing features of these systems, but many do not. SimpleIndex® can help you save big on click charges by supplementing your primary scanning infrastructure, letting you perform smaller, less complex jobs in a separate workflow.

Many data capture and forms processing applications charge for every page you process, even if all the data being read is only on the first page. Starting SimpleIndex 9, you can automatically send a copy of the first page from each exported file to a separate folder for data processing, helping you avoid unnecessary processing time and license costs.

Jobs like these can be easily processed with SimpleIndex:

  • Simple scan-to-file with no indexing
  • All indexing is done via bar codes or database lookup
  • No custom export or API integration is required

The following scenarios usually require a more robust solution:

  • Multi-user workflows
  • Complex data extraction and forms processing
  • Direct application integration with APIs

Basically, SimpleIndex is great for 1-2 user workflows where a single user performs the whole scanning and indexing process, or where one person scans and another indexes on a separate workstation. When more than 2 users are required to keep up with indexing volume then it makes more sense to use a system designed for multiple users.

KB Articles for Reduce Click Charges

  • Language Pack for Standard/Tesseract OCR
  • Languages Supported in SimpleSoftware OCR Engines
  • What is Document Imaging?
  • Change the Dictionary Separator Value
  • Change the OCR Font or Type
  • Regular Expression (RegEx) - Syntax or Type
  • Autonumber Increment Value
  • I'm using full page OCR. The information is all appearing in the txt file but it is losing format about half way through. Data to the right is ending up at the end of the txt doc. Can this be fixed?
  • Is there a way to just use part of a bar code or OCR value? For example, extract "50" from the value "124450"
  • If I have a form which is filled manually by hand, can SimpleIndex read the data from it?
1-Click Processing, Automatic Data Capture, Database, Document Classification, Document Imaging, TWAIN & ISIS Scanning, Workflow
1-Click ProcessingAutomatic Data CaptureDatabaseDocument ClassificationDocument ImagingTWAIN & ISIS ScanningWorkflow
Read more
No Comments

Zone OCR and Dynamic OCR

Monday, 07 November 2022 by Simple Software

Many document scanning solutions use Zone OCR to obtain index data from the page.

SimpleIndex improves upon this time-tested but ultimately limited model with its Dynamic OCR feature.

Let’s look at the difference between the two methods:

Zone OCR

Zone OCR is used to read document indexes or tags from text on the page. It is a great way to automate the data entry associated with scanning documents.

However, there are several limitations to zone OCR that must be overcome:

  • Index information must be in the exact same place on every page
  • Documents shift and skew during scanning, causing the zones to not line up
  • If surrounding lines or text on the document are too close, they can encroach on the zone

Dynamic OCR

SimpleIndex overcomes these limitations by using Dynamic OCR technology to locate the desired text even when it moves around on the page. Our simplified version of Dynamic OCR works great for many types of documents at a fraction of the cost of other solutions.

  • Index information can appear anywhere on any page
  • Unwanted characters are automatically ignored
  • Find unique patterns of letters and numbers using Template Matching
    (Social Security #, Date, etc.)
  • Use Dictionary Matching to find a value from a list of possible values
    (Vendor Name, Document Type, etc.)

Download document scanning and OCR software.

Dynamic OCR Examples

In the video we see how SimpleIndex approaches a typical Zone OCR example. With SimpleIndex you can use large zones that give a wide margin for error. Template and Dictionary matching are then used to extract the 7-digit Account Number, 6-digit Order Number and Company Name. SimpleIndex discards the surrounding text and keeps the correct value.

Another common example is finding a unique identifier, for example a social security number, that could appear anywhere on the page. Simply enter the template ###-##-#### and SimpleIndex will search the full OCR text until it finds a match. Since only one social security number is likely to appear on the page, a match on this pattern is almost certainly the required value.

With dictionary matching, you can give SimpleIndex a list of possible values and it will automatically search the zone or page for each possible value until it finds a match.

Many dynamic forms processing applications can be implemented using these simple algorithms. This makes SimpleIndex far more versatile than other zone OCR solutions that require the index value to be in the exact same location on every page. Yet SimpleIndex costs only a fraction of the price!

SimpleIndex‘s dynamic forms processing can greatly speed up data entry by eliminating a good percentage of indexing work. For many this can put the labor cost of scanning within their reach.

MS Office Document OCR Text Parsing Video

Dynamic OCR can also be applied to MS Office and PDF files, creating a fully automated process for intelligently indexing and reorganizing electronic documents.

Amazon AWS Textract Cloud OCR Batch Processing

Amazon AWS Textract Cloud OCR

With Textract you can capture data from almost any type of form, including handwritten ones! Textract identifies labeled text anywhere on the document and returns the label text along with the corresponding value. Map the labels to index fields in SimpleIndex and you are ready to capture that data no matter where it appears on the page.

Textract uses machine learning with a huge model based on the billions of pages processed using Textract to provide the most accurate OCR and form field extraction solution available.

By default, Textract is only available as an API and requires custom coding to integrate it into your document workflows. SimpleIndex turns it into a fully-featured document batch document and data processing app that is ready to use out-of-the-box.

Since there are no templates to configure or train, setup can be done in hours instead of days or weeks months required by other enterprise data capture solutions.

Pay-as-you-go pricing makes SimpleIndex with Textract the most affordable way to batch process forms for projects with less than 50,000 pages per year to process, especially if you need to read handwriting or have forms with many layout variations.

Wiki: How to configure AWS Textract OCR in SimpleIndex

Support for Regular Expressions

Use Regular Expressions to extract index data from OCR text, PDF and Office documents.

SimpleIndex OCR has a simple built-in template format, as well as support for Regular Expressions. Regular Expressions (RegEx for short) let you define complex search patterns to extract matching values from the text.  This greatly enhances the functionality of the dynamic OCR in SimpleIndex, making it capable of finding variable-length fields with no distinct pattern.

Regular Expressions are a commonly used in text parsing applications. The Perl programming language makes extensive use of RegEx, as do UNIX utilities like “grep”. Many programmers and IT personnel are already familiar with RegEx and can create complex expressions without specific training.

Click here for a reference guide to Regular Expressions

Download document scanning and OCR software.

New OCR Features in Version 10

SimpleIndex 10 includes major upgrades to the OCR and Bar Code engines 

  • Amazon Textract Cloud OCR option added, with settings for Text, Forms and Invoice & Receipt extraction.
  • FineReader Engine has been upgraded to version 11. Offers improved accuracy and speed when processing large documents.
  • Full-page OCR to Word (docx), Rich Text (rtf), Open Office (odt), Excel (xlsx), PowerPoint (pptx), ePub Zip (epub), FictionBook (fb2), HTML (htm), XML (xml), Alto XML (alto.xml).
  • MRC Compression for PDF files (Mixed Raster Content).
  • OCR language pack includes all available Tesseract languages including Hindi, Tamil, Arabic, Chinese, Thai, Vietnamese, Japanese, Korean, Indonesian, Hebrew and many more.

How to Configure SimpleIndex OCR

Our Wiki help has extensive information on how to configure OCR for various document and data capture scenarios.

  • Zone OCR read data in a specific location
  • Template matching to match unique patterns
  • Dictionary matching to match a list of possible values
  • OCR Options OCR job settings that apply to all fields
  • File Formats that can be output by OCR
  • Languages supported by OCR
  • FineReader versus Tesseract OCR engines
  • Searchable PDF with MRC compression
  • OCR to Field for point and click OCR during verification
  • Cloud OCR using Textract

Watch this Simple Software University training video to see how to configure and run an OCR job with SimpleIndex.

Download document scanning and OCR software.

 

KB Articles for Optical Character Recognition (OCR)

  • Language Pack for Standard/Tesseract OCR
  • Languages Supported in SimpleSoftware OCR Engines
  • What is Document Imaging?
  • Change the Dictionary Separator Value
  • Change the OCR Font or Type
  • Regular Expression (RegEx) - Syntax or Type
  • Autonumber Increment Value
  • I'm using full page OCR. The information is all appearing in the txt file but it is losing format about half way through. Data to the right is ending up at the end of the txt doc. Can this be fixed?
  • Is there a way to just use part of a bar code or OCR value? For example, extract "50" from the value "124450"
  • If I have a form which is filled manually by hand, can SimpleIndex read the data from it?
Automatic Data Capture, Batch Scanning, Document Classification, Document Imaging, File Indexing, Invoice OCR, OCR, Office PDF Text Processing, Optical Character Recognition, RegEx, Screenshot OCR, Search, Text Processing, Watermark PDF Files, Workflow Software, Zone OCR
Automatic Data CaptureBatch ScanningDocument ClassificationDocument ImagingFile IndexingInvoice OCROCROffice PDF Text ProcessingOptical Character RecognitionRegExScreenshot OCRSearchText ProcessingWatermark PDF FilesWorkflow SoftwareZone OCR
Read more
No Comments

SimpleIndex 10.1 with Textract!

Monday, 16 May 2022 by aaron
Amazon AWS Textract Cloud OCR Batch Processing

SimpleIndex 10.1 is now available, and it adds a huge new feature — Amazon Textract!

The Cloud OCR License adds the Amazon AWS Textract Cloud OCR engine to SimpleIndex, unlocking a bunch of great new capabilities:

  • The highest OCR accuracy of any available engine, using Amazon’s massive machine learning model
  • Handprint recognition, including unconstrained and cursive writing
  • Automatic form field extraction
  • Accounts Payable Invoices and Receipts extraction
  • Pay-as-you-go licensing

The form field extraction feature is pretty amazing. It locates any labeled field on the page and its corresponding value regardless of the page layout, even if the value is handwritten. It makes SimpleIndex able to do jobs that once required enterprise data capture software like Kofax, AnyDoc, or ReadSoft, but at a fraction of the price!

SimpleIndex works with your existing AWS account. For standard OCR it costs about $0.01 per page, for invoices and forms it is about $0.065 per page. This price can vary by region.

Download SimpleIndex 10.1

Wiki: How to configure and use Textract with SimpleIndex

Automatic Data CaptureAutomatic Indexing SoftwareInvoice OCRInvoice Scanning SoftwareMetadataOCROCR Form ProcessingPDF Data Extraction SoftwareRead PDF FormsServer OCR
Read more
  • Published in Release Notes
No Comments

FastImport to Disable Automatic Processing During Import

Thursday, 27 February 2020 by Alex Stewart

SimpleIndex has a variety of processing functions that automatically happen behind the scenes when importing documents to improve the quality and functionality of the images and processing capabilities of the software.

On some occasions these extra processing functions cause delays and conflicts or aren’t needed at all. If these processing functions are causing SimpleIndex to crash or slow down the import processing too much for a particular Job Configuration that can be turned off with a registry setting.

Follow these instructions to add this registry setting:

  1. Close out of SimpleIndex entirely
  2. Open the Windows Registry by going to the Windows Search and searching for “RegEdit”
  3. Go to this location in the Registry Folder Tree: Computer\HKEY_LOCAL_MACHINE\SOFTWARE\WOW6432Node\SimpleIndex\Misc
  4. In the right section of the Registry window Right Click in the white space and select New>String Value
  5. Name the new key “FastImport”
  6. Open the “FastImport” Registry Key, set the value to “1” and then click OK
Automatic Data CaptureAutomatic Indexing SoftwareCommand Line InterfaceCommand-LineDocument AutomationFile IndexingUnattended Processing
Read more
No Comments

What is Document Imaging?

Wednesday, 31 July 2019 by aaron

Document Imaging was the more commonly used term in the early days of document scanning and OCR and refers to any system used to replicate documents used in business. It evolved from the microfilm days where it was referred to as Document Image Management.

Document Imaging allows for the scanning of paper documents, as well as the processing of files saved electronically. These files are then named and saved for later searching.

Other document imaging terms include automatic imaging software, best digital imaging software, best imaging software, desktop imaging software, digital document imaging, digital imaging software, document imaging download, document imaging PDF, document imaging processing, document imaging products, document imaging software, document imaging solution, document imaging solutions, document imaging systems, document imaging technologies, document imaging technology, document imaging tools, image to database, imaging resource, imaging scanning software, imaging software companies, imaging software download, imaging software for windows, imaging solution, scanner imaging software, scanning and imaging, scanning imaging, and software for imaging.

Automatic Data CaptureAutomatic Indexing SoftwareDocument AutomationDocument ClassificationDocument ImagingDocument Management SoftwareDocument ScanningImage ScanningKeyword IndexingOffice PDF Document IndexingPersonal Document ManagementQuickBooks Document ManagementRequired Documents AuditingScanned Document IndexingWorkflow
Read more
No Comments

Index With Non-Latin Character Sets

Monday, 29 July 2019 by Simple Software

By default SimpleIndex uses the ANSI character set to display and edit captured OCR data, index field values and full-text OCR. This works for all languages based on the Latin alphabet (English, French, Spanish, German, etc.)

To index documents in other languages like Chinese, Japanese, Russian, Arabic and other non-Latin alphabets, set the default character set using this registry key. If the key is not set correctly then Unicode text will show up as ??????????.

Use Notepad to edit the “Charset” value from the sample setting below and save it to a .reg file. Then double-click the .reg file to install (Administrator privileges required).

You can download the .reg file here but you still need to edit in Notepad to set the Charset value before installing.

If you are on a 32-bit operating system be sure to remove the extra “\WOW6432Node” from the registry path.

[HKEY_LOCAL_MACHINE\SOFTWARE\WOW6432Node\SimpleIndex\Misc]
“Charset”=”1”

Charset NameCharset Value
ANSI_CHARSET (Latin)0
DEFAULT_CHARSET1
SYMBOL_CHARSET2
SHIFTJIS_CHARSET (Japanese)128
HANGUL_CHARSET (Korean)129
GB2312_CHARSET (Simplified Chinese)134
CHINESEBIG5_CHARSET (Chinese)136
GREEK_CHARSET (Greek)161
TURKISH_CHARSET (Turkish)162
HEBREW_CHARSET (Hebrew)177
ARABIC_CHARSET (Arabic)178
BALTIC_CHARSET (Baltic)186
RUSSIAN_CHARSET (Russian)204
THAI_CHARSET (Thai)222
EE_CHARSET238
OEM_CHARSET255

The full list of values is at https://msdn.microsoft.com/en-us/library/cc194829.aspx.

Automatic Data CaptureAutomatic Indexing SoftwareFile IndexingFull Text IndexingKeyword IndexingMetadataMicrosoft Word Data ExtractionOffice PDF Document IndexingPDF Data Extraction SoftwareScanned Document Indexing
Read more
No Comments

Autonumber Increment Value

Monday, 29 July 2019 by Simple Software

If you want to change the value of how much the Autonumber Increments each time from 1 to any number that you want then do the following:

1.  Right click on the configuration file and “Open With” any text editor, such as Notepad.
2.  Search for the following:
AUTONUMBER_COUNT
3.  Change the number in this entry to the amount that you want the Autonumber to Increment:
<AUTONUMBER_COUNT>1</AUTONUMBER_COUNT>
4.  Save the configuration file.

Automatic Data CaptureDocument Automation
Read more
No Comments

Is there a way to just use part of a bar code or OCR value? For example, extract “50” from the value “124450”

Wednesday, 28 February 2018 by dwilder

To do this example, create a barcode field (Field 1 for example) and a 2nd field with type “Fixed”. In the template for the 2nd field, enter %FIELD1[5,2]% to get “50” from “124450”.

%FIELD1% would get the entire value for Field #1, the barcode field. By adding the [5,2] you tell SimpleIndex to start at the 5th character (5) and take 2 characters from the value (50).

Find out more about barcode scanning on our Barcode Scanning Guide and read up on Optical Character Recognition on the SimpleOCR scanning solutions guide.

Automatic Data CaptureAutomatic Indexing SoftwareBar Code ScanningBar CodesBarcode OCRBarcode Reading SoftwareBarcode Recognition SoftwareClipboard OCRDocument ImagingDocument ScanningImage ScanningInvoice OCRKeyword IndexingOCROCR Form ProcessingOCR ScanningOffice PDF Document IndexingPDF Barcode RecognitionPDF417QR CodeQuickBooks Document ManagementScanned Document IndexingScreen Scraping OCRScreenshot OCRTWAIN Scanning SoftwareZone OCR
Read more
  • Published in Bar Codes, OCR, Office PDF Text Processing
No Comments

How do I configure the output folder and file naming scheme?

Wednesday, 28 February 2018 by dwilder

Use the Folder and Filename check boxes on the Indexing & File Naming step in the Job Settings Wizard to indicate whether field values will be used to generate subfolders or filenames. Any field with the Folder option checked will create nested subfolders for each value in the order the fields are listed. Any field with the Filename checked will have the values concatenated to form the filename.

For example, if Field 1 and Field 3 have the Folder option checked, and Field 2 and Field 3 have the Filename option checked, image filenames will be created in the format:

%OUTPUTFOLDER%\Field 1\Field 3\Field 2 – Field 3.tif

The Filename Separator option on the Advanced tab lets you change the ” – ” between the fields in the filename to anything you want.

Related Pages

  • SimpleIndex Wiki – File Naming Schema
  • SimpleIndex Wiki – Indexing & File Naming
Automatic Data CaptureAutomatic Indexing SoftwareFile IndexingFull Text IndexingKeyword IndexingMetadataMicrosoft Word Data ExtractionOffice PDF Document IndexingPDF Data Extraction SoftwareScanned Document Indexing
Read more
  • Published in Export
No Comments

Automatic Indexing Using Existing Data

Wednesday, 24 January 2018 by Simple Software

Automatic Indexing Using Existing Data

The Autofill feature of SimpleIndex is an easy way to associate many index fields with one document without retyping data that already exists in another database. Autofill uses a database lookup to retrieve records that match a key value entered by the user. Blank index fields are then filled in automatically with the data from this lookup. The result is a document database with many different possible search fields, of which only one needed to be entered during scanning.

The key field may be typed by the user, or it may be read from the document automatically using barcode recognition or OCR. The lookup is performed either when the user changes this field or when the index values are saved. If the lookup finds multiple matching records, the user will be notified and the first set of values will be used by default.

When used with pre-index batches, key information can be read automatically from barcodes or OCR and matched to database records with a single click. Search on up to 99 index fields without a single keystroke!

KB Articles for Automatic Indexing

  • Exclude Index Field from Index Log
  • Turn Off Prompts and Pop Ups on Job Configurations
  • Change the Font Size of Index Fields
  • Large documents (>500 pg) Slow to Process - Workaround
  • Regular Expression (RegEx) - Syntax or Type
  • Index With Non-Latin Character Sets
  • Skip to Blank Index on Save Index
  • Stop Autorun When Double Clicking Configuration
  • Autonumber Increment Value
  • Overlap of SimpleView Viewer in SimpleIndex Display
1-Click Processing, Automatic Data Capture, Automatic Indexing Software, Barcode Recognition Software, Database, Database Autofill, Document Automation, File Indexing, File Indexing, Full Text Indexing, Keyword Indexing, Metadata, Microsoft Word Data Extraction, OCR, Office PDF Document Indexing, PDF Data Extraction Software, Scanned Document Indexing, Scanning Software
1-Click ProcessingAutomatic Data CaptureAutomatic Indexing SoftwareBarcode Recognition SoftwareDatabaseDatabase AutofillDocument AutomationFile IndexingFull Text IndexingKeyword IndexingMetadataMicrosoft Word Data ExtractionOCROffice PDF Document IndexingPDF Data Extraction SoftwareScanned Document IndexingScanning Software
Read more
No Comments

OMR Optical Mark Recognition

Tuesday, 23 January 2018 by Simple Software

Simple Checkbox Recognition

Some forms require scanning software to recognize the presence or absence of a mark in a particular location, such as a checkbox, without worrying about the specific shape or symbol drawn therein. The ability to do this is called Optical Mark Recognition, or OMR. Let’s take a look at how this feature can help you index your documents and how SimpleIndex improves upon the standard OMR process:

Optical Mark Recognition

Optical Mark Recognition lets you define check box regions on scanned images. OMR is very fast and can be used for a variety of applications:

  • Business reply mail
  • Simple surveys
  • Separate multi-page documents
  • Document routing control
  • Verify presence of signatures

To configure OMR, use an unfilled form to obtain baseline counts of how many black “pixels” are in the box. When processing, SimpleIndex compares the amount of black in each image to the baseline value to determine if the box is checked or not.

With OMR, it is very important that the check boxes appear in the same place on every scan, and that other text on the document does not move into your check box zone. For best results, use large boxes with plenty of white space around them.

We wouldn’t recommend using the SimpleIndex OMR feature to grade the SATs, but if your documents include a few check box values that you want to capture the SimpleIndex OMR feature is what you need. For more advanced OMR and forms processing solutions, please visit ScanStore.com.

OMR Document Separation

SimpleIndex includes a unique use for mark recognition that can save you thousands on document separator pages. One of the most labor-intensive parts of scanning multi-page files is detecting where one document ends and the next one starts. Traditionally this has been done with blank pages (which doesn’t work with 2-sided documents) or barcodes and patch codes (which must be printed). All of these solutions require someone to insert a piece of paper between each document before scanning, wasting time, money, and paper.

  • Using the OMR feature in SimpleIndex, create a checkmark field in the upper-left corner of the page.
  • Create an Autonumber field that increments a document number each time the checkmark is found.
  • Use this job file to scan and separate documents into multi-page files.
  • Create a 2nd job file to index the multi-page files.
  • Use the Post-Process feature to run the two jobs consecutively.

When prepping files, simply take a felt tip pen and put a small mark the upper-left corner on the first page of each new document. This can be done very quickly, creates no additional paper and has a negligible effect on scan quality.

KB Articles for Optical Mark Recognition

  • Autonumber Increment Value
  • How do you configure OMR fields for check box recognition?
Automatic Data Capture, Batch Scanning, OCR, OMR, Optical Mark Recognition, Watermark PDF Files
Automatic Data CaptureBatch ScanningOCROMROptical Mark RecognitionWatermark PDF Files
Read more
No Comments

Streamlined Interface

Tuesday, 23 January 2018 by Simple Software

Maximum Data, Minimum Clicks

As with any repetitive task, a few seconds saved scanning and filing a single document quickly adds up to dozens or hundreds of hours over the course of a long project or daily routine. The most import part of planning your document capture project is to find the most efficient way to file them correctly. Creating an efficient workflow will save you countless hours of labor over the life of your project.

SimpleIndex is faster and easier because it is designed to perform all of the steps necessary to scan or import documents, process, verify and export them in one continuous workflow rather than requiring the user to click extra buttons each time to initiate the next step. When taken to the extreme, SimpleIndex is capable of performing all of these tasks automatically with just a single mouse click.

SimpleIndex does this by saving all of the settings for a document capture workflow to a file that can be opened just like an Office document. This file is configured by the administrator so the user doesn’t have to see any of the technical details. Very rarely does the operator need to be able to change, for instance, the export file format and file naming scheme. So why do some applications show you a complicated export settings screen every time you try to save a batch? It is this attention to detail that allows SimpleIndex to process the same batch 35-75% faster than its competitors.

SimpleIndex also has the ability to pre-set index values and run jobs using the Command Line Interface. More on this design feature can be found on our Getting Started page.

Index Automation Features

The two main methods for automating indexing are Barcode Recognition and Optical Character Recognition (OCR).

Barcode recognition is faster and more accurate, but your documents must contain a barcode on the document or a cover page for this to work.

OCR is able to read printed data directly from the page, which means most documents can be processed as-is. However it is not 100% accurate and usually requires some human review. Handwriting can be recognized as well, using the Cloud OCR option.

If your index data already exists in another database, SimpleIndex has features that can make use of this data to automate processing. The Index Autofill feature matches data read from barcodes or OCR to data in your database, verifying the correct value is read and populating additional search fields automatically.

Paper and Electronic Documents

Traditional document capture is focused on digitizing paper documents with a document scanner. However, more and more documents are living their best lives as native PDF and Word files, never once having to enter our physical realm.

SimpleIndex is designed to handle both scanned physical documents and electronic files in their native format seamlessly. The OCR function will use existing text from any PDF file or Office document when it is available, or automatically OCR scanned images when it isn’t.

Use the built-in SimpleView viewer to view most common file types, or use the PDF editor and word processor of your choice to provide full editing capabilities embedded right within the SimpleIndex application.

It can also simultaneously scan and import documents from a hotfolder into a single batch. So if, for example, you receive both paper and email invoices, you can process your day’s work all at once with just one click!

Using Pre-Indexed Batches

The Pre-Index Batch feature of SimpleIndex is what enables 1-click scanning and indexing, as well as command line and unattended processing.

Pre-indexing lets you set fixed values for index fields and apply them to a whole batch. These can be combined with automatic values from barcode recognition, OCR and Autofill to create fully automated batch processes that can be launched from your custom application, a desktop shortcut, scheduled server task or even linked to the scan button on your scanner.

KB Articles for Streamlined Interface

  • Features
  • Take control of Sales Tax exemption forms
  • Reduce Click Charges for Data Capture
  • Instant Integration With Any Application
  • Indexing Solutions with Barcode Recognition
  • Automated Processing & 1-Click Interface
  • Full-Page OCR Indexing Demo
  • Video Demos
  • Network Scanners & Copiers
  • The All-In-One Scanning & Sorting Tool
Automatic Data Capture, Barcode Recognition Software, Batch Scanning, Command Line Interface, Database, Document Automation, Document Classification, Document Imaging, Fast Scanning, OCR, Office PDF Text Processing, RPA, Scanning Software, Solution, TWAIN & ISIS Scanning, Unattended, Workflow, Workflow Software
Automatic Data CaptureBarcode Recognition SoftwareBatch ScanningCommand Line InterfaceDatabaseDocument AutomationDocument ClassificationDocument ImagingFast ScanningOCROffice PDF Text ProcessingRPAScanning SoftwareSolutionTWAIN & ISIS ScanningUnattendedWorkflowWorkflow Software
Read more
No Comments

Scanning & Document Migration for SharePoint

Friday, 12 January 2018 by Simple Software

SimpleIndex gives you an affordable, automated way to populate custom metadata tags when migrating documents to SharePoint. SimpleIndex uses a variety of methods to extract data, including zone OCR, barcode recognition, mark recognition (OMR) and text pattern matching. The data is assigned to index fields that correspond to the custom columns in your SharePoint document library. The extracted values can be reviewed and corrected if necessary before uploading.

Uploading individual documents to SharePoint is easy. The hard part is migrating thousands of files and tagging them with custom metadata that can be used for fast, precise searching and sorting.

Without metadata, only the filename is used for searching scanned documents. OCR can be used to enable text searching of images, but there are several limitations to text searches:

  • OCR cannot recognize any handwritten data
  • Low quality scans or documents with complex layouts often OCR poorly
  • OCR errors mean that you won’t always find all the files you are searching for
  • Cannot perform date range or number range searches
  • Cannot sort results by anything other than title and create/modified date

Advantages of the SimpleIndex SharePoint migration solution include:

  • Complete document imaging solution as well as electronic document migration
  • Advanced text pattern matching finds precise data in unstructured text
  • Parse folder and file paths to find metadata values
  • Unattended server processing mode available
  • Easy yet powerful alternative to managed metadata services
  • Far more features for the price than other SharePoint migration tools

KB Articles for SharePoint Migration

  • How to connect to a SharePoint Online site that uses multi-factor authentication
  • Reset SharePoint Login Information
  • How to Fix SharePoint Login Issues
  • Troubleshooting SharePoint Permissions Issues
  • SharePoint Managed Metadata
  • Can SimpleIndex integrate with Microsoft SharePoint?
Automatic Data CaptureDocument Management SoftwareScanning SoftwareServer OCRSharePoint Scanning
Read more
No Comments

MS Office & PDF Text Parsing

Tuesday, 03 October 2017 by dwilder

Office Videos | PDF Video

The template and dictionary matching capabilities of SimpleIndex‘s OCR function can be used to extract index information from the text of existing MS Office and PDF files, or any file with an accompanying TXT file. SimpleIndex® will search the document for matches on unique patterns and value lists, then index the document with the matching data. Zone coordinates can be set to limit the search area to pre-defined regions on standard forms. The result is a fully automated indexing and renaming process for all your electronic documents!

Using existing text, SimpleIndex can index and rename hundreds of files each minute and achieve perfect accuracy. These files can then be quickly searched with SimpleIndex Retrieval, SharePoint and Google search engines, or uploaded into your company’s document/content management system or custom business applications.

Enhanced Text Parsing & PDF Support

PDF Form Read Write DataMS Office and PDF text parsing features are now included in the Basic version of SimpleIndex, making it much more affordable to enable automatic document sorting on the desktop. Additional Office and PDF features include:

  • Convert any MS Office, HTML, XML and image files to PDF before processing
  • Read and write password protected PDF file
  • Searchable PDF output (Image + Hidden Text)
  • Interactive template builder and tester
  • Easily select PDF or PDF/A output format
  • Native PDF viewer and auto-repair of problematic PDFs
  • Read data from PDF forms
  • Populate blank PDF forms with index data

Batch Convert Office Documents to PDF

If you have Microsoft Office or OpenOffice installed, you can use SimpleIndex to automatically convert MS Office documents to PDF files for archival. PDF files are better for archival than editable formats like Word and Excel. They can be annotated, encrypted, searched and viewed with free PDF readers.

There are many free applications that let you convert documents to PDF one at a time. SimpleIndex lets you convert thousands of files at once while it also extracts data from the text for indexing or data entry automation. This feature is ideal for migrating or archiving Office documents to SharePoint, document management systems and custom web applications.

Quickly Organize Any File on Your Computer

SimpleIndex lets you process any type of file on your computer. If an OLE-enabled viewer is installed, SimpleIndex will display the document on the screen. Other documents can be opened automatically in their default application when they are indexed. Quickly type index field data that can be used to reorganize the files into subfolders and structured filenames for browsing and searching on your network, or uploaded to your document/content management system or custom business application.

If the file has an accompanying text file (*.TXT) with the same name, the text in that file can be used for index field extraction, fully automating the process.

Viewing & Indexing MS Office Documents

SimpleCoversheet Barcode Indexing CoversheetsSimpleIndex features full support for viewing and editing MS Office documents (Word, PowerPoint and Excel) on computers with or without MS Office installed. The full application interface is displayed within the SimpleIndex viewer, letting users view the full content of the documents, edit them with all the features of MS Office and save the changes. Modify privileges can be denied using Windows file security or by the SimpleIndex administration wizard to keep out unauthorized changes.

If MS Office is not installed, SimpleIndex can open and display them in the built-in viewer in read-only mode.

KB Articles for MS Office & PDF Text Parsing

  • Change the Dictionary Separator Value
  • Regular Expression (RegEx) - Syntax or Type
  • Check and Repair All PDF Files
  • Keep Pages in Original Order when Bookmarking
  • Do Not Combine Pages to 1 Bookmark
  • Can I split a PDF based on bookmark values?
  • Is it possible to search for and retrieve documents with Windows desktop search?
  • Can SimpleIndex read bar codes from existing PDF files?
  • Is there a way to just use part of a bar code or OCR value? For example, extract "50" from the value "124450"
  • How do you configure OCR to read index information from MS Office or PDF documents?
Automatic Data Capture, File Indexing, Microsoft Word Data Extraction, MS Office, Office PDF Document Indexing, Office PDF Text Processing, Office to PDF, Paperless Office, PDF, PDF Archive Scanning Software, PDF Barcode Recognition, PDF Data Extraction Software, PDF Forms, Text Processing, Unattended Processing
Automatic Data CaptureFile IndexingMicrosoft Word Data ExtractionMS OfficeOffice PDF Document IndexingOffice PDF Text ProcessingOffice to PDFPaperless OfficePDFPDF Archive Scanning SoftwarePDF Barcode RecognitionPDF Data Extraction SoftwarePDF FormsText ProcessingUnattended Processing
Read more
No Comments

Search

Contact Us Today!

=

Search Knowledge Base

Recent KB Articles

  • Database Export Error
  • SimpleIndex Standard Workstation
  • SimpleIndex OCR Workstation
  • SimpleIndex Barcode Workstation
  • SimpleIndex Professional Workstation
  • SimpleIndex Barcode Server 1M
  • Simple Software Server Processing Add-on for SimpleIndex
  • SimpleIndex Barcode Recognition Add-on Workstation

Feature Cloud

Microsoft Word Data Extraction QR Code Invoice OCR SimpleCoversheet OCR Form Processing XML Remote Capture TWAIN File Indexing Optical Character Recognition XSLT Data Conversion Software PaperVision Clipboard OCR Document Numbering System Paperless Office Barcode Recognition Software Solution Zone OCR TWAIN Scanning Software TIFF Bar Codes PDF Compression Invoice Scanning Software Keyword Indexing Unattended SharePoint Migration Screen Scraping OCR Database Text Processing Bates Numbering Software Bar Code Printing SAGE Distributed Scanning Required Documents Auditing PDF Barcode Recognition Automatic Data Capture Mortgage Watermark PDF Files Search Business Process Automation Document Classification SimpleSend MS Access TWAIN & ISIS Scanning OMR

Online Support Options

Check our Wiki Help, Knowledge Base and Training Videos, or Contact Support if you still need Help

How to Buy

Solutions start at just $500! Buy SimpleIndex online or from an Authorized Dealer in your area.

Authorized Dealers

Authorized DealersSimpleIndex is a great addition to any system integrator's product line. Become an Authorized Dealer.

Get a Web Demo

Get a free online demo with a scanning specialist who can configure SimpleIndex on your computer remotely.
Sign up for a demo now!

Download a Trial

SimpleIndex Trial30-day trial downloads are available for all Simple Software applications.
Download Now!

SimpleIndex Applications

SimpleIndex Applications Packaged apps built with SimpleIndex.
SimpleInvoice for AP
Sales Tax Manager
Mortgage LoanStacker
MSDS and Patents
SimpleIndex

© 2022 Meta Enterprises, LLC | Knoxville, Tennessee | A Family Owned Company
© 2022 SimpleSoftware | Consulting Services in the Field of Software as a Service

TOP
Manage Cookie Consent
We use cookies to optimize our website and our service.
Functional cookies Always active
The technical storage or access is strictly necessary for the legitimate purpose of enabling the use of a specific service explicitly requested by the subscriber or user, or for the sole purpose of carrying out the transmission of a communication over an electronic communications network.
Preferences
The technical storage or access is necessary for the legitimate purpose of storing preferences that are not requested by the subscriber or user.
Statistics
The technical storage or access that is used exclusively for statistical purposes. The technical storage or access that is used exclusively for anonymous statistical purposes. Without a subpoena, voluntary compliance on the part of your Internet Service Provider, or additional records from a third party, information stored or retrieved for this purpose alone cannot usually be used to identify you.
Marketing
The technical storage or access is required to create user profiles to send advertising, or to track the user on a website or across several websites for similar marketing purposes.
Manage options Manage services Manage vendors Read more about these purposes
View preferences
{title} {title} {title}
});