Automatic Indexing Using Existing Data
The Autofill feature of SimpleIndex is an easy way to associate many index fields with one document without retyping data that already exists in another database. Autofill uses a database lookup to retrieve records that match a key value entered by the user. Blank index fields are then filled in automatically with the data from this lookup. The result is a document database with many different possible search fields, of which only one needed to be entered during scanning.
The key field may be typed by the user, or it may be read from the document automatically using barcode recognition or OCR. The lookup is performed either when the user changes this field or when the index values are saved. If the lookup finds multiple matching records, the user will be notified and the first set of values will be used by default.
When used with pre-index batches, key information can be read automatically from barcodes or OCR and matched to database records with a single click. Search on up to 99 index fields without a single keystroke!
Use SQL Server, MS Access, Oracle, MySQL, Excel and other databases to match documents with existing data to organize files and assign metadata fields.
Using alternate database schemas
In SQL Server, Oracle and MySQL (among others) user logons are associated with a default “schema” that determines which tables and views they can see.
Users may have access to other tables and views that aren’t in their schema. When connecting to the database these tables will show up in the list, but when you try to get the list of fields you get an “Invalid object reference” error.
To reference these tables, use [Schema].[Table] for the table name to explicitly reference the table.
You can also change the user’s default schema on the server to the one matching the table.
A third option is to create a view under the user’s default schema that selects the necessary records.
At the present time you cannot enter [Schema].[Table] in the wizard, so if you need to use this workaround configure the job in the wizard and modify the table name in the old job options screen.
Access Database Connection String
If you would like to create a database string for an Access database that includes a password use this format:
Provider=Microsoft.Jet.OLEDB.4.0;Data Source=path\database.mdb;Jet OLEDB:Database Password=db_password;
How do you configure a field to select from a list of possible values?
List fields are populated using a text file or database field containing the values for that list. The text file scenario will be described first.
To configure a list field, go to the Index tab in the Job Options. Create a list field by selecting “List” for the field type and give it a name. In the “List File/Field” setting for this field, put the full path or UNC to the text file containing the list or click “Set” to browse to the file. If you have not created a file, you can put the path where you want the file stored here and click the “Edit” button. This will open the file in Notepad; new files will prompt you to create them.
On each line of the text file you will put one of the possible values that you want to be able to select from while you are indexing with SimpleIndex. You can copy this information from another source and paste it into the text file.
Save the file in Notepad and close it. The List field is now configured!
To use a database field, your configuration must be connected to a database using the settings on the Database tab. Any fields defined as “List” on the Index tab that have a corresponding field mapped on the Database tab will use the unique values from that field to populate the list.
You can check “Only allow values in list” to disallow users from entering a value not in the list.
Once the List index field is configured properly you can select the values from a drop down after scanning that appear in the main SimpleIndex window while indexing. It also makes the field autofill the closest match from the list based on the characters that you type in that field.
- Published in Indexing & UI
What is “Match & Attach” mode?
Match & Attach mode lets you batch update multiple records in a database using the index data from your SimpleIndex job. For example, if you have a large backfile of documents that you want to scan and link to records in an existing database, you can use Match & Attach to find the corresponding record and set the Image Path field to the newly scanned file.
For details on how to configure Match & Attach mode, please refer to the manual.
- Published in Database & Retrieval, Export
How do you configure the Autofill feature?
The Autofill feature of SimpleIndex is an easy way to associate many index fields with one document without retyping data that already exists in another application.
Autofill uses a database lookup to retrieve records that match a key value entered by the user. Blank index fields are then filled in automatically with the data from this lookup. The result is a document database with many different possible search fields, of which only one needed to be entered during scanning.
To configure the Autofill feature first go to the Index tab and define the field that will be used to perform the lookup. Each value in this field should correspond to a single record in the database, usually defined as the “Primary Key”. This field can be any type, including Barcode and OCR for full automation.
Next click “Configure Index Autofill” button. Here is where you will setup the database lookup. The “Match Data Source” is configured the same way as the “Data Source” on the Database tab. It can be the path to an MDB file, an ODBC data source or OleDB connection string. Click “Set” to browse to an Access MDB file or type the connection string.
Once the data source is set, the drop-downs for the Table and Field names will be populated. Select the table or query that contains the data, then select the field in the database that will be used for lookup. Under “SimpleIndex Key Field 1” select the field you defined in the first step.
“Key Field 2” gives you the option to use 2 field values to perform the lookup if a single unique matching value doesn’t exist.
Click OK and go back to the Index tab.
Finally set the field type to “Autofill” on every field you want to populate from the database. In the “Template” for all the Autofill fields put the name of the corresponding database field that will be used to populate it.
Now when a value is entered for the key field, the Autofill fields will be filled in automatically from the database!
Cascading List Mode
Cascading List Mode is a powerful new feature introduced in 8.2 that lets you filter each list field by the values you select. For example, you could have lists for Vendor Name and Invoice Number, and when you select a Vendor from the list, the Invoice Number list will only show invoices from the selected vendor.
- Published in Database & Retrieval, Indexing & UI
How do I connect to an SQL Server database?
On the Database tab, choose “SQL Server” from the drop-down labeled “Configure Data Source”. Click the “Start” button.
You will be prompted to enter the SQL Server name and database name. Once selected, an OLE DB data source will be entered under “Data Source”.
You can add a default user to the “Data Source” using the conventions for OLE DB provider strings. With this configuration, all users will use the same SQL Server login. To configure individual logins, check “Require Logon” on the Logging tab. If you would like the users to select their name from a list, add each user ID using the “Add User” button.
- Published in Database & Retrieval
How do I connect to an existing Access database?
To connect to an existing Access database go to Options then the Database tab. There is a drop-down list next to the “Start” button. Click the arrow and select “MS Access” then click “Start”.
An Open Database dialog will be displayed. Select the Access MDB you want to use then click “Open”.
Select the table or query you want to use from the “Table Name” list and click “Reload”.
The field selectors will now be populated with the list of fields from the table/query you selected. The Image Path stores the path to the image file for each record. Each entry in Database Fields list maps the corresponding Field # from the Index tab to the selected field in the database.
Click apply and your Access Database will be configured.
The Log Field, Log Table, Revision Field/Level and the Full Text and PageCount fields on the Advanced tab are option. See the manual for a description of what these items do.
- Published in Database & Retrieval
Indexing from Applications with Screen OCR
Some documents are difficult or impossible to automate with OCR. For example, documents with non-standard layouts, unconstrained handwriting or very poor scan quality. In applications like invoice processing, fully automating the data entry can require expensive software and weeks of consulting. Even after all that expense, many users miss the interface and data validations that their accounting software entry screens provide.
In cases like this, SimpleIndex can help improve data entry efficiency while archiving your scanned originals at the same time. Here’s how it works:
- Scan a batch of documents for data entry
- Place the SimpleIndex window side-by-side with your data entry window
- Enter the data normally, reading from the scanned image in SimpleIndex
- Press the hotkey combo to transfer the data to SimpleIndex
- Save the image and repeat with the next one
In this configuration, SimpleIndex captures an image of the data entry window, then uses OCR to read the data and index the image. Since the data entry screen has a consistent layout and clear, readable fonts, it can be reliably recognized with OCR.
There are several advantages to this approach:
- Configuration and training takes hours not weeks
- Scanned images are indexed with no extra work
- All the advantages of digital docs–security, searching, sharing, etc.
- Use all the data validation features of your software
- No flipping through paper documents
- Operator keeps eyes on the screen and hands on the keyboard
- Data entry can be done remotely
- Data entry performance improves and files are archived at the same time
SimpleInvoice Invoice Processing Solution
SimpleInvoice is a preconfigured solution that uses the OCR and dictionary matching functionality of the SimpleIndex scanning and indexing software to automatically scan, name, and organize incoming invoices into your chosen folder structure of searchable PDF files.
SimpleInvoice requires minimal configuration to get started. It comes with everything you need to index most common invoice styles. The customer and vendor lists, as well as your particular Purchase Order and Invoice number styles, can be customized for your company.
Use SimpleInvoice to:
- Automatically receive and enter invoices in your accounting software, especially QuickBooks
- Create full-text searchable invoice files
- Create an organized filing system for archiving invoices
- Quickly find specific invoices based on vendor, date, invoice number and other index fields
Please Contact Us to find out more about SimpleInvoice!
Template Autofill
This feature added in SimpleIndex 8.1 allows you to spell out the specific OCR pattern of a vendor’s invoice number as a column in your Vendor database. When processing invoices only the template specific to that vendor is loaded and all other templates are ignored. This greatly improves the accuracy of the invoice number capture, since each vendor uses a different label and numbering scheme for their invoices.
FAQ Related to Invoice Processing
- Regular Expression (RegEx) - Syntax or Type
- Will your SimpleQB allow me to scan in old invoices or bank statements directly into QuickBooks?
- Is there a way to just use part of a bar code or OCR value? For example, extract "50" from the value "124450"
- Can SimpleQB be used to scan in receipts and invoices which are then matched to the files kept in the QuickBooks System?
- How do you train the OCR engine for better accuracy?
Compare Leading Solutions
SimpleIndex™
Kodak Capture Pro™
Kofax Express™
PaperVision™ Capture Desktop
Note: This video depicts PaperVision Capture Desktop, a now discontinued software that has since been replaced by the similarly functioning updated version of PaperFlow.
Office Gemini DiamondVision™
Testing Methods
The benchmark times were recorded using all available software shortcuts, and by performing data entry and user interactions as fast as possible. The same scanner and computer hardware was used for each test. Much care was taken to ensure that each application yielded the most accurate OCR results possible given the sample documents. Unfortunately none our competitors could accurately capture the account number on all 10 pages. The extra time to correct these errors accounts for 15-30% of the difference in processing times. The difference in accuracy is due in large part to SimpleIndex‘s pattern matching OCR feature, which the other programs lack.
Database Integration
Unprecedented Control of Database Interactions
Open database integration is a powerful feature of SimpleIndex® and one that furthers its interoperability with custom programs. Instead of using a proprietary database, SimpleIndex allows you to map its index fields to cells in any database table. It can be configured to create new records, update existing ones or retrieve them for viewing. Using these three basic database functions, SimpleIndex is able to interface with and operate on any database.
Automatic Indexing Using Existing Data
The Autofill feature of SimpleIndex is an easy way to associate many index fields with one document without retyping data that already exists in another database. Autofill uses a database lookup to retrieve records that match a key value entered by the user. Blank index fields are then filled in automatically with the data from this lookup. The result is a document database with many different possible search fields, of which only one needed to be entered during scanning.
The key field may be typed by the user, or it may be read from the document automatically using barcode recognition or OCR. The lookup is performed either when the user changes this field or when the index values are saved. If the lookup finds multiple matching records, the user will be notified and the first set of values will be used by default.
When used with pre-index batches, key information can be read automatically from barcodes or OCR and matched to database records with a single click. Search on up to 99 index fields without a single keystroke!
Output to XML or Any Text Format
SimpleIndex 8 offers export to XML files, as well as the SimpleExport module. SimpleExport makes it possible to integrate with any third party database, application, or document management systems (DMS) that accepts an XML or text file import.
SimpleExport converts the index values from a SimpleIndex job to any output format through XSL Transformations (XSLT). You may use one of the built-in scripts to export to a pre-defined system or design your own. SimpleExport runs automatically after each batch is exported, or it can run as an unattended Windows service.
SimpleExport can also be used as a standalone application for automating XSLT transformations, converting CSV or TSV (delimited text files) to XML, converting images to base64 or integrating other scanning applications with third party software.
Match and Attach Mode
SimpleIndex features “Match and Attach” mode, which combines the power of the Autofill feature with the ability to update existing records in your database. This lets you “attach” images to existing database records by “matching” them with values from SimpleIndex.
With other systems you must create a temporary database and write a custom import to match the scanned images to existing records. When indexing with OCR or barcode recognition, the entire process can be accomplished with our 1-click interface.
KB Articles for Database Integration
- Oracle database is slow to respond
- What is Document Imaging?
- Using alternate database schemas
- Multiple Sort Fields on Search
- Access Database Connection String
- How do I delete an image and it's database entry?
- Is it possible to search for and retrieve documents with Windows desktop search?
- Will your SimpleQB allow me to scan in old invoices or bank statements directly into QuickBooks?
- How do I use the Media Wizard to create searchable DVDs or thumb drives?
- How do I export index data to a database?