When you scan a document that contains text or numeric data, you can read and understand what is written in the scanned image. To a computer, however, the resulting image file is just as meaningless an assortment of pixels as a landscape photo. To transform this information into an editable format that you can search, copy, and modify without retyping it manually, you need Optical Character Recognition (OCR) software.
There is a wide variety of OCR software available. While all of these products can convert images of machine-printed (not handwritten) text or numbers into an editable format, they often differ in features, accuracy, price, and language options.
You can find the various types of OCR software with a description of each below.
Users within a single department, those working from home, or those who run a small business can simply scan their documents to a folder that is shared with everyone. In this “ad-hoc” scenario you only need some basic document scanning software to simplify and bring consistency to your filing system.
If you want to move to the next level, Desktop Document Management options provide an all-in-one means of capturing, storing, searching, and retrieving documents. They also provide security, advanced capabilities, and ease of use beyond that of the ad-hoc methods.
Simplex versus Duplex scanning is a function of your scanner driver. SimpleIndex uses both TWAIN and ISIS drivers. ISIS drivers are faster for high-speed scanners and are preferred.
To configure duplex on an ISIS scanner:
1. Select “Use ISIS Driver” from the Scan menu if it is not selected
2. If this is your first time using ISIS, click “Select a Scanner” to select the driver for your scanner
3. Click “Display Scanner Settings” to display the ISIS driver settings
4. Find the setting for Simplex/Duplex and set it appropriately
Each scanner model has a different driver interface. Refer to your scanner’s documentation if you cannot find the duplex setting.
To set the scanner settings using TWAIN:
1. Select “Use TWAIN Driver” from the Scan menu
2. If this is your first time scanning with TWAIN, click “Select a Scanner” and select your scanner
3. If “Display Scanner Settings” is not checked in the Scan menu, click it to select it
4. When
Yes, it is possible to scan with Kofax Adrenaline-based scanners using the Adrenaline TWAIN or ISIS data source.
To configure ISIS, you may need to select one of the general Kofax drivers such as “Any Fujitsu Scanner with Adrenaline” or “Kofax Adrenaline Scanner”.
To configure the TWAIN data source:
1. Download the Kofax Adrenaline TWAIN Data Source from Kofax Adrenaline Patches
2. Configure a scan source in KSM (in your computer’s Control Panel) called TWAIN SOURCE
3. Select the Kofax Adrenaline TWAIN driver from the Select Scanner dialog in SimpleIndex
There should be little difference in performance due to using the TWAIN interface, and all the Kofax image processing controls will still be available.
Is there a way to use just part of a barcode or OCR value? For example, extract “50” from the value “124450”.
For this example, create a barcode field (Field 1, for instance) and a second field with type “Fixed”. In the template for the second field, enter %FIELD1[5,2]% to get “50” from “124450”.
%FIELD1% on its own returns the entire value of Field #1, the barcode field. Adding [5,2] tells SimpleIndex to start at the 5th character (the “5”) and take 2 characters from the value, which yields “50”.
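The bracket notation reads like a substring taken from a 1-based start position for a given length. The short Python sketch below only illustrates that arithmetic. It is not SimpleIndex code, and the function name `extract_part` is hypothetical.

```python
def extract_part(value: str, start: int, length: int) -> str:
    """Return `length` characters of `value`, beginning at 1-based position `start`.

    Hypothetical helper that mirrors the [start,length] notation described above.
    """
    if start < 1 or length < 0:
        raise ValueError("start must be >= 1 and length must be >= 0")
    begin = start - 1  # convert the 1-based position to Python's 0-based index
    return value[begin:begin + length]


# Equivalent of %FIELD1[5,2]% when Field 1 holds "124450"
print(extract_part("124450", 5, 2))  # prints 50
```

Counting from 1 rather than 0 is the detail most likely to trip people up: position 5 in “124450” is the “5”, not the “0”.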
I have a scanner with Virtual ReScan (VRS) that is not scanning properly. How do I solve this issue?
Any time that you have a scanner with Virtual ReScan (VRS), you must pick Kofax VRS as the scanner instead of the scanner model itself. With VRS, you assign the scanner to VRS, and then any time you pick Kofax VRS as the scanner in your scanning software, the assigned scanner will be the scanner that is used.
This is done through the TWAIN or ISIS settings for your scanner. To access these settings, select “Display Scanner Settings” from the “Scan” menu. Next, click the “Run Job” button. Before the scanning process starts, your scanner’s TWAIN or ISIS settings dialog box will appear.
Every TWAIN or ISIS dialog is different, but all of them have a clear option for changing the page size or auto-detecting it.
SimpleIndex is compatible with any device that has a TWAIN or ISIS driver. This includes virtually all makes and models of scanner, as well as many specialty scanners, digital cameras and other devices. In the few instances when a scanner has a proprietary driver, it can still be used with SimpleIndex by first scanning to a folder and setting SimpleIndex’s input to “From Folder” on the Input tab.
For many high-speed scanners (over 50 pages/minute), ISIS drivers provide improved throughput versus TWAIN. It is recommended you purchase the ISIS driver option with these scanners. ISIS drivers also let you save your scanner settings to a file that can be distributed with your SimpleIndex configuration.
You can find more information on selecting the best scanner for your specific requirements on the ScanStore Scanners Guide, as well as a wide assortment of scanners available for purchase.
You can tell SimpleIndex what types of files it should process and which file types to ignore. This is done by clicking “Job Options”. On the “Batch” tab you will find a field labeled “Input file types or mask”. These are the file types that SimpleIndex will accept as input. The default types are:
TIF,PDF,JPG,GIF,BMP,DOC,XLS,PPT,DOCX,XLSX,PPTX,VSD,DWG,AVI,MP3
To process all files, enter *
SimpleIndex will ignore any file whose extension does not appear on the list.
In SimpleIndex 6 or above you can enter file masks to filter input files. Some examples are:
abc*.pdf (PDF files starting with “abc”)
ab??ef.* (All files starting with “ab”, then any 2 characters, then “ef”)
It is possible to have some file types open automatically in their default application. This can be done by inserting a pipe “|” into the list. Any file types after the pipe will be opened in their default application. For example:
TIF,PDF,JPG|WAV,M
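The filtering rules described here (an extension list, wildcard masks, and a pipe separator) can be sketched in Python. This is only an illustration under stated assumptions, not SimpleIndex’s implementation: the function names are hypothetical, matching is assumed to be case-insensitive, Python’s `fnmatch` module stands in for whatever mask matching SimpleIndex actually uses, and the setting string "TIF,PDF|TXT" is made up for the example.

```python
from fnmatch import fnmatchcase


def parse_file_types(setting: str) -> tuple[list[str], list[str]]:
    """Split a setting such as "TIF,PDF|TXT" into (input entries, entries after the pipe)."""
    def to_list(part: str) -> list[str]:
        return [item.strip().lower() for item in part.split(",") if item.strip()]

    before, _, after = setting.partition("|")
    return to_list(before), to_list(after)


def matches(filename: str, entry: str) -> bool:
    """Match a file name against one entry: a bare extension, a mask, or "*"."""
    name = filename.lower()  # assumption: matching ignores case
    if entry == "*":
        return True
    if "." in entry or "*" in entry or "?" in entry:
        return fnmatchcase(name, entry)  # treat the entry as a wildcard mask
    return name.endswith("." + entry)  # treat the entry as a plain extension


def classify(filename: str, setting: str) -> str:
    """Return "open" (entry after the pipe), "process", or "ignore"."""
    process, open_externally = parse_file_types(setting)
    if any(matches(filename, entry) for entry in open_externally):
        return "open"
    if any(matches(filename, entry) for entry in process):
        return "process"
    return "ignore"
```

With these assumptions, `classify("abc123.pdf", "abc*.pdf")` returns "process", `classify("notes.txt", "TIF,PDF|TXT")` returns "open", and `classify("photo.png", "TIF,PDF|TXT")` returns "ignore".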
This is done using the “Min File Size” setting on the Batch tab. This setting uses the image file size in bytes to determine which files are blank pages. For 200dpi, compressed, black & white TIFF images (the default format used by SimpleIndex) this is usually around 2000 bytes.
Depending on your scanner settings and how much black is in your images, blank pages could be significantly larger. If you are scanning small pages the margin for error will be less. You will want to do some trial and error testing to ensure the setting is right for your images.
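For the trial-and-error testing, it can help to check a folder of sample scans against a candidate threshold before changing the setting. The Python sketch below is one way to do that and is not part of SimpleIndex. The function name `split_blank_pages` is hypothetical, and treating files strictly smaller than the threshold as blank is an assumption about how the comparison works.

```python
from pathlib import Path


def split_blank_pages(folder: str, min_file_size: int = 2000) -> tuple[list[Path], list[Path]]:
    """Separate the TIFF files in `folder` into (kept, blank) by size in bytes.

    Assumption: a file smaller than `min_file_size` counts as a blank page.
    """
    kept, blank = [], []
    for path in sorted(Path(folder).glob("*.tif")):
        if path.stat().st_size < min_file_size:
            blank.append(path)
        else:
            kept.append(path)
    return kept, blank
```

Running this over a batch that includes known blank pages, and listing the sizes of the files it flags, shows quickly whether 2000 bytes is too low or too high for your scanner settings.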
First, make sure that your scanner’s TWAIN driver (found on the included CD or manufacturer website) is installed and that the scanner shows up in the ‘Scanners and Cameras’ section of the ‘Control Panel’. On the Batch tab of Job Options, change the Input Files From setting from Folder to Scanner. With this option set, the scanner automatically starts when you click the Run Job button to process a new batch. From the Scan menu you have the option to select your scanner, set compression settings, display or hide the scanner’s TWAIN settings interface, or have SimpleIndex prompt the user when the feeder is empty. The Scan to Input Folder command in the Scan menu lets you separate the scanning from processing. Use this to scan a sample image to draw OCR zones. Images are scanned to numbered TIFF images in the designated Input Folder.
So you want to organize your documents using barcodes? That is an excellent idea! Not only will it improve the speed and accuracy of your document management workflow, but it is easier to set up than it sounds. This guide is written to give you real information instead of marketing, but you can follow the links to read about the relevant features of SimpleIndex and other document management solutions on ScanStore.
Scanning Guides? Ain’t nobody got time for that! Read no further! Contact our experts and we’ll configure the whole scanning process for you remotely using the demo version of SimpleIndex!
Why use barcodes in document scanning?
There are many benefits to using barcodes while scanning your documents. Traditional scanning methods require you to scan your documents in pre-separated batches and then manually name and organize the resulting files. Barcod
The best way to see how the SimpleIndex processing workflow compares to other leading desktop scanning solutions is to see the same process performed side-by-side in each program. Below are videos we recorded of the same batch of documents being scanned and indexed in Kofax Express™, Kodak Capture Pro™, PaperVision™ Capture Express and Office Gemini DiamondVision™. In each one we configured the software to perform the same tasks:
1. Scan a batch of 10 pages
2. Capture a 7-digit account number using Zone OCR
3. Correct any fields that fail to recognize
4. Use a database lookup to populate additional index fields
5. Export the batch to PDF files
Using our standard benchmark batch* we recorded the following processing times:
SimpleIndex: 0:45
Kodak Capture Pro: 1:50
Kofax Express: 2:20
PaperVision Capture Desktop: 3:00
DiamondVision: 3:20
As you will see in the videos below, SimpleIndex provides the most efficient scanning and indexing workflow of any major document capture application.
SimpleIndex™ [vi