SimpleIndex PDF archiving capabilities include high-speed scanning, full-text OCR, intelligent data capture with pattern matching and database validation, automatic organization of PDF files into standardized folder and filename conventions and export of documents and data to CSV, XML or any SQL database.
You can set SimpleIndex to assume that it needs to check every PDF file and fix it.
Go to this location in the Windows Registry:
Create a New String Value called “FixAllPDF” and set the value to 1
If you want to keep all the pages in the same order that they were imported, even though they all go with different bookmarks then do the following.
1. Open the configuration in Notepad.
2. Search for <BOOKMARK_PAGE_ORDER>
3. Change this line from “false” to “true”: <BOOKMARK_PAGE_ORDER>true</BOOKMARK_PAGE_ORDER>
4. Save and close.
Windows Search works great with SimpleIndex because all index data can be saved to the folder and file names as well as the file properties, and OCR text can be saved to hidden layers in PDF files. Windows Search will read all of these elements when building its index and will return any matching files when you search. Using Windows Search on a file server allows for instantaneous searching across terabytes of documents and text for all of the users on your network. IFilters allow Windows Search to search within file contents. Here are three popular PDF IFilters that will enable text searching for PDF files: Foxit PDF IFilter (commercial) TET PDF IFilter (free/commercial) Adobe PDF IFilter (32-bit / 64-bit) (free) If you have issues with PDF text searching in Windows 10, this article has detailed instructions for resolving PDF IFilter issues: https://fixedit.itxpress.biz/2018/07/05/searching-pdfs-in-windows-10/
Simplex versus Duplex scanning is a function of your scanner driver. SimpleIndex uses both TWAIN and ISIS drivers. ISIS drivers are faster for high-speed scanners and are preferred. To configure duplex on an ISIS scanner: 1 Select “Use ISIS Driver” from the Scan menu if it is not selected 2 If this is your first time using ISIS, click “Select a Scanner” to select the driver for your scanner 3 Click Display Scanner Settings to display the ISIS driver settings 4 Find the setting for Simplex/Duplex and set appropriately. Each scanner model has a different driver interface. Refer to your scanner’s documentation if you cannot find the duplex setting. To set the scanner settings using TWAIN: 1 Select “Use TWAIN Driver” from the Scan menu 2 If this is your first time scanning with TWAIN, click “Select a Scanner” and select your scanner 3 If “Display Scanner Settings” is not checked in the Scan menu, click it to select it 4 When
Is the document Search/Retrieval and View functions available in SimpleIndex or available only with the SimpleSearch add-on module?
All of the search functions can be used with any SimpleIndex license. SimpleSearch is only needed to enable searching from other workstations.
This is done through the TWAIN or ISIS settings for your scanner. To access these settings select “Display Scanner Settings” from the “Scan” file menu. Next click the “Run Job” button and before the scanning process starts your scanners TWAIN or ISIS settings dialog box will appear.
Every TWAIN or ISIS dialog is different, but any of them have a clear option for changing page size or auto detecting page size.
SimpleIndex is compatible with any device that has a TWAIN or ISIS driver. This includes virtually all makes and models of scanner, as well as many specialty scanners, digital cameras and other devices. In the few instances when a scanner has a proprietary driver, it can still be used with SimpleIndex by first scanning to a folder and setting SimpleIndex‘s input to “From Folder” on the Input tab. For many high-speed scanners (over 50 pages/minute), ISIS drivers provide improved throughput versus TWAIN. It is recommended you purchase the ISIS driver option with these scanners. ISIS drivers also let you save your scanner settings to a file that can be distributed with your SimpleIndex configuration. You can find more information on selecting the best scanner for your specific requirements on the ScanStore Scanners Guide, as well as a wide assortment of scanners available for purchase.
I have a scanner/copier that creates PDF and TIFF files and saves them to my file server. Can I use SimpleIndex to create a searchable CD/DVD from these files?
This feature is included in SimpleIndex at no additional cost and is called the Media Wizard.
The Media Wizard is located in the “Send” file menu and is called Media Wizard. It allows you to burn your images, indexes, a database and a free SimpleSearch viewer for just the CD or DVD. It also provides an easy way to get the maximum amount of information on the media that you want it on.
You set up the Media Wizard by pointing it to your image folder and database and you select the media that you would like to put it on. It then saves a file folder with all the files that you would need in the size of the media you are using in the location that you designate. You then burn these files using the burning application of your choice.
On the Database tab there dropdown in the lower portion of the panel for Full Text OCR Field. Put the name of the field that will store the full-text data there. This must be configured both for Insert and Retrieval mode configurations. The database field needs to be sufficient length to store the entire text of your document. Of course, the Insert Mode configuration must have “Enable Full Page OCR” checked to generate full text data from images. Text from MS Office documents, PDF files and existing OCR text files can be used without setting this option. When designing your Retrieval Mode configuration, create a Text field to use for full text search queries. On the Database tab, set the corresponding “Database Field Name” to the full text database field. When searching on your full text field, SimpleIndex finds the text you enter no matter where it appears in the document. It is able to match partial words. It does not perform boolean or natural language search
MS Office and PDF files generated by software or PDF printer drivers already have the text you need to recognize in the file. Scanned documents need to use OCR to read text from an image of the page. With Office and PDF files, SimpleIndex can just read the text, which is much faster and accurate than image OCR. To recognize index fields from the document text, first create OCR fields on the Index tab as you would normally. Next, on the Zones & OCR options tab, check the “Use Full Page OCR for this Field” option for each OCR field. This tells SimpleIndex to process the existing file text. If the index value is a unique pattern of digits or list of possible values, use Template or Dictionary matching to locate the value within the text. Please see the manual for details on Template and Dictionary matching. If the value appears in a specific location in each file, coordinates can be used to locate it. When processing text, the X, Y, Width and Height settings correspond to
First make sure that you scanner’s TWAIN driver (found on the included CD or manufacturer website) is installed and that the scanner shows up in the ‘Scanners and Cameras’ section of the ‘Control Panel’. On the Batch tab of Job Options change the Input Files From setting from Folder to Scanner. With this option set, the scanner automatically starts when you click the Run Job button to process a new batch. From the Scan menu you have the option to select your scanner, set compression settings, display or hide the scanner’s TWAIN settings interface or have SimpleIndex prompt the user when the feeder is empty. The Scan to Input Folder command in the Scan menu lets you separate the scanning from processing. Use this to scan a sample image to draw OCR zones. Images are scanned to numbered TIFF images in the designated Input Folder.
SimpleIndex® is a great solution for small businesses and departments that need a quick and affordable way to scan, organize and view documents. SimpleIndex provides a wide variety of retrieval options, many of which require no special software to find and view documents. The most affordable solution uses Windows folders and filenames to organize and find documents on a shared network drive. SimpleIndex lets you use index field values to create folders and filenames automatically, automating the process with barcodes, OCR and database lookups where possible. Other applications force the user to create folders and name files manually, making ad-hoc document management too labor-intensive to be practical. You may also use SimpleIndex with SimpleSearch to create a keyword-searchable database that lets you find and view documents based on one or more index values. There are several advantages to using SimpleSearch instead of Windows folders: Find documents based on specific keywords or phr
So you want to organize your documents using barcodes? That is an excellent idea! Not only will it improve the speed and accuracy of your document management workflow, but it is easier to set up than it sounds. The guide is written to give you real information instead of marketing, but you can follow the links links to read about the relevant features of SimpleIndex and other document management solutions on ScanStore. Other Useful Guides How to Scan Documents Scanners Guide OCR Guide ICR Guide Data Capture Guide Software Review Scanning Guides? Ain’t nobody got time for that! Read no further! Contact our experts and we’ll configure the whole scanning process for you remotely using the demo version of SimpleIndex! Why use barcodes in document scanning? There are many benefits to using barcodes while scanning your documents. Traditional scanning methods require you to scan your documents in pre-separated batches and then manually name and organize the resulting files. Barcod