Adobe Acrobat Portable Document Format processing.
Can I split a PDF based on bookmark values?
SimpleIndex can create PDF files with bookmarks based on the index data captured in your batch.
Going the other way, splitting an existing PDF file based on its bookmark values, is not a built-in feature of SimpleIndex. However, there are inexpensive command-line utilities that you can integrate with SimpleIndex to accomplish this.
For example, the CoolUtils PDFSplitter and A-PDF Split both offer this function starting around $35.
The command line to split the PDF can be integrated into the Pre-Process setting in SimpleIndex, found under the Advanced Settings section of the Configuration Wizard. An example pre-process using PDFSplitter to split based on the second level bookmark values would be:
PDFSplitter.exe "c:\Images\BookmarkFile.pdf" "%CONFIGFILEFOLDER%\Input" -em bookmarks -b 2
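If you would rather call the splitter from a script than type the command by hand, the same command line can be assembled programmatically. The following is a minimal Python sketch, not part of SimpleIndex: the function names are hypothetical, the flags are copied from the example above, and it assumes PDFSplitter.exe is on the system PATH. Note that %CONFIGFILEFOLDER% appears to be a placeholder that SimpleIndex fills in, so a script run outside SimpleIndex would need a real folder path instead.

```python
import subprocess


def build_split_command(source_pdf, output_folder, bookmark_level=2,
                        exe="PDFSplitter.exe"):
    """Build the argument list for the bookmark split shown above.

    The flags (-em bookmarks -b N) are taken from the example command;
    check them against the documentation for your PDFSplitter version.
    """
    return [exe, source_pdf, output_folder,
            "-em", "bookmarks", "-b", str(bookmark_level)]


def split_by_bookmarks(source_pdf, output_folder, bookmark_level=2):
    """Run the splitter and raise an error if it exits with a failure code."""
    cmd = build_split_command(source_pdf, output_folder, bookmark_level)
    # Passing a list lets Python handle quoting of paths that contain spaces.
    return subprocess.run(cmd, check=True)
```

Calling split_by_bookmarks(r"c:\Images\BookmarkFile.pdf", r"c:\Jobs\Input") would run the equivalent of the command above, with a hypothetical output folder in place of the SimpleIndex placeholder.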
What is the point of SimpleQC?
SimpleQC is now SimpleView, with many enhancements. In a nutshell, it is designed to let you quickly browse folders containing multi-page TIFF or PDF documents. The two main uses for this are:
1. Review scanned documents for Quality Control
Occasionally a scanned document will be too light or too dark to be read. This can happen quite often with some types of paper. Use SimpleView to find these pages quickly and rescan them. You can also correct page order, rotation, skew, etc.
2. Use as a document viewer
SimpleIndex and other scanning applications create folders and files on your hard drive or network to store documents. Use SimpleView to quickly browse image thumbnails by folder and filename. Auto-rotate, enhance and OCR images as needed.
SimpleView is different from other thumbnail viewers because:
- It loads multi-page TIFF files very quickly
- It displays thumbnails for files as well as pages within multi-page files on the same screen
- It has many functions for document QC, such as auto-selecting even and odd pages or files for rotation, and rescanning pages
- It displays thumbnails for PDF files and opens them in the Acrobat viewer
- With Acrobat Standard or Pro, you can enable editing and signing of PDF files
- Viewing of office documents and other electronic formats is also available
Is it possible to search for and retrieve documents with Windows desktop search?
Windows Search works great with SimpleIndex because all index data can be saved to the folder and file names as well as the file properties, and OCR text can be saved to hidden layers in PDF files. Windows Search will read all of these elements when building its index and will return any matching files when you search.
Using Windows Search on a file server allows for instantaneous searching across terabytes of documents and text for all of the users on your network.
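To see why saving index data in folder and file names makes documents easy to find, consider the short Python sketch below. It is only an illustration of the idea and does not call Windows Search: the function names and the example folder layout are hypothetical, and the matching is a plain case-insensitive substring test on each part of the path.

```python
from pathlib import PureWindowsPath


def matches_index_terms(path, terms):
    """Return True if every search term appears in some folder or file name."""
    parts = [part.lower() for part in PureWindowsPath(path).parts]
    return all(any(term.lower() in part for part in parts) for term in terms)


def find_documents(paths, terms):
    """Filter a list of document paths down to those matching all terms."""
    return [path for path in paths if matches_index_terms(path, terms)]


# Hypothetical layout: document type and customer as folders,
# invoice number as the file name.
documents = [
    r"D:\Scans\Invoices\Acme\INV-1001.pdf",
    r"D:\Scans\Invoices\Globex\INV-2002.pdf",
]
acme_invoice = find_documents(documents, ["acme", "inv-1001"])
```

Windows Search does the same kind of matching against a prebuilt index, and additionally covers file properties and text inside the files, which is why it stays fast on large document stores.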
IFilters allow Windows Search to search within file contents.
Here are three popular PDF IFilters that will enable text searching for PDF files:
- Foxit PDF IFilter (commercial)
- TET PDF IFilter (free/commercial)
- Adobe PDF IFilter (32-bit / 64-bit) (free)
If you have issues with PDF text searching in Windows 10, this article has detailed instructions for resolving PDF IFilter issues:
https://fixedit.itxpress.biz/2018/07/05/searching-pdfs-in-windows-10/
Will your SimpleQB allow me to scan in old invoices or bank statements directly into QuickBooks?
SimpleQB enables you to scan and link documents to QuickBooks transactions through a custom menu. When you have an invoice, purchase order, work order, bank reconciliation or other transaction open, simply click the Scan menu to scan a document and save it with that transaction.
To view the documents for a transaction, just click the View menu.
You can also use SimpleSearch to find and view documents scanned with SimpleQB without the QuickBooks interface.
Can SimpleIndex create searchable PDF Image+Text files with hidden text?
Yes, it can. You can configure this setting in the Job Settings Wizard by going to the OCR step and checking "Enable full-page OCR". There are many settings in the OCR step that you can use to customize the output and recognition of images.
SimpleIndex has two different OCR engines (Standard and Professional) that can be used to produce PDF Image + Text files, also known as Searchable PDFs.