Processing and text extraction of Microsoft Office, Adobe PDF files, HTML files and other electronic documents.
Do Not Combine Pages to 1 Bookmark
Monday, 29 July 2019
Please refer to the Wiki Documentation for the complete PDF Bookmarking reference. If you want to keep pages in separate bookmarks, instead of combining them into a single bookmark, when the same bookmark value is found in several interspersed images in the batch, do the following: 1. Open the Job Configuration file in Notepad. 2. Search for this value: <BOOKMARK_PDF1> 3. Enter
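To make the difference between the two behaviors concrete, here is a minimal Python sketch. It is not SimpleIndex code and does not touch the Job Configuration file; the function names and the sample bookmark values are hypothetical. It only illustrates, under the assumption that each page image carries one bookmark value, how "combined" grouping merges interspersed pages while "separate" grouping starts a new bookmark for each consecutive run.

```python
from itertools import groupby


def combined_bookmarks(page_values):
    """One bookmark per distinct value; interspersed pages are merged."""
    bookmarks = {}
    for page, value in enumerate(page_values, start=1):
        bookmarks.setdefault(value, []).append(page)
    return list(bookmarks.items())


def separate_bookmarks(page_values):
    """One bookmark per consecutive run of the same value."""
    result = []
    page = 1
    for value, run in groupby(page_values):
        count = len(list(run))
        result.append((value, list(range(page, page + count))))
        page += count
    return result


# Hypothetical batch: the value "Invoice" appears on pages 1-2 and again on page 4.
pages = ["Invoice", "Invoice", "Receipt", "Invoice"]
combined = combined_bookmarks(pages)
separate = separate_bookmarks(pages)
```

With the sample batch, the combined form yields two bookmarks (the "Invoice" bookmark covers pages 1, 2 and 4), while the separate form yields three bookmarks in page order.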
Can I split a PDF based on bookmark values?
Friday, 12 July 2019
Please refer to the Wiki Documentation for the PDF Bookmarking reference. SimpleIndex can create PDF files with bookmarks based on the index data captured in your batch. Going the other way, splitting an existing PDF file based on the bookmark value, is not a built-in feature of SimpleIndex. However, there are inexpensive command-line utilities that you can integrate with
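Whichever utility is used, splitting by bookmark comes down to turning each bookmark's starting page into a page range. The following Python sketch shows only that calculation. It is an illustration, not part of SimpleIndex or of any particular splitting tool; the function name `split_ranges` and the sample bookmark titles are hypothetical, and it assumes 1-based page numbers with each bookmark starting on a distinct page.

```python
def split_ranges(bookmarks, page_count):
    """Turn (title, first_page) bookmarks into (title, first_page, last_page).

    Each range runs up to the page before the next bookmark; the last
    range runs to the end of the file. Pages before the first bookmark
    are not assigned to any range.
    """
    ordered = sorted(bookmarks, key=lambda b: b[1])
    ranges = []
    for i, (title, first) in enumerate(ordered):
        if i + 1 < len(ordered):
            last = ordered[i + 1][1] - 1
        else:
            last = page_count
        ranges.append((title, first, last))
    return ranges


# Hypothetical 8-page PDF with three bookmarks.
ranges = split_ranges([("A", 1), ("B", 4), ("C", 6)], page_count=8)
```

Each resulting tuple could then be passed as a page range to the splitting tool of your choice to produce one output file per bookmark value.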
Is it possible to search for and retrieve documents with Windows desktop search?
Wednesday, 28 February 2018
Please refer to the Wiki Documentation for the complete Searchable PDF reference. Windows Search works great with SimpleIndex because all index data can be saved to the folder and file names as well as the file properties, and OCR text can be saved to hidden layers in PDF files. Windows Search will read all of these elements
- Published in Database & Retrieval, Export, Office PDF Text Processing
Can SimpleIndex create searchable PDF Image+Text files with hidden text?
Wednesday, 28 February 2018
Yes, it can. You can configure this setting in the Job Settings Wizard by going to the OCR step and checking “Enable full-page OCR”. There are many settings in the OCR step that you can use to customize the output and recognition of images. SimpleIndex has two different OCR engines (Standard and Professional) that can
- Published in Export, OCR, Office PDF Text Processing

