SimpleIndex lets you create searchable PDF documents from scanned images using OCR to convert the pages to text and overlay it on the original scan. This creates a unique scanned document that’s fully searchable and lets you highlight and copy text, while preserving the original page formatting for readability.
Unlike other basic OCR applications, SimpleIndex also lets you automatically tag and organize documents using keywords identified with pattern matching, database lookups or bar codes. Structured data can be used to populate a database, document management system, SharePoint and other repositories.
Full-Page OCR Indexing Demo
This sample job demonstrates the ability for SimpleIndex to convert scanned documents to searchable PDF files and extract index data from the OCR text. It also demonstrates the multi-user workflow capabilities.
Step 1 uses a full-page OCR process on each image.
Field data is extracted from the full-page text using template and dictionary matching algorithms.
This is done in Pre-Index mode to allow unattended processing.
Data is saved to a database so it can be reviewed and corrected in Step 2.
Step 2 uses Database Update mode to find images with missing index values and allow the user to manually enter the correct data.
Step 3 uses a SimpleSearch configuration to search and view the indexed images, including full text searches.
Find Out More
- Download or get an Online Demo
- Dynamic OCR Features in SimpleIndex
- Full-Page OCR Wiki Pages
- OCR Features and Settings Wiki Pages
- OCR Software Guide on SimpleOCR
Learn More:
PDF Text Processing Demo
This sample job demonstrates the PDF text processing capabilities of SimpleIndex by extracting the Document Number, Date, Document Type, Customer and Total from a number of documents without OCR, by processing the text layer of PDF files.
Computer-generated PDF files, such as those created using PDF printer drivers, already contain digitized text. SimpleIndex reads the text and performs Template and Dictionary Matching to locate and extract the correct data values from the text.
Since the existing text is being used, OCR is not performed. This makes processing much faster and 100% accurate, especially compared to solutions using zone OCR.
While this demo runs interactively, text processing jobs can run in unattended mode since the data does not need to be verified.
Full-Page OCR can also be used to get text from scanned PDF files with no existing text. SimpleIndex will also detect when a PDF file has existing text and only perform OCR on the documents that need it to improve performance.
Find Out More
- Download or get an Online Demo
- PDF Text Processing Features in SimpleIndex
- PDF Features and Settings Wiki Pages
- Full-Page OCR Wiki Pages
- OCR Features and Settings Wiki Pages
- OCR Software Guide on SimpleOCR
Learn More:
FAQ Related to PDF Text Processing
- I have a duplex scanner. How to set up SimpleIndex to scan two sided documents automatically?
- Features
- Patent ID and Title Extraction
- Take control of Sales Tax exemption forms
- Instant Integration With Any Application
- Affordable Document Management
- Indexing Solutions with Barcode Recognition
- Document Classification
Searching and Viewing Documents
If you have not yet decided on a plan for how to organize your electronic documents for later retrieval, you should take some time to consider the possible options.
There are several ways to search and view documents processed with SimpleIndex®
- Use SimpleSearch to use keyword searches to find and view indexed documents
- Use SimpleView to browse folders, search files, view, edit and annotate scanned documents without a database
- Use a Document Management System for integrated searching, viewing, workflow, security, compliance auditing and other records management functions
- Use Cloud Storage platforms like Google Drive, Box and OneDrive
- Use SharePoint to share documents online with custom metadata, create custom document workflows and employ records management standards
- Link files to a custom application using the Command Line Interface or RPA bot
- Integrate with your Database to associate documents with records via link or binary field
- Work with our Professional Services Team or an Authorized Dealer to create a customized solution or direct integration with virtually any application
Choosing a Document Management Solution
Given the variety of free or very low cost file storage solutions available, why would you invest thousands of dollars in a document management system?
- When high security or access tracking logs are required
- Compliance with regulations like HIPAA, Sarbanes-Oxley, FINRA, FOIA, SEC, etc.
- There are document-based workflows that can benefit from automation
- Users need to view files without installing software licenses (like DWG, VSD or PSD)
If you already have a database or business app that you use to search for records, and that application has the ability to store or link external documents to those records, this is usually the best choice.
If your business application doesn’t have document management capabilities, there are integrations that can overlay a button or hotkey that lets users open associated files from any screen.
If your business has many different types of documents spread across multiple departments that use different applications, and they sometimes need to be able to access each others’ documents, then a central repository is the better solution.
Cloud Storage platforms like Google Drive, Box and OneDrive provide low-cost, secure online storage that makes it very easy to share documents worldwide on any device. However they don’t have the ability to do field-level indexing to allow for more granular searches, lack the compliance tracking features of a more robust DMS, and don’t have integrated viewers that can display some of the less common file formats without having the application installed.
Find Out More
Read more about Affordable Document Management solutions with SimpleIndex.
Check out our How to Scan Documents for a detailed guide to creating scanning and retrieval systems.
Our Professional Services department can help you with every step of our project, and often have you up and running in just a couple of hours.
Please Contact Us to schedule a demo or ask us any questions you have!
Learn More:
KB Articles for Document Management
- Oracle database is slow to respond
- What is Document Imaging?
- Using alternate database schemas
- Multiple Sort Fields on Search
- Access Database Connection String
- How do I delete an image and it's database entry?
- Is it possible to search for and retrieve documents with Windows desktop search?
- Will your SimpleQB allow me to scan in old invoices or bank statements directly into QuickBooks?
- How do I use the Media Wizard to create searchable DVDs or thumb drives?
- How do I export index data to a database?
- 1
- 2