SIGN IN YOUR ACCOUNT TO HAVE ACCESS TO DIFFERENT FEATURES

CREATE AN ACCOUNT FORGOT YOUR PASSWORD?

FORGOT YOUR DETAILS?

AAH, WAIT, I REMEMBER NOW!

CREATE ACCOUNT

ALREADY HAVE AN ACCOUNT?
QUESTIONS? CALL: 865-637-8986
  • SIGN UP
  • LOGIN

SimpleIndex - Document Scanning and OCR Recognition Software

SimpleIndex - Document Scanning and OCR Recognition Software

T (865) 637-8986
Email: info@simpleindex.com

SimpleIndex by SimpleSoftware
500 W Summit Hill Dr SW # 302, Knoxville, TN 37902

  • LEARN MORE
    • GENERAL INFO
      • Getting Started
      • How To Scan Documents
      • Barcode Scanning Guide
      • Searching & Viewing
      • Schedule a Web Demo
      • News & Updates
    • FEATURES
      • Streamlined Interface
      • TWAIN and ISIS Scanning
      • Dynamic OCR
      • Database Integration
      • Required Documents Check
      • Integrated & Unattended Processing
      • SharePoint Document Scanning
    • –
      • Document Classification
      • PDF & MS Office Text Parsing
      • Barcode Recognition
      • Optical Mark Recognition
      • Match Documents to Existing Data
      • Imprinting & Watermarking
      • Screenshot OCR
  • SOLUTIONS
    • General
      • All-In-One Scanning & Sorting Tool
      • Affordable Document Management
      • Instant Integration
      • Network Scanners & Copiers
      • Remote Document Capture
      • Reduce Click Charges for Data Capture
    • Specific
      • Mortgage & Loan Documents
      • Material Safety Data Sheets (MSDS)
      • Patent ID and Title Extraction
      • Federal Tax Returns
      • Invoice Processing
  • SUITE
    • SimpleCoversheet – Print Bar Codes
    • SimpleExport – Data File Converter
    • SimpleView – Search, View & Edit
    • SimpleQB – QuickBooks Integrator
    • SimpleOCR – Freeware OCR
    • Buy Suite Apps
    • Buy Suite Bundles
  • ORDER NOW
    • COMPARE VERSIONS
      • Versions & Feature
      • Price List (PDF)
    • SIMPLEINDEX WORKSTATION
      • Machine License
      • Concurrent User
      • Subscription License
    • SIMPLEINDEX SERVER
    • SUITE APPLICATIONS
    • SUITE BUNDLES
    • SUPPORT & MAINTENANCE
      • Annual Maintenance Renewals
    • FIND A DEALER
      • Dealer Locator
      • Become a Dealer
  • DEMOS
    • TRIAL DOWNLOADS
    • SCHEDULE A DEMO
    • COMPARE SOLUTIONS
    • VIDEO DEMOS
      • Zone OCR with Template Matching
      • Invoice Processing with Full Page OCR
      • PDF Invoice OCR Demo
      • Sort and Index MS Office Documents
    • SIMPLEINDEX UNIVERSITY
      • SimpleIndex University – 100 Series
      • SimpleIndex University – 200 Series
      • SimpleIndex University – 300 Series
  • CONTACT
    • Contact Us
    • Support
    • FAQ
    • Privacy Policy
  • My Account
    • Downloads
  • MY CART
    No products in cart.
HOME > Searchable PDF OCR

Searchable PDF OCR with SimpleIndexSimpleIndex lets you create searchable PDF documents from scanned images using OCR to convert the pages to text and overlay it on the original scan. This creates a unique scanned document that’s fully searchable and lets you highlight and copy text, while preserving the original page formatting for readability.

Unlike other basic OCR applications, SimpleIndex also lets you automatically tag and organize documents using keywords identified with pattern matching, database lookups or bar codes. Structured data can be used to populate a database, document management system, SharePoint and other repositories.

Download document scanning and OCR software.

Regular Expression (RegEx) – Syntax or Type

Monday, 29 July 2019 by Simple Software

The Syntax or Type of Regular Expression/RegEx that SimpleIndex uses is .NET

Barcode OCRClipboard OCRInvoice OCROCROCR Form ProcessingOCR ScanningScreen Scraping OCRScreenshot OCRSearchable PDF OCRZone OCR
Read more
No Comments

Check and Repair All PDF Files

Monday, 29 July 2019 by Simple Software

You can set SimpleIndex to assume that it needs to check every PDF file and fix it.

Go to this location in the Windows Registry:

Computer\HKEY_LOCAL_MACHINE\SOFTWARE\WOW6432Node\SimpleIndex\Misc

Create a New String Value called “FixAllPDF” and set the value to 1

Office PDF Document IndexingOffice PDF Text ProcessingOffice to PDFPDFPDF Archive Scanning SoftwarePDF Barcode RecognitionPDF Data Extraction SoftwarePDF FormsSearchable PDF OCR
Read more
No Comments

Keep Pages in Original Order when Bookmarking

Monday, 29 July 2019 by Simple Software

If you want to keep all the pages in the same order that they were imported, even though they all go with different bookmarks then do the following.

1.  Open the configuration in Notepad.
2.  Search for <BOOKMARK_PAGE_ORDER>
3.  Change this line from “false” to “true”:  <BOOKMARK_PAGE_ORDER>true</BOOKMARK_PAGE_ORDER>
4.  Save and close.

BookmarkOffice PDF Document IndexingOffice PDF Text ProcessingOffice to PDFPDFPDF Archive Scanning SoftwarePDF Barcode RecognitionPDF Data Extraction SoftwarePDF FormsSearchable PDF OCR
Read more
No Comments

Is it possible to search for and retrieve documents with Windows desktop search?

Wednesday, 28 February 2018 by dwilder

Windows Search works great with SimpleIndex because all index data can be saved to the folder and file names as well as the file properties, and OCR text can be saved to hidden layers in PDF files. Windows Search will read all of these elements when building its index and will return any matching files when you search. Using Windows Search on a file server allows for instantaneous searching across terabytes of documents and text for all of the users on your network. IFilters allow Windows Search to search within file contents. Here are three popular PDF IFilters that will enable text searching for PDF files: Foxit PDF IFilter (commercial) TET PDF IFilter (free/commercial) Adobe PDF IFilter (32-bit / 64-bit) (free) If you have issues with PDF text searching in Windows 10, this article has detailed instructions for resolving PDF IFilter issues: https://fixedit.itxpress.biz/2018/07/05/searching-pdfs-in-windows-10/

ArchiveContentverseDatabase & RetrievalDocument Management SoftwareIndexingMicrosoft Word Data ExtractionOffice PDF Document IndexingOffice PDF Text ProcessingPaperless OfficePaperVisionPDF Archive Scanning SoftwareQuickBooks Document ManagementRecords ManagementSearchSearchable PDF OCRText Processing
Read more
  • Published in Database & Retrieval, Export, Office PDF Text Processing
No Comments

I’m using full page OCR. The information is all appearing in the txt file but it is losing format about half way through. Data to the right is ending up at the end of the txt doc. Can this be fixed?

Wednesday, 28 February 2018 by dwilder

SimpleIndex version 7 solves this problem with the incorporation of the FineReader OCR engine. Full text in PDFs will now flow with the formatting of the PDF.

Legacy Versions: SimpleIndex can also be used with other OCR applications and servers to improve accuracy, formatting and performance. Use the OCR applications to convert the scanned images to text or searchable PDF, and SimpleIndex can extract index values from the text and automatically sort and organize the files.

Full Text IndexingOCROCR Form ProcessingOCR ScanningOffice PDF Text ProcessingPDF Data Extraction SoftwareSearchable PDF OCRText ProcessingZone OCR
Read more
  • Published in OCR
No Comments

How do you train the OCR engine for better accuracy?

Wednesday, 28 February 2018 by dwilder

Training has been removed with version 7 due to the addition of the ABBYY FineReader OCR engine.

Invoice OCROCROCR Form ProcessingOCR ScanningScreen Scraping OCRScreenshot OCRSearchable PDF OCRZone OCR
Read more
  • Published in OCR
No Comments

How do you configure full text searching in Retrieval mode?

Wednesday, 28 February 2018 by dwilder

On the Database tab there dropdown in the lower portion of the panel for Full Text OCR Field. Put the name of the field that will store the full-text data there. This must be configured both for Insert and Retrieval mode configurations. The database field needs to be sufficient length to store the entire text of your document. Of course, the Insert Mode configuration must have “Enable Full Page OCR” checked to generate full text data from images. Text from MS Office documents, PDF files and existing OCR text files can be used without setting this option. When designing your Retrieval Mode configuration, create a Text field to use for full text search queries. On the Database tab, set the corresponding “Database Field Name” to the full text database field. When searching on your full text field, SimpleIndex finds the text you enter no matter where it appears in the document. It is able to match partial words. It does not perform boolean or natural language search

ArchiveDatabaseDatabase & RetrievalDocument Management SoftwareExportFull Text IndexingIndexingIntegrationMS AccessMySQLOCROCR Form ProcessingOCR ScanningODBCOffice PDF Text ProcessingOraclePaperless OfficePDF Archive Scanning SoftwarePDF Data Extraction SoftwareQuickBooks Document ManagementRecords ManagementSearchSearchable PDF OCRServerSharePoint ScanningSQL ServerText ProcessingZone OCR
Read more
  • Published in Database & Retrieval, OCR
No Comments

How do you configure OCR to read index information from MS Office or PDF documents?

Wednesday, 28 February 2018 by dwilder

MS Office and PDF files generated by software or PDF printer drivers already have the text you need to recognize in the file. Scanned documents need to use OCR to read text from an image of the page. With Office and PDF files, SimpleIndex can just read the text, which is much faster and accurate than image OCR. To recognize index fields from the document text, first create OCR fields on the Index tab as you would normally. Next, on the Zones & OCR options tab, check the “Use Full Page OCR for this Field” option for each OCR field. This tells SimpleIndex to process the existing file text. If the index value is a unique pattern of digits or list of possible values, use Template or Dictionary matching to locate the value within the text. Please see the manual for details on Template and Dictionary matching. If the value appears in a specific location in each file, coordinates can be used to locate it. When processing text, the X, Y, Width and Height settings correspond to

Microsoft Word Data ExtractionMS OfficeOffice PDF Document IndexingOffice PDF Text ProcessingOffice to PDFPaperless OfficePDFPDF Archive Scanning SoftwarePDF Barcode RecognitionPDF Data Extraction SoftwarePDF FormsSearchable PDF OCRText Processing
Read more
  • Published in OCR, Office PDF Text Processing
No Comments

How can I improve recognition rates for my OCR fields?

Wednesday, 28 February 2018 by dwilder

There are several things you can do to improve accuracy for OCR. -Scan at 300dpi, black & white for best results. -Adjust the scan settings to remove background noise and improve the definition of characters. -For Zone OCR, field recognition can often vary based on the surrounding white space and text in the zone. Try varying the size of the zone to achieve optimal results. -For template matching, make sure all variations of the field format are included in the template list. -For dictionary matching, add common variations and OCR mistakes to the “thesaurus” list. -On the Zones & OCR tab (accessed from the Job Options) you can adjust the Max Errors setting to allow for more mistakes in the dictionary matching process. -Use the Strip Spaces, Strip Characters, Replace Characters and Case Fixing options to standardize the field format prior to matching. Please refer to the manual for details on how to configure these options. Find out more about Optical Character Recog

Clipboard OCRInvoice OCROCROCR Form ProcessingOCR ScanningScreen Scraping OCRScreenshot OCRSearchable PDF OCRZone OCR
Read more
  • Published in OCR
No Comments

Can OCR text be saved to MS Word or HTML formats?

Wednesday, 28 February 2018 by dwilder

Yes. On the Zones & OCR tab of the Job Options, there is a dropdown list for “Full-page OCR file type”. By default it is set to TEXT, but can be changed to WORD, HTML or PDF.

If the output file type is set to PDF, OCR text will be embedded as hidden text in the PDF file.

Find out more about Optical Character Recognition on the SimpleOCR Guide.

ExportFull Text IndexingOCROCR Form ProcessingOCR ScanningOffice PDF Text ProcessingPDF Data Extraction SoftwareSearchable PDF OCRText ProcessingZone OCR
Read more
  • Published in Licensing & Installation, OCR
No Comments

Can SimpleIndex create searchable PDF Image+Text files with hidden text?

Wednesday, 28 February 2018 by dwilder

If you enable full-page OCR and output to PDF, the full-page OCR text will be inserted as invisible text on each page.

With the addition of the FineReader Engine in version 7, SimpleIndex now creates PDF files with fully searchable text formatted to flow with the image of the document.

Find out more about Optical Character Recognition on the SimpleOCR Guide.

ExportFull Text IndexingOCROCR Form ProcessingOCR ScanningOffice PDF Text ProcessingPDF Data Extraction SoftwareSearchable PDF OCRText ProcessingZone OCR
Read more
  • Published in Export, OCR, Office PDF Text Processing
No Comments

Document Management

Wednesday, 04 October 2017 by dwilder
Designing Your Document Management Solution If you have not yet decided on a plan for how to organize your scanned images for later retrieval, you should take some time to consider the possible options. There are several ways to search and view documents scanned with SimpleIndex®: Use SimpleSearch to use keyword searches to find and view indexed documents Use SimpleView to browse folders, search files, view, edit and annotate scanned documents without a database Use a third-party document management system for integrated searching, viewing, workflow, HIPAA compliance and other document-centric functions Use SharePoint to share documents online, create custom document workflows and employ records management standards Use the Windows Search Bar to search the names AND contents of files stored on Windows Server 2008 or later. Integrate SimpleIndex directly with your custom application using the Command Line Interface Work with our professional services team or an Authorized Dealer to crea
ArchiveContentverseDatabase & RetrievalDocument ImagingDocument Management SoftwareFull Text IndexingPaperless OfficePaperVisionPDF Archive Scanning SoftwarePersonal Document ManagementQuickBooks Document ManagementRetrievalSearchSearchable PDF OCR
Read more
No Comments

MS Office & PDF Text Parsing

Tuesday, 03 October 2017 by dwilder
Office Videos | PDF Video The template and dictionary matching capabilities of SimpleIndex‘s OCR function can be used to extract index information from the text of existing MS Office and PDF files, or any file with an accompanying TXT file. SimpleIndex® will search the document for matches on unique patterns and value lists, then index the document with the matching data. Zone coordinates can be set to limit the search area to pre-defined regions on standard forms. The result is a fully automated indexing and renaming process for all your electronic documents! Using existing text, SimpleIndex can index and rename hundreds of files each minute and achieve perfect accuracy. These files can then be quickly searched with SimpleIndex Retrieval, SharePoint and Google search engines, or uploaded into your company’s document/content management system or custom business applications. Enhanced Text Parsing & PDF Support MS Office and PDF text parsing features are now included in
Automatic Data CaptureClassificationIndexingMicrosoft Word Data ExtractionMS OfficeOffice PDF Document IndexingOffice PDF Text ProcessingOffice to PDFPaperless OfficePDFPDF Archive Scanning SoftwarePDF Barcode RecognitionPDF Data Extraction SoftwarePDF FormsSearchable PDF OCRText Processing
Read more
No Comments

Contact Us Today!

Search Knowledge Base

Recent KB Articles

  • SimpleIndex Standard Workstation
  • SimpleIndex v9.2
  • SimpleIndex v9.2 Announcement
  • SimpleIndex Upgrades
  • SimpleIndex - Affordable document scanning and OCR
  • SimpleIndex Trial Download
  • SimpleIndex v9.1
  • SimpleIndex Trial Download - Appointment Requested

Feature Cloud

Wilder Paperless Office Export PDF Archive Scanning Software TIFF PDF Forms Import RPA Office PDF Text Processing SharePoint Scanning SimpleIndex Automatic Data Capture Mortgage Office to PDF Database & Retrieval Required Documents Auditing Barcode Printing Licensing & Installation Classification Subscription Command-Line Server Keyword Indexing Full Text Indexing Batch Scanning Image Scanning File Indexing Optical Mark Recognition Licensing Clipboard OCR Document Imaging XSLT Invoice OCR PDF Barcode Recognition Document Numbering System Command Line Interface Bookmark Records Management Metadata Barcode Recognition Software Workstation OCR Scanning Concurrent OCR XML

Online Support Options

Simple Software provides an interactive Frequently Asked Questions database and Live Support chat system, as well as free Training Videos.

How to Buy

Solutions start at just $500! Buy SimpleIndex online or from an Authorized Dealer in your area. Price List (PDF).

Authorized Dealers

Authorized DealersSimpleIndex is a great addition to any system integrator's product line. Become an Authorized Dealer.

Get a Web Demo

Get a free online demo with a scanning specialist who can configure SimpleIndex on your computer remotely.
Sign up for a demo now!

Download a Trial

SimpleIndex TrialFully functional 30-day demos are available for all Simple Software applications.
Download Now!

SimpleIndex Applications

SimpleIndex Applications See how SimpleIndex can be used in your business.
"Out-of-the-Box" Solutions
Case Studies
Common Applications
Industry-Specific Applications

© 2020 All rights reserved. SimpleIndex by SimpleSoftware.

TOP });