SimpleIndex

T (865) 637-8986
Email: info@simpleindex.com

SimpleIndex by SimpleSoftware
500 W Summit Hill Dr SW # 302, Knoxville, TN 37902


Searchable PDF OCR with SimpleIndex

SimpleIndex lets you create searchable PDF documents from scanned images by using OCR to convert the pages to text and overlaying that text on the original scan. The result is a scanned document that is fully searchable and lets you highlight and copy text, while preserving the original page formatting for readability.

Unlike other basic OCR applications, SimpleIndex also lets you automatically tag and organize documents using keywords identified with pattern matching, database lookups or bar codes. Structured data can be used to populate a database, document management system, SharePoint and other repositories.

Download document scanning and OCR software.

Regular Expression (RegEx) – Syntax or Type

Monday, 29 July 2019 by Simple Software

SimpleIndex uses the .NET regular expression (RegEx) syntax.
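As an illustration of the kind of pattern involved, the sketch below uses Python's `re` module as a stand-in, restricted to constructs that Python and .NET share (`\b`, `\d`, `{n}` and a plain capture group). The invoice-number format and the sample text are made up for the example. The two flavors do differ in places; for instance, named groups are written `(?<name>...)` in .NET but `(?P<name>...)` in Python, so any pattern should be tested in SimpleIndex itself.

```python
import re

# Hypothetical pattern: "INV-" followed by exactly six digits.
# Only constructs shared by .NET and Python regex are used here.
INVOICE_PATTERN = re.compile(r"\bINV-(\d{6})\b")


def find_invoice_numbers(text: str) -> list[str]:
    """Return the six-digit part of every invoice number found in text."""
    return INVOICE_PATTERN.findall(text)


sample = "Ref INV-004521 paid; see also INV-004522 and INV-12."
print(find_invoice_numbers(sample))
```

The short value `INV-12` is skipped because the pattern demands six digits.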


Check and Repair All PDF Files

Monday, 29 July 2019 by Simple Software

You can configure SimpleIndex to check and repair every PDF file it processes.

Go to this location in the Windows Registry:

Computer\HKEY_LOCAL_MACHINE\SOFTWARE\WOW6432Node\SimpleIndex\Misc

Create a new String Value named “FixAllPDF” and set its value to 1.
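If the same setting has to be applied on several workstations, it can be captured in a .reg file. The sketch below generates the file text in Python. The helper name `build_reg_file` is hypothetical; the key path and value come from the steps above, and the layout assumes the standard Registry Editor 5.00 file format.

```python
def build_reg_file(key_path: str, name: str, value: str) -> str:
    """Build the text of a .reg file that sets one string (REG_SZ) value."""
    lines = [
        "Windows Registry Editor Version 5.00",
        "",
        f"[{key_path}]",
        f'"{name}"="{value}"',
        "",
    ]
    # .reg files conventionally use Windows line endings.
    return "\r\n".join(lines)


# Key path from the article, without the leading "Computer\" shown by regedit.
KEY = r"HKEY_LOCAL_MACHINE\SOFTWARE\WOW6432Node\SimpleIndex\Misc"
reg_text = build_reg_file(KEY, "FixAllPDF", "1")
print(reg_text)
```

Saving the output to a file with a .reg extension and importing it requires administrator rights, since the key lives under HKEY_LOCAL_MACHINE.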


Keep Pages in Original Order when Bookmarking

Monday, 29 July 2019 by Simple Software

To keep all pages in the order in which they were imported, even when they belong to different bookmarks, do the following:

1.  Open the configuration file in Notepad.
2.  Search for <BOOKMARK_PAGE_ORDER>.
3.  Change the value on this line from “false” to “true”:  <BOOKMARK_PAGE_ORDER>true</BOOKMARK_PAGE_ORDER>
4.  Save and close.
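When many configurations need the same change, the edit can be scripted. This is a minimal sketch that assumes the configuration is a plain-text file containing the tag exactly as shown above; the function name and the surrounding `<JOB>` element in the sample are hypothetical. It works on the text only, so back up the original file before writing anything back.

```python
import re

TAG = "BOOKMARK_PAGE_ORDER"


def enable_bookmark_page_order(config_text: str) -> str:
    """Set <BOOKMARK_PAGE_ORDER> to true, leaving the rest of the text untouched."""
    pattern = re.compile(rf"(<{TAG}>)\s*false\s*(</{TAG}>)", re.IGNORECASE)
    return pattern.sub(r"\1true\2", config_text)


# Hypothetical fragment; a real configuration contains many other settings.
sample = "<JOB>\n<BOOKMARK_PAGE_ORDER>false</BOOKMARK_PAGE_ORDER>\n</JOB>"
print(enable_bookmark_page_order(sample))
```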


Is it possible to search for and retrieve documents with Windows desktop search?

Wednesday, 28 February 2018 by dwilder

Windows Search works great with SimpleIndex because all index data can be saved to the folder and file names as well as the file properties, and OCR text can be saved to hidden layers in PDF files. Windows Search will read all of these elements when building its index and will return any matching files when you search.

Using Windows Search on a file server allows for instantaneous searching across terabytes of documents and text for all of the users on your network.

IFilters allow Windows Search to search within file contents.

Here are three popular PDF IFilters that will enable text searching for PDF files:

  • Foxit PDF IFilter (commercial)
  • TET PDF IFilter (free/commercial)
  • Adobe PDF IFilter (32-bit / 64-bit) (free)

If you have issues with PDF text searching in Windows 10, this article has detailed instructions for resolving PDF IFilter issues:

https://fixedit.itxpress.biz/2018/07/05/searching-pdfs-in-windows-10/

  • Published in Database & Retrieval, Export, Office PDF Text Processing

I’m using full page OCR. The information is all appearing in the txt file but it is losing format about half way through. Data to the right is ending up at the end of the txt doc. Can this be fixed?

Wednesday, 28 February 2018 by dwilder

SimpleIndex version 7 solves this problem with the incorporation of the FineReader OCR engine. Full text in PDFs will now flow with the formatting of the PDF.

Legacy Versions: SimpleIndex can also be used with other OCR applications and servers to improve accuracy, formatting and performance. Use the OCR applications to convert the scanned images to text or searchable PDF, and SimpleIndex can extract index values from the text and automatically sort and organize the files.

  • Published in OCR

How do you train the OCR engine for better accuracy?

Wednesday, 28 February 2018 by dwilder

Training has been removed with version 7 due to the addition of the ABBYY FineReader OCR engine.

  • Published in OCR

How do you configure full text searching in Retrieval mode?

Wednesday, 28 February 2018 by dwilder

On the Database tab there is a dropdown in the lower portion of the panel for Full Text OCR Field. Enter the name of the field that will store the full-text data there. This must be configured in both the Insert and Retrieval mode configurations. The database field needs to be long enough to store the entire text of your document.

Of course, the Insert Mode configuration must have “Enable Full Page OCR” checked to generate full text data from images. Text from MS Office documents, PDF files and existing OCR text files can be used without setting this option.

When designing your Retrieval Mode configuration, create a Text field to use for full text search queries. On the Database tab, set the corresponding “Database Field Name” to the full text database field.

When searching on your full text field, SimpleIndex finds the text you enter no matter where it appears in the document. It is able to match partial words. It does not perform boolean or natural language searches. The text entered must match the document text exactly.
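The matching behavior described above amounts to a literal substring test. The sketch below illustrates that idea only; it is not SimpleIndex's implementation, the function name and sample text are invented, and the article does not say whether letter case is ignored (this sketch is case-sensitive).

```python
def full_text_match(query: str, document_text: str) -> bool:
    """Literal substring match: no boolean operators, no stemming."""
    return query in document_text


doc = "Invoice 10423 for Acme Corporation, due 2019-07-29"
print(full_text_match("Corp", doc))           # a partial word still matches
print(full_text_match("Acme AND 2019", doc))  # "AND" is treated as literal text
```

The second query fails because the whole phrase, including the word "AND", would have to appear in the document.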

  • Published in Database & Retrieval, OCR

How do you configure OCR to read index information from MS Office or PDF documents?

Wednesday, 28 February 2018 by dwilder

MS Office and PDF files generated by software or PDF printer drivers already contain the text you need to recognize. Scanned documents need OCR to read text from an image of the page. With Office and PDF files, SimpleIndex can read the text directly, which is much faster and more accurate than image OCR.

To recognize index fields from the document text, first create OCR fields on the Index tab as you would normally. Next, on the Zones & OCR options tab, check the “Use Full Page OCR for this Field” option for each OCR field. This tells SimpleIndex to process the existing file text.

If the index value is a unique pattern of digits or list of possible values, use Template or Dictionary matching to locate the value within the text. Please see the manual for details on Template and Dictionary matching.
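The two approaches can be pictured as follows. This sketch uses a regular expression as a stand-in for a Template and a plain list of values as a stand-in for a Dictionary; SimpleIndex's own Template syntax is described in the manual and is not reproduced here. The function names, the purchase-order format and the vendor names are all invented for illustration.

```python
import re


def match_template(text, pattern):
    """Return the first substring matching the pattern, or None."""
    m = re.search(pattern, text)
    return m.group(0) if m else None


def match_dictionary(text, allowed_values):
    """Return the first allowed value that occurs in the text, or None."""
    lowered = text.lower()
    for value in allowed_values:
        if value.lower() in lowered:
            return value
    return None


text = "Purchase Order 45-10077\nVendor: Northwind Traders\nTerms: Net 30"
print(match_template(text, r"\b\d{2}-\d{5}\b"))
print(match_dictionary(text, ["Contoso", "Northwind Traders"]))
```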

If the value appears in a specific location in each file, coordinates can be used to locate it. When processing text, the X, Y, Width and Height settings correspond to line and column numbers within the file text. This is explained in greater depth in the manual.
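A text zone of this kind can be pictured as a rectangle of characters. The sketch below assumes that X is the starting column, Y the starting line, Width a number of columns and Height a number of lines, all counted from 1. Those conventions are assumptions made for the example; the manual defines the actual mapping. The function name and sample page are hypothetical.

```python
def extract_text_zone(text: str, x: int, y: int, width: int, height: int) -> list[str]:
    """Return `height` lines starting at line `y`, each cut to `width`
    columns starting at column `x` (lines and columns counted from 1)."""
    lines = text.splitlines()
    selected = lines[y - 1 : y - 1 + height]
    return [line[x - 1 : x - 1 + width] for line in selected]


# Two-column layout: the right-hand column starts at column 26.
page = "\n".join([
    "ACME SUPPLY CO.".ljust(25) + "Invoice: 10423",
    "Knoxville, TN".ljust(25) + "Date: 2019-07-29",
    "Bill to: J. Smith",
])
print(extract_text_zone(page, x=26, y=1, width=16, height=2))
```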

SimpleIndex will assume that any TXT file with the same name as a file being processed is the OCR text for that file, so this method can work with any type of file.
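The pairing rule can be sketched in a few lines. This example assumes that "the same name" means the same base name with a .txt extension (for example drawing.dwg and drawing.txt) and that the text is UTF-8; neither detail is confirmed by the article, and the function name is hypothetical.

```python
from pathlib import Path
from typing import Optional


def find_sidecar_text(document: Path) -> Optional[str]:
    """Return the text of the document's same-named .txt file, or None if absent."""
    sidecar = document.with_suffix(".txt")
    if sidecar.is_file() and sidecar != document:
        return sidecar.read_text(encoding="utf-8", errors="replace")
    return None
```

Calling `find_sidecar_text(Path("scans/drawing.dwg"))` would return the contents of scans/drawing.txt when that file exists.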

Find out more about Optical Character Recognition on the SimpleOCR Guide.

  • Published in OCR, Office PDF Text Processing

How can I improve recognition rates for my OCR fields?

Wednesday, 28 February 2018 by dwilder

There are several things you can do to improve accuracy for OCR.

  • Scan at 300 dpi, black & white, for best results.
  • Adjust the scan settings to remove background noise and improve the definition of characters.
  • For Zone OCR, field recognition can often vary based on the surrounding white space and text in the zone. Try varying the size of the zone to achieve optimal results.
  • For template matching, make sure all variations of the field format are included in the template list.
  • For dictionary matching, add common variations and OCR mistakes to the “thesaurus” list.
  • On the Zones & OCR tab (accessed from the Job Options), you can adjust the Max Errors setting to allow for more mistakes in the dictionary matching process.
  • Use the Strip Spaces, Strip Characters, Replace Characters and Case Fixing options to standardize the field format prior to matching.
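To see why normalization and an error tolerance help, consider the sketch below. It normalizes values by stripping spaces and fixing case, then accepts the closest dictionary entry if it is within a maximum number of character errors. Edit (Levenshtein) distance is used here as an assumed error measure; the article does not say how SimpleIndex counts errors, and all names and sample values are hypothetical.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance between two strings (insert, delete, substitute)."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def normalize(value: str) -> str:
    """Strip spaces and fix case before matching."""
    return value.replace(" ", "").upper()


def best_match(ocr_value, dictionary, max_errors):
    """Return the dictionary entry closest to ocr_value, or None if too far."""
    target = normalize(ocr_value)
    scored = [(edit_distance(target, normalize(entry)), entry) for entry in dictionary]
    distance, entry = min(scored)
    return entry if distance <= max_errors else None


vendors = ["Northwind Traders", "Contoso Ltd", "Fabrikam Inc"]
print(best_match("N0rthwind Tradcrs", vendors, max_errors=2))
print(best_match("N0rthwind Tradcrs", vendors, max_errors=1))
```

The OCR value contains two misread characters, so it is accepted with a tolerance of 2 and rejected with a tolerance of 1. A higher tolerance recovers more values but also raises the risk of matching the wrong entry.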

Please refer to the manual for details on how to configure these options.

Find out more about Optical Character Recognition on the SimpleOCR Guide. You may also check out our Advanced OCR Guide to find out how to use third-party OCR applications with SimpleIndex.

  • Published in OCR

Can OCR text be saved to MS Word or HTML formats?

Wednesday, 28 February 2018 by dwilder

Yes. On the Zones & OCR tab of the Job Options, there is a dropdown list for “Full-page OCR file type”. By default it is set to TEXT, but can be changed to WORD, HTML or PDF.

If the output file type is set to PDF, OCR text will be embedded as hidden text in the PDF file.

Find out more about Optical Character Recognition on the SimpleOCR Guide.

  • Published in Licensing & Installation, OCR

Can SimpleIndex create searchable PDF Image+Text files with hidden text?

Wednesday, 28 February 2018 by dwilder

If you enable full-page OCR and output to PDF, the full-page OCR text will be inserted as invisible text on each page.

With the addition of the FineReader Engine in version 7, SimpleIndex now creates PDF files with fully searchable text formatted to flow with the image of the document.

Find out more about Optical Character Recognition on the SimpleOCR Guide.

  • Published in Export, OCR, Office PDF Text Processing

Document Management

Wednesday, 04 October 2017 by dwilder

Designing Your Document Management Solution

If you have not yet decided on a plan for how to organize your scanned images for later retrieval, you should take some time to consider the possible options. There are several ways to search and view documents scanned with SimpleIndex®:

  • Use SimpleSearch to use keyword searches to find and view indexed documents
  • Use SimpleView to browse folders, search files, view, edit and annotate scanned documents without a database
  • Use a third-party document management system for integrated searching, viewing, workflow, HIPAA compliance and other document-centric functions
  • Use SharePoint to share documents online, create custom document workflows and employ records management standards
  • Use the Windows Search Bar to search the names AND contents of files stored on Windows Server 2008 or later.
  • Integrate SimpleIndex directly with your custom application using the Command Line Interface
  • Work with our professional services team or an Authorized Dealer to create a customized solution or direct integration with virtually any application
  • Use a custom database or spreadsheet such as MS Access, SQL Server or Excel to store the index data and provide links to the stored document images

 

Use SimpleSearch

SimpleSearch is included with all versions of SimpleIndex and can also be licensed by itself. SimpleSearch implements the same SimpleIndex interface in “Retrieval” mode, hiding all the menus and toolbars used for scanning. Index fields normally used to assign values are used for searching instead.

SimpleSearch can use SimpleIndex's built-in database to perform searches, or connect to any other database, even those for existing business applications. Users simply type the index values they want to find and SimpleIndex displays the matches. Partial matching and full text searches are also supported. Displayed documents can be printed, e-mailed or opened in their associated application (Word, Adobe Reader, Excel, etc.). SimpleSearch can view several common image formats and PDF files, and can preview files for any OLE-enabled application installed on your computer (MS Office, Adobe applications, AutoCAD, etc.).

Using a Document Management System

There are a wide variety of small business and enterprise document management systems available on the market today. They manage stored images and index data, and provide users an interface to search for and view these images. Many perform advanced functions like workflow management, revision tracking and access auditing for HIPAA compliance. SimpleIndex has the ability to interface with these systems, making it an ideal scanning front-end for use with most document management systems on the market.

Integration with document management software is done via the Index Log Files that SimpleIndex creates. Documents are scanned and indexed with SimpleIndex, and a log file is created that lists each image scanned and the index information associated with it. Virtually all document management systems come with standard or optional components that allow you to automatically import images and index information in the format SimpleIndex provides.
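On the receiving side, an import routine for such a log might look like the sketch below. The article does not specify the log layout, so this example assumes a comma-delimited file with one line per image, the index values first and the image path last. The field names, sample rows and function name are all hypothetical.

```python
import csv
import io


def read_index_log(log_text: str, field_names: list[str]) -> list[dict]:
    """Parse delimited log text into one dict per document."""
    rows = []
    for record in csv.reader(io.StringIO(log_text)):
        if not record:
            continue  # skip blank lines
        rows.append(dict(zip(field_names, record)))
    return rows


# Hypothetical log content: three index values followed by the image path.
sample_log = (
    'Acme Corp,10423,2019-07-29,C:\\Scans\\Acme Corp\\10423.pdf\n'
    '"Smith, J.",10424,2019-07-30,C:\\Scans\\Smith\\10424.pdf\n'
)
rows = read_index_log(sample_log, ["Vendor", "InvoiceNo", "Date", "ImagePath"])
for row in rows:
    print(row)
```

Using a CSV parser rather than a plain split keeps quoted values that contain the delimiter, such as "Smith, J.", in one field.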

Many document management systems have a scanning module that is sold separately, at significantly greater cost than SimpleIndex. With a single scanner, SimpleIndex can provide an easier and more cost-effective scanning interface than the default module. With multiple scanners, the low cost of SimpleIndex makes it possible to implement Distributed Document Capture for a fraction of what it would cost otherwise.

Use the Windows Search Bar

On Windows Server 2008 or later, you can enable the Windows Search Service and its indexing options to quickly find files using the search box in the top corner of an Explorer window. Once Windows Search is set up on the server, pick any shared drive or folder on your local PC that you would like to search and add it to a library. When you type a word or phrase in the search bar, Windows searches not only the file names but also the contents of the files. More information on setting up the search capability on Windows Server can be found here.

KB Articles for Document Management

  • Oracle database is slow to respond
  • What is Document Imaging?
  • Using alternate database schemas
  • Multiple Sort Fields on Search
  • Access Database Connection String
  • How do I delete an image and its database entry?
  • Is it possible to search for and retrieve documents with Windows desktop search?
  • Will your SimpleQB allow me to scan in old invoices or bank statements directly into QuickBooks?
  • How do I use the Media Wizard to create searchable DVDs or thumb drives?
  • How do I export index data to a database?

MS Office & PDF Text Parsing

Tuesday, 03 October 2017 by dwilder

The template and dictionary matching capabilities of SimpleIndex's OCR function can be used to extract index information from the text of existing MS Office and PDF files, or any file with an accompanying TXT file. SimpleIndex® will search the document for matches on unique patterns and value lists, then index the document with the matching data. Zone coordinates can be set to limit the search area to pre-defined regions on standard forms. The result is a fully automated indexing and renaming process for all your electronic documents!

Using existing text, SimpleIndex can index and rename hundreds of files each minute and achieve perfect accuracy. These files can then be quickly searched with SimpleIndex Retrieval, SharePoint and Google search engines, or uploaded into your company’s document/content management system or custom business applications.

Enhanced Text Parsing & PDF Support

MS Office and PDF text parsing features are now included in the Basic version of SimpleIndex, making it much more affordable to enable automatic document sorting on the desktop. Additional Office and PDF features include:

  • Convert any MS Office, HTML, XML and image files to PDF before processing
  • Read and write password-protected PDF files
  • Searchable PDF output (Image + Hidden Text)
  • Interactive template builder and tester
  • Easily select PDF or PDF/A output format
  • Native PDF viewer and auto-repair of problematic PDFs
  • Read data from PDF forms
  • Populate blank PDF forms with index data

Batch Convert Office Documents to PDF

If you have Microsoft Office or OpenOffice installed, you can use SimpleIndex to automatically convert MS Office documents to PDF files for archival. PDF files are better for archival than editable formats like Word and Excel. They can be annotated, encrypted, searched and viewed with free PDF readers.

There are many free applications that let you convert documents to PDF one at a time. SimpleIndex lets you convert thousands of files at once while it also extracts data from the text for indexing or data entry automation. This feature is ideal for migrating or archiving Office documents to SharePoint, document management systems and custom web applications.

Quickly Organize Any File on Your Computer

SimpleIndex lets you process any type of file on your computer. If an OLE-enabled viewer is installed, SimpleIndex will display the document on the screen. Other documents can be opened automatically in their default application when they are indexed. Quickly type index field data that can be used to reorganize the files into subfolders and structured filenames for browsing and searching on your network, or uploaded to your document/content management system or custom business application.

If the file has an accompanying text file (*.TXT) with the same name, the text in that file can be used for index field extraction, fully automating the process.
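The idea of turning index values into subfolders and structured filenames can be sketched as follows. SimpleIndex does this through its own configuration; the folder scheme, field names and helper functions here are hypothetical and only illustrate the principle, including the need to replace characters that Windows does not allow in names.

```python
import re
from pathlib import PureWindowsPath

# Characters that are not allowed in Windows file and folder names.
INVALID_CHARS = re.compile(r'[<>:"/\\|?*]')


def clean(value: str) -> str:
    """Replace characters Windows does not allow in file or folder names."""
    return INVALID_CHARS.sub("_", value).strip()


def build_destination(root: str, vendor: str, year: str,
                      invoice_no: str, extension: str) -> PureWindowsPath:
    """Build <root>/<vendor>/<year>/<invoice_no><extension> from index values."""
    file_name = f"{clean(invoice_no)}{extension}"
    return PureWindowsPath(root) / clean(vendor) / clean(year) / file_name


print(build_destination(r"D:\Archive", "Acme/West Corp", "2019", "INV:10423", ".pdf"))
```

Because the index values end up in the folder and file names, the documents stay findable by browsing or by any search tool that indexes file names.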

Viewing & Indexing MS Office Documents

SimpleIndex features full support for viewing and editing MS Office documents (Word, PowerPoint and Excel) on computers with or without MS Office installed. The full application interface is displayed within the SimpleIndex viewer, letting users view the full content of the documents, edit them with all the features of MS Office and save the changes. Modify privileges can be denied using Windows file security or by the SimpleIndex administration wizard to keep out unauthorized changes.

If MS Office is not installed, SimpleIndex can open and display them in the built-in viewer in read-only mode.

KB Articles for MS Office & PDF Text Parsing

  • Change the Dictionary Separator Value
  • Regular Expression (RegEx) - Syntax or Type
  • Check and Repair All PDF Files
  • Keep Pages in Original Order when Bookmarking
  • Do Not Combine Pages to 1 Bookmark
  • Can I split a PDF based on bookmark values?
  • Is it possible to search for and retrieve documents with Windows desktop search?
  • Can SimpleIndex read bar codes from existing PDF files?
  • Is there a way to just use part of a bar code or OCR value? For example, extract "50" from the value "124450"
  • How do you configure OCR to read index information from MS Office or PDF documents?


© 2021 Meta Enterprises, LLC | Knoxville, Tennessee | A Family Owned Company
