SimpleIndex

T (865) 637-8986
Email: info@simpleindex.com

SimpleIndex by SimpleSoftware
500 W Summit Hill Dr SW # 302, Knoxville, TN 37902

Document Classification

Monday, 01 July 2019 by aaron

An essential first step to processing mixed batches with many types of documents is classification. Document Classification methods quickly sort documents by type using key content and layout attributes to identify them.

The most popular document classification systems are advanced AI-based machine learning algorithms that automatically learn how to classify documents based on samples and user feedback. These systems are very powerful but also very expensive. Only large organizations processing millions of pages each year can afford these enterprise solutions.

SimpleIndex takes a simpler approach: classification based on keyword patterns in the document text. Create a list of document types and assign to each one or more keywords or phrases that appear only in that document type. Logical operators for AND, OR and NOT prevent false matches by requiring multiple keywords for a match or by excluding documents that contain certain phrases.
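
The keyword rules described above can be sketched in a few lines of Python. This is an illustration of the general technique only, not SimpleIndex's configuration format or implementation; the `Rule` class, its field names and the sample rules are all hypothetical.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Rule:
    """Hypothetical rule: a document type plus keyword conditions."""
    doc_type: str
    all_of: list[str] = field(default_factory=list)   # AND: every phrase must appear
    any_of: list[str] = field(default_factory=list)   # OR: at least one phrase must appear
    none_of: list[str] = field(default_factory=list)  # NOT: none of these may appear


def matches(rule: Rule, text: str) -> bool:
    """Case-insensitive substring test of one rule against the document text."""
    lowered = text.lower()

    def has(phrase: str) -> bool:
        return phrase.lower() in lowered

    return (
        all(has(p) for p in rule.all_of)
        and (not rule.any_of or any(has(p) for p in rule.any_of))
        and not any(has(p) for p in rule.none_of)
    )


def classify(text: str, rules: list[Rule]) -> str | None:
    """Return the first matching document type, or None if no rule matches."""
    for rule in rules:
        if matches(rule, text):
            return rule.doc_type
    return None


# Sample rules, invented for illustration.
RULES = [
    Rule("invoice", all_of=["invoice"],
         any_of=["amount due", "total due"], none_of=["packing slip"]),
    Rule("w2", all_of=["wage and tax statement"]),
]
```

Here the NOT condition keeps a packing slip that happens to mention an invoice from being filed as one, and rule order decides which type wins when more than one rule matches.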

Keyword-based classification works for the vast majority of applications at a fraction of the cost of AI classification.

After classification, SimpleIndex can automatically launch separate document indexing workflows for each document type found in the classified batch. This is especially useful when documents have different metadata requirements or business workflows associated with them.
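
As a rough sketch of the idea, routing a classified batch to per-type workflows amounts to grouping by type and dispatching each group to its own handler. The function names and the handler signature below are assumptions made for illustration; they do not reflect how SimpleIndex launches its workflows.

```python
from collections import defaultdict
from typing import Callable

# A workflow is modeled here as any callable that takes a list of file paths.
Workflow = Callable[[list], object]


def group_by_type(classified):
    """Group (path, doc_type) pairs into {doc_type: [paths]}."""
    batches = defaultdict(list)
    for path, doc_type in classified:
        batches[doc_type].append(path)
    return dict(batches)


def run_workflows(batches, workflows, default):
    """Run the handler registered for each type; unknown types use `default`."""
    results = {}
    for doc_type, paths in batches.items():
        handler = workflows.get(doc_type, default)
        results[doc_type] = handler(paths)
    return results
```

A fallback handler for unrecognized types is useful in practice, since it gives unmatched documents a place to go for manual review.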

Tags: Classification, OCR, Office PDF Text Processing, Workflow

MS Office & PDF Text Parsing

Tuesday, 03 October 2017 by dwilder

The template and dictionary matching capabilities of SimpleIndex's OCR function can be used to extract index information from the text of existing MS Office and PDF files, or any file with an accompanying TXT file. SimpleIndex® will search the document for matches on unique patterns and value lists, then index the document with the matching data. Zone coordinates can be set to limit the search area to pre-defined regions on standard forms. The result is a fully automated indexing and renaming process for all your electronic documents!
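
To make the two matching styles concrete, here is a minimal Python sketch: a regular expression stands in for a "unique pattern" and a short list stands in for a "value list" (dictionary). The invoice-number format, the vendor names and the field names are invented for the example, and zone coordinates are left out because they depend on page layout rather than plain text.

```python
import re

# Hypothetical pattern: invoice numbers that look like "INV-" plus six digits.
INVOICE_NO = re.compile(r"\bINV-(\d{6})\b")

# Hypothetical value list of known vendors.
VENDORS = ["Acme Supply", "Globex", "Initech"]


def extract_fields(text):
    """Return index fields found in the text; a field with no match is omitted."""
    fields = {}

    # Pattern match: capture only the numeric part of the invoice number.
    match = INVOICE_NO.search(text)
    if match:
        fields["invoice_no"] = match.group(1)

    # Dictionary match: the first listed vendor that appears in the text wins.
    lowered = text.lower()
    for vendor in VENDORS:
        if vendor.lower() in lowered:
            fields["vendor"] = vendor
            break

    return fields
```

The extracted fields can then drive renaming or filing of the document.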

Because it works from existing text, SimpleIndex can index and rename hundreds of files each minute without the recognition errors that OCR can introduce. These files can then be quickly searched with SimpleIndex Retrieval, SharePoint and Google search engines, or uploaded into your company's document/content management system or custom business applications.

Enhanced Text Parsing & PDF Support

MS Office and PDF text parsing features are now included in the Basic version of SimpleIndex, making it much more affordable to enable automatic document sorting on the desktop. Additional Office and PDF features include:

  • Convert any MS Office, HTML, XML and image files to PDF before processing
  • Read and write password-protected PDF files
  • Searchable PDF output (Image + Hidden Text)
  • Interactive template builder and tester
  • Easily select PDF or PDF/A output format
  • Native PDF viewer and auto-repair of problematic PDFs
  • Read data from PDF forms
  • Populate blank PDF forms with index data

Batch Convert Office Documents to PDF

If you have Microsoft Office or OpenOffice installed, you can use SimpleIndex to automatically convert MS Office documents to PDF files for archival. PDF files are better for archival than editable formats like Word and Excel. They can be annotated, encrypted, searched and viewed with free PDF readers.

There are many free applications that let you convert documents to PDF one at a time. SimpleIndex lets you convert thousands of files at once while it also extracts data from the text for indexing or data entry automation. This feature is ideal for migrating or archiving Office documents to SharePoint, document management systems and custom web applications.

Quickly Organize Any File on Your Computer

SimpleIndex lets you process any type of file on your computer. If an OLE-enabled viewer is installed, SimpleIndex will display the document on the screen. Other documents can be opened automatically in their default application when they are indexed. Quickly type index field data that can be used to reorganize the files into subfolders and structured filenames for browsing and searching on your network, or uploaded to your document/content management system or custom business application.
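
The reorganization step can be pictured as building a target path from index field values. The sketch below is one possible layout under assumed field names (`vendor`, `year`, `invoice_no`); it is not SimpleIndex's naming scheme, and it only computes the path without moving any file.

```python
import re
from pathlib import PurePosixPath


def safe(value):
    """Replace characters that are not allowed in Windows file names."""
    return re.sub(r'[<>:"/\\|?*]+', "_", value).strip()


def target_path(root, fields, folder_keys, name_keys, ext):
    """Build root/<folder fields...>/<name fields joined by "_"><ext>."""
    folders = [safe(fields[key]) for key in folder_keys]
    name = "_".join(safe(fields[key]) for key in name_keys) + ext
    return PurePosixPath(root, *folders, name)
```

For example, `target_path("archive", fields, ["vendor", "year"], ["invoice_no"], ".pdf")` files each invoice under a vendor folder and a year subfolder, named by its invoice number.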

If the file has an accompanying text file (*.TXT) with the same name, the text in that file can be used for index field extraction, fully automating the process.
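
A companion text file can be located with a simple lookup. This sketch assumes that "the same name" means the same base name with the extension swapped (for example `report.pdf` and `report.txt`) and that the text is UTF-8; both are assumptions, not documented SimpleIndex behavior.

```python
from pathlib import Path


def sidecar_text(document):
    """Return the text of a same-named .txt/.TXT file beside `document`, else None."""
    for suffix in (".txt", ".TXT"):
        candidate = Path(document).with_suffix(suffix)
        if candidate.is_file():
            # Undecodable bytes are replaced rather than raising an error.
            return candidate.read_text(encoding="utf-8", errors="replace")
    return None
```

The returned text can be passed to whatever pattern or keyword matching is used for index field extraction.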

Viewing & Indexing MS Office Documents

SimpleIndex supports viewing MS Office documents (Word, PowerPoint and Excel) on computers with or without MS Office installed. When MS Office is installed, the full application interface is displayed within the SimpleIndex viewer, letting users view the full content of the documents, edit them with all the features of MS Office and save the changes. Modify privileges can be denied using Windows file security or the SimpleIndex administration wizard to prevent unauthorized changes.

If MS Office is not installed, SimpleIndex can open and display the documents in its built-in viewer in read-only mode.

KB Articles for MS Office & PDF Text Parsing

  • Change the Dictionary Separator Value
  • Regular Expression (RegEx) - Syntax or Type
  • Check and Repair All PDF Files
  • Keep Pages in Original Order when Bookmarking
  • Do Not Combine Pages to 1 Bookmark
  • Can I split a PDF based on bookmark values?
  • Is it possible to search for and retrieve documents with Windows desktop search?
  • Can SimpleIndex read bar codes from existing PDF files?
  • Is there a way to just use part of a bar code or OCR value? For example, extract "50" from the value "124450"
  • How do you configure OCR to read index information from MS Office or PDF documents?

Tags: Automatic Data Capture, Classification, Indexing, Microsoft Word Data Extraction, MS Office, Office PDF Document Indexing, Office PDF Text Processing, Office to PDF, Paperless Office, PDF, PDF Archive Scanning Software, PDF Barcode Recognition, PDF Data Extraction Software, PDF Forms, Searchable PDF OCR, Text Processing

© 2021 Meta Enterprises, LLC | Knoxville, Tennessee | A Family Owned Company