
Automatic archival of Microsoft Office documents to PDF via batch conversion, indexing and document management workflow.

If you have Microsoft Office or OpenOffice installed, you can use SimpleIndex to automatically convert MS Office documents to PDF files for archival. PDF files are better for archival than editable formats like Word and Excel. They can be annotated, encrypted, searched and viewed with free PDF readers.

There are many free applications that let you convert documents to PDF one at a time. SimpleIndex lets you convert thousands of files at once while it also extracts data from the text for indexing or data entry automation. This feature is ideal for migrating or archiving Office documents to SharePoint, document management systems and custom web applications.

Check and Repair All PDF Files

Monday, 29 July 2019 by Simple Software

Please refer to the Wiki Documentation for the complete PDF reference.

You can configure SimpleIndex to check and repair every PDF file it processes.

Go to this location in the Windows Registry:

Computer\HKEY_LOCAL_MACHINE\SOFTWARE\WOW6432Node\SimpleIndex\Misc

Create a new String Value called “FixAllPDF” and set its value to 1.
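
When the same setting has to be applied to several workstations, it can be captured in a .reg file instead of being typed by hand. The following is a minimal sketch that only builds the file text. The helper name build_reg_text is hypothetical, and the key path and value name are taken from the steps above. The "Computer\" prefix shown in the Registry Editor address bar is not part of the key path inside a .reg file.

```python
def build_reg_text(key_path: str, name: str, value: str) -> str:
    """Return the text of a .reg file that sets one string (REG_SZ) value."""
    def esc(s: str) -> str:
        # Backslashes and double quotes must be escaped inside quoted strings.
        return s.replace("\\", "\\\\").replace('"', '\\"')

    lines = [
        "Windows Registry Editor Version 5.00",
        "",
        f"[{key_path}]",
        f'"{esc(name)}"="{esc(value)}"',
        "",
    ]
    return "\r\n".join(lines)


# Key path and value name as given in the article.
FIX_ALL_PDF_KEY = r"HKEY_LOCAL_MACHINE\SOFTWARE\WOW6432Node\SimpleIndex\Misc"
reg_text = build_reg_text(FIX_ALL_PDF_KEY, "FixAllPDF", "1")
print(reg_text)
```

Saving the text to a file with a .reg extension and importing it with Registry Editor should have the same effect as the manual steps. Changes under HKEY_LOCAL_MACHINE require administrator rights.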

Office PDF Document Indexing, Office PDF Text Processing, Office to PDF, PDF, PDF Archive Scanning Software, PDF Barcode Recognition, PDF Data Extraction Software, PDF Forms, Unattended Processing

Keep Pages in Original Order when Bookmarking

Monday, 29 July 2019 by Simple Software

To keep all pages in the order in which they were imported, even when they belong to different bookmarks, do the following:

1.  Open the configuration in Notepad.
2.  Search for <BOOKMARK_PAGE_ORDER>
3.  Change this line from “false” to “true”:  <BOOKMARK_PAGE_ORDER>true</BOOKMARK_PAGE_ORDER>
4.  Save and close.
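
If the change has to be made in many configurations, the edit above can be scripted. This is a minimal sketch, assuming the configuration is a plain-text file that already contains the <BOOKMARK_PAGE_ORDER> element. The function name and the <JOB> wrapper in the sample are hypothetical and only serve the illustration.

```python
import re

TAG = "BOOKMARK_PAGE_ORDER"


def set_bookmark_page_order(config_text: str, enabled: bool = True) -> str:
    """Return config_text with <BOOKMARK_PAGE_ORDER> set to true or false."""
    pattern = re.compile(rf"(<{TAG}>)[^<]*(</{TAG}>)")
    value = "true" if enabled else "false"
    new_text, count = pattern.subn(
        lambda m: m.group(1) + value + m.group(2), config_text
    )
    if count == 0:
        # Fail loudly rather than silently writing an unchanged file.
        raise ValueError(f"<{TAG}> not found in configuration")
    return new_text


# Hypothetical fragment; a real configuration contains many more settings.
sample = "<JOB>\n  <BOOKMARK_PAGE_ORDER>false</BOOKMARK_PAGE_ORDER>\n</JOB>\n"
updated = set_bookmark_page_order(sample)
print(updated)
```

Keeping a backup copy of the configuration before overwriting it is advisable, since the sketch replaces text without validating the rest of the file.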

Office PDF Document Indexing, Office PDF Text Processing, Office to PDF, PDF, PDF Archive Scanning Software, PDF Barcode Recognition, PDF Bookmarking, PDF Data Extraction Software, PDF Forms, Unattended Processing

How do you select what types of files to process?

Wednesday, 28 February 2018 by dwilder

Please refer to the Wiki Documentation for the complete Batch Processing Stages reference.

You can tell SimpleIndex what types of files it should process and which file types to ignore.

To set this, click “Job Options”. On the “Batch” tab you will find a field labeled “Input file types or mask”. SimpleIndex imports only files of the types listed in this field. The default types are:

TIF,PDF,JPG,GIF,BMP,DOC,XLS,PPT,DOCX,XLSX,PPTX,VSD,DWG,AVI,MP3

To process all files, enter *

SimpleIndex will ignore any file whose extension does not appear on the list.

In SimpleIndex 6 or above you can enter file masks to filter input files. Some examples are:

abc*.pdf (PDF files starting with “abc”)
ab??ef.* (All files whose names consist of “ab”, any 2 characters and “ef”, with any extension)
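
The filtering rules above can be approximated in a few lines to check which files a given list would accept. This is only a sketch of the behavior as described here, not SimpleIndex's own matching code. The function name is hypothetical, matching is assumed to be case-insensitive, and Python's fnmatch is used as a stand-in for the mask syntax (unlike Windows wildcards, it requires a literal dot for a mask such as ab??ef.*).

```python
from fnmatch import fnmatchcase


def matches_input_spec(filename: str, spec: str) -> bool:
    """Return True if filename is accepted by a comma-separated type/mask list."""
    name = filename.upper()
    for entry in spec.upper().split(","):
        entry = entry.strip()
        if not entry:
            continue
        if any(ch in entry for ch in "*?."):
            # Entry is a mask such as ABC*.PDF or a bare *.
            if fnmatchcase(name, entry):
                return True
        elif name.endswith("." + entry):
            # Entry is a plain extension such as PDF.
            return True
    return False


for candidate in ["report.pdf", "notes.txt", "abc123.pdf", "abXYef.doc"]:
    print(candidate, matches_input_spec(candidate, "TIF,PDF,ab??ef.*"))
```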

It is possible to have some file types open automatically in their default application. This can be done by inserting a pipe “|” into the list. Any file types after the pipe will be opened in their default application. For example:

TIF,PDF,JPG|WAV,MP3,WMV,AVI

will cause SimpleIndex to display image files and open sound and video files in the default media player.
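
To preview how a list containing a pipe divides file types, the rule can be sketched as follows. The function names and the returned labels are hypothetical; the sketch handles plain extensions only and assumes that types before the pipe are displayed and types after it are opened externally, as described above.

```python
def split_type_list(spec: str) -> tuple[list[str], list[str]]:
    """Split 'A,B|C,D' into (types shown in the viewer, types opened externally)."""
    before, _, after = spec.partition("|")

    def to_list(part: str) -> list[str]:
        return [e.strip().upper() for e in part.split(",") if e.strip()]

    return to_list(before), to_list(after)


def handling_for(filename: str, spec: str) -> str:
    """Describe what happens to filename under the given type list."""
    viewer_types, external_types = split_type_list(spec)
    ext = filename.rsplit(".", 1)[-1].upper() if "." in filename else ""
    if ext in viewer_types:
        return "display"
    if ext in external_types:
        return "open in default application"
    return "ignore"


spec = "TIF,PDF,JPG|WAV,MP3,WMV,AVI"
for candidate in ["scan.tif", "song.mp3", "notes.txt"]:
    print(candidate, "->", handling_for(candidate, spec))
```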

File Indexing, Image Scanning, Office PDF Document Indexing, Office to PDF
  • Published in Import

How do you configure OCR to read index information from MS Office or PDF documents?

Wednesday, 28 February 2018 by dwilder

Please refer to the Wiki Documentation for the complete Zones & OCR Settings reference.

MS Office files and PDF files generated by software or PDF printer drivers already contain the text you need to recognize. Scanned documents need OCR to read text from an image of the page. With Office and PDF files, SimpleIndex can read the text directly, which is much faster and more accurate than image OCR.

To recognize index fields from the document text, first create OCR fields on the Index tab as you would normally. Next, on the Zones & OCR options tab, check the “Use Full Page OCR for this Field” option for each OCR field. This tells SimpleIndex to process the existing file text.

If the index value follows a unique pattern of digits or is one of a list of possible values, use Template or Dictionary matching to locate the value within the text. Please see the manual for details on Template and Dictionary matching.
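
The idea behind the two matching modes can be illustrated outside SimpleIndex. In this sketch a regular expression stands in for a template and a plain list of strings stands in for a dictionary. SimpleIndex's own template and dictionary syntax is described in its manual and may differ; the function names and the sample text are hypothetical.

```python
import re
from typing import Optional, Sequence


def match_template(text: str, pattern: str) -> Optional[str]:
    """Return the first substring of text that fits the pattern, if any."""
    found = re.search(pattern, text)
    return found.group(0) if found else None


def match_dictionary(text: str, values: Sequence[str]) -> Optional[str]:
    """Return the listed value that appears earliest in text, ignoring case."""
    lowered = text.lower()
    hits = [(lowered.find(v.lower()), v) for v in values if v.lower() in lowered]
    return min(hits)[1] if hits else None


# Hypothetical document text.
document_text = "Invoice No: 2024-00417\nCustomer: Acme Corp\n"

print(match_template(document_text, r"\b\d{4}-\d{5}\b"))
print(match_dictionary(document_text, ["Globex", "Acme Corp"]))
```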

If the value appears in a specific location in each file, coordinates can be used to locate it. When processing text, the X, Y, Width and Height settings correspond to line and column numbers within the file text. This is explained in greater depth in the manual.
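
A line-and-column zone of this kind can be pictured as a rectangular slice of the text. The sketch below is an illustration only: the function name is hypothetical, and it assumes zero-based line and column numbers, which may not match the numbering SimpleIndex uses (the manual is the authority on that).

```python
def text_zone(text: str, x: int, y: int, width: int, height: int) -> str:
    """Return the block of text starting at column x, line y (both zero-based)."""
    lines = text.splitlines()[y:y + height]
    return "\n".join(line[x:x + width] for line in lines)


# Hypothetical page text with the date always on the second line.
page = "ACME CORP        INVOICE\nDate: 2019-07-29\nTotal: 42.00\n"

# Column 6, line 1, 10 characters wide, 1 line high.
print(text_zone(page, x=6, y=1, width=10, height=1))
```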

SimpleIndex will assume that any TXT file with the same name as a file being processed is the OCR text for that file, so this method can work with any type of file.
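
When preparing such companion files with an external tool, the naming rule can be checked with a short script. This sketch assumes the TXT file sits in the same folder as the document and differs only in its extension; the function names and the UTF-8 encoding are assumptions made for the example.

```python
from pathlib import Path
from typing import Optional


def sidecar_text_path(document: Path) -> Path:
    """Return the path of the TXT file that shares the document's name."""
    return document.with_suffix(".txt")


def read_sidecar_text(document: Path) -> Optional[str]:
    """Return the companion text if the TXT file exists, otherwise None."""
    sidecar = sidecar_text_path(document)
    if sidecar.is_file():
        # Encoding is an assumption; adjust to whatever produced the text.
        return sidecar.read_text(encoding="utf-8", errors="replace")
    return None


print(sidecar_text_path(Path("scans/drawing.dwg")))
```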

Find out more about Optical Character Recognition on the SimpleOCR Guide.

Microsoft Word Data Extraction, MS Office, Office PDF Document Indexing, Office PDF Text Processing, Office to PDF, Paperless Office, PDF, PDF Archive Scanning Software, PDF Barcode Recognition, PDF Data Extraction Software, PDF Forms, Text Processing, Unattended Processing
  • Published in OCR, Office PDF Text Processing

Organize Office Documents with Text Parsing

Tuesday, 23 January 2018 by Simple Software

This video shows the Sort My Documents sample job included with the SimpleIndex trial download. It shows how you can organize office documents automatically by parsing the file’s text for relevant metadata and keywords. You can then use those keywords to tag documents with metadata and create standardized folders and filenames.

Organize Office Documents Automatically with Text Parsing

First we sort Word documents, Excel spreadsheets and PowerPoint presentations automatically using the SimpleIndex template and dictionary matching algorithms that match patterns and keywords in the parsed text.

Then the files are organized into folders and filenames using the Sales Rep, Customer, Document Type and Date values extracted from the text.
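
The kind of standardized naming described here can be sketched as a small function that turns extracted field values into a folder and filename. The layout (Sales Rep/Customer/Document Type_Date), the function names and the sample values are all hypothetical; the sample job in the video may use a different scheme.

```python
import re
from pathlib import PurePosixPath

# Characters that are not allowed in Windows file and folder names.
INVALID_CHARS = re.compile(r'[<>:"/\\|?*]+')


def clean(value: str) -> str:
    """Make a field value safe to use as a folder or file name."""
    return INVALID_CHARS.sub("_", value).strip(" .") or "Unknown"


def build_target_path(fields: dict, extension: str) -> str:
    """Build 'Sales Rep/Customer/Document Type_Date.ext' from index fields."""
    folder = PurePosixPath(clean(fields["Sales Rep"])) / clean(fields["Customer"])
    filename = "{}_{}.{}".format(
        clean(fields["Document Type"]), clean(fields["Date"]), extension.lstrip(".")
    )
    return str(folder / filename)


# Hypothetical values standing in for data parsed from a document.
fields = {
    "Sales Rep": "J. Smith",
    "Customer": "Acme/West",
    "Document Type": "Quote",
    "Date": "2018-01-23",
}
print(build_target_path(fields, ".docx"))
```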

Organize Office Documents for Cloud Storage

You can also upload organized files to SharePoint or Cloud Storage platforms without the chaos and disorganization you inevitably get when users create their own folders and filenames.

Organize Office Documents for Document Management

In the video, we use SimpleSearch to search and view the sorted files, but you can just as easily use any third-party document management system or custom database to perform keyword or full-text searching.

You can use the SimpleView embedded viewer to view Office documents, PDF files and images in a common interface. In the video we use the full version of Word, Excel, and PowerPoint to edit Office documents right from the search screen.

Find Out More

  • Download or get an Online Demo
  • MS Office Text Processing Features in SimpleIndex
  • MS Office Features and Settings Wiki Pages
  • OCR Features and Settings Wiki Pages
  • OCR Software Guide on SimpleOCR

Learn More:

  • Scan, file, and process document data quickly and efficiently with Simple Software's tailored OCR automation and one-click processing that fits your unique business needs.
  • Use SimpleIndex OCR to convert scanned and digital images to searchable PDF files for automated sorting, filing, and export to applications such as Word, Excel and PowerPoint.
  • Automatically and accurately convert scanned and digital documents with simple and complicated tables to PDF, Excel, Google Sheets, CSV and other spreadsheet formats with Simple Software's document management suite, which includes Abbyy FineReader, SimpleIndex, SimpleView, SimpleSearch, SimpleSend, SimpleExport, SimpleCoversheet and AI components.
  • SimpleIndex OCR automation analyzes, classifies, verifies, and exports digital data into searchable documents that can be stored and processed in a variety of SQL applications.
  • Simple Software's document management OCR (Optical Character Recognition) uses machine-learning and AI (artificial intelligence) methods to convert handwritten and digital text and image data into machine-readable text, allowing users to search, index, and organize large volumes of document data.
  • SimpleIndex batch OCR recognizes, analyzes, and exports large volumes of documents and digital data in one pass, for tasks such as digitizing paper and scanned archives or processing document data in a document management system.
  • SimpleIndex Server enterprise OCR lets users automatically perform optical character recognition on thousands of documents at a time, scaling to large volumes of document conversions and processes tailored to the user's specifications.
  • SimpleIndex OCR data capture technology and page layout analysis automatically identify common invoice data elements (vendor, date, amount, invoice number, line item data, etc.) to automate data entry, eliminate errors, automate retrieval, match POs (Purchase Orders), validate quantities and prices, and automate electronic document management and storage.

FAQ Related to Organizing Office Documents

  • Features
  • Take control of Sales Tax exemption forms
  • Instant Integration With Any Application
  • Document Classification
  • Zone OCR and Dynamic OCR
  • Exclude Index Field from Index Log
  • Change the Font Size of Index Fields
  • Large Documents (>500 pages) are Slow to Process
Document Classification, Full Text Indexing, MS Office, Office PDF Document Indexing, Office PDF Text Processing, Office to PDF, Paperless Office, Search, SharePoint Migration, SharePoint Scanning, Text Processing


MS Office & PDF Text Parsing

Tuesday, 03 October 2017 by dwilder


The template and dictionary matching capabilities of SimpleIndex's OCR function can be used to extract index information from the text of existing MS Office and PDF files, or any file with an accompanying TXT file. SimpleIndex® will search the document for matches on unique patterns and value lists, then index the document with the matching data. Zone coordinates can be set to limit the search area to pre-defined regions on standard forms. The result is a fully automated indexing and renaming process for all your electronic documents!

Because existing text is read directly rather than recognized from an image, SimpleIndex can index and rename hundreds of files each minute without the recognition errors of image OCR. These files can then be quickly searched with SimpleIndex Retrieval, SharePoint and Google search engines, or uploaded into your company’s document/content management system or custom business applications.

Enhanced Text Parsing & PDF Support

MS Office and PDF text parsing features are now included in the Basic version of SimpleIndex, making it much more affordable to enable automatic document sorting on the desktop. Additional Office and PDF features include:

  • Convert any MS Office, HTML, XML and image files to PDF before processing
  • Read and write password-protected PDF files
  • Searchable PDF output (Image + Hidden Text)
  • Interactive template builder and tester
  • Easily select PDF or PDF/A output format
  • Native PDF viewer and auto-repair of problematic PDFs
  • Read data from PDF forms
  • Populate blank PDF forms with index data

Quickly Organize Any File on Your Computer

SimpleIndex lets you process any type of file on your computer. If an OLE-enabled viewer is installed, SimpleIndex will display the document on the screen. Other documents can be opened automatically in their default application when they are indexed. Quickly type index field data that can be used to reorganize the files into subfolders and structured filenames for browsing and searching on your network, or to upload them to your document/content management system or custom business application.

If the file has an accompanying text file (*.TXT) with the same name, the text in that file can be used for index field extraction, fully automating the process.

Viewing & Indexing MS Office Documents

SimpleIndex supports viewing MS Office documents (Word, PowerPoint and Excel) on computers with or without MS Office installed. When MS Office is installed, the full application interface is displayed within the SimpleIndex viewer, letting users view the full content of the documents, edit them with all the features of MS Office and save the changes. Modify privileges can be denied using Windows file security or the SimpleIndex administration wizard to prevent unauthorized changes.

If MS Office is not installed, SimpleIndex opens the documents in the built-in viewer in read-only mode.

KB Articles for MS Office & PDF Text Parsing

  • Change the Dictionary Separator Value
  • Regular Expression (RegEx) - Syntax or Type
  • Check and Repair All PDF Files
  • Keep Pages in Original Order when Bookmarking
  • Do Not Combine Pages to 1 Bookmark
  • Can I split a PDF based on bookmark values?
  • Is it possible to search for and retrieve documents with Windows desktop search?
  • Can SimpleIndex read bar codes from existing PDF files?
  • Is there a way to just use part of a bar code or OCR value? For example, extract "50" from the value "124450"
  • How do you configure OCR to read index information from MS Office or PDF documents?
Automatic Data Capture, File Indexing, Microsoft Word Data Extraction, MS Office, Office PDF Document Indexing, Office PDF Text Processing, Office to PDF, offline OCR, on-prem OCR, on-site OCR, One-time payment OCR, Paperless Office, PDF, PDF Archive Scanning Software, PDF Barcode Recognition, PDF Data Extraction Software, PDF Forms, Self-hosted OCR, Subscription free OCR, Sunshine Software OCR, Text Processing, Unattended Processing

© 2023 Meta Enterprises, LLC | Knoxville, Tennessee | A Family Owned Company
© 2023 SimpleSoftware | Consulting Services in the Field of Software as a Service
