SIGN IN YOUR ACCOUNT TO HAVE ACCESS TO DIFFERENT FEATURES

Login with Google
FORGOT YOUR PASSWORD?

FORGOT YOUR DETAILS?

AAH, WAIT, I REMEMBER NOW!
QUESTIONS? CALL: 865-637-8986
  • LOGIN

SimpleIndex

  • LEARN MORE
    • GENERAL INFO
      • Getting Started
      • How To Scan Documents
      • Barcode Scanning Guide
      • Searching & Viewing
      • Sunshine Software
      • News & Updates
      • Schedule a Consultation
    • FEATURES
      • Streamlined Interface
      • Automated & 1-Click Processing
      • TWAIN and ISIS Scanning
      • Zone OCR and Dynamic OCR
      • Handwriting Recognition Software
      • Amazon Textract OCR and ICR
      • Screenshot OCR
      • Document Classification
      • Database Integration
    • –
      • PDF & MS Office Text Parsing
      • Email Document Processing
      • Barcode Recognition
      • Optical Mark Recognition
      • Match Documents to Existing Data
      • Required Documents Check
      • Imprinting & Watermarking
      • SharePoint Document Scanning
      • AI and SimpleIndex
  • SOLUTIONS
    • General
      • All-In-One Scanning & Sorting Tool
      • Affordable Document Management
      • Instant Integration
      • Network Scanners & Copiers
      • Remote Document Capture
      • Reduce Click Charges for Data Capture
      • Compare with Other Solutions
    • Feature Demos
      • Zone OCR with Template Matching
      • PDF Text Processing
      • Organize Office Documents
      • Automatic Image Splitting
      • Amazon Textract OCR and ICR
      • Full-Page OCR & Multi-User Workflow
      • PDF Form Filling with XML & RPA
      • AP to QuickBooks Online with RPA
      • CRM Integration with RPA
    • Marketplace
      • Sales Tax Exemption Forms OCR
      • Invoice Processing
      • Automatic Web Image Optimization
      • Material Safety Data Sheets (MSDS) Indexing
      • Patent ID and Title Extraction OCR
      • Federal Tax Returns
      • Mortgage & Loan Documents
  • SUITE
    • SimpleCoversheet – Print Bar Codes
    • SimpleExport – Data File Converter
    • SimpleView – Search, View & Edit
    • SimpleQB – QuickBooks Integrator
    • SimpleOCR – Freeware OCR
    • Buy Suite Apps
  • DOWNLOAD
  • SHOP
    • COMPARE VERSIONS
    • SIMPLEINDEX
      • Workstation License
      • Concurrent License
      • Subscription License
    • SIMPLEINDEX SERVER
    • SOLUTIONS
      • LoanStacker
      • Material Safety Data Sheets (MSDS) Indexing OCR
      • Patent ID and Title Extraction OCR
      • Sales Tax Exemption Forms OCR
      • SimpleInvoice
      • TaxStacker Add-on for SimpleIndex
    • ADDONS AND EXPANSIONS
    • MAINTENANCE & CONSULTING
    • MANAGE SUBSCRIPTIONS
    • FIND A DEALER
      • Dealer Locator
      • Become a Dealer
    • CONTACT SALES
  • SUPPORT
    • WIKI HELP
    • KNOWLEDGE BASE
    • SIMPLEINDEX UNIVERSITY
    • PRIVACY POLICY
    • CONTACT SUPPORT
    • NEWSLETTER
    • SCHEDULE A CONSULTATION
  • My Account
    • MANAGE SUBSCRIPTIONS
    • Downloads
    • Register Account
    • Login
  • MY CART
    No products in cart.
  • Home
  • Simple Software Knowledge Base - Article
  • Large Documents (>500 pages) are Slow to Process

Large Documents (>500 pages) are Slow to Process

by Cary Wiedman / Thursday, 06 February 2020 / Published in Indexing & UI, SimpleView

Welcome to our Knowledge Base

Created OnFebruary 6, 2020
byCary Wiedman
Print

When working with PDF image files containing a high number of pages (typically in excess of 500, but can vary by file and PC running the job) SimpleIndex may run into performance issues as it attempts to hold all of those pages in memory and perform the requested operations (full-text OCR in particular can tax a system in these circumstances).

SimpleIndex 11:

Use the Fast Import and Fast Export options to use our new, optimized import and export that can split or merge PDF files with thousands of pages in a matter of seconds. These options disable the optional features that require the slower import and export operations and allow for much faster processing.

Older Versions of SimpleIndex:

A workaround in this scenario is to convert the large PDF to a folder of smaller PDFs files that can be managed more easily. In order to minimize the impact on production and tax the user(s) with extra steps, you can use a third-party splitting tool that can be called from the Command Line. One such option that has worked well is PDFSplitter from CoolUtils

One way to automate this process is to use PDFSplitter’s command line ability in conjunction with SimpleIndex’s Pre-processing function. For simplicity let’s consider a 600 page PDF with a filename generated at the time of scanning using indexes provided on a coversheet or keyed by an operator. The goal now is to take that large file and perform a full-text conversion on it.

Our SimpleIndex job, Full Page OCR.sic let’s say, launches and before getting to work calls PDFSplitter from the Pre-processing step with a command such as

PDFSplitter.exe “C:\Images\Smith – John – Medical History.pdf” C:\Images\Pages\ -cp 100

PDFSplitter will run and break that document every 100 pages creating 6 PDFs in the folder C:\Images\Pages. It maintains the original filename, simply adding “001-100” and so on to the name. After PDFSplitter is complete the Full Page OCR job begins its process and, given that the original filename is still part of the split files’ naming schema, it can produce one full-text PDF in the final output folder.

Related Wiki Help Pages:

  • File Input Settings
  • File Output Settings
  • Pre-Process and Post-Process Settings
  • Tweet
Tagged under: Automatic Indexing Software, Command Line Interface, Command-Line, File Indexing, Office PDF Document Indexing, Office PDF Text Processing, PDF, PDF Forms, Scanned Document Indexing

Search

Connect with Us!

What is 7+4?


Search Knowledge Base

Recent KB Articles

  • How much do Simple Software products cost?
  • The 'Microsoft.ACE.OLEDB.12.0' provider is not registered on the local machine.
  • Enable License Log
  • Change License Files Path
  • License Activation Instructions for Simple Software Products
  • What are SimpleIndex Specifications?
  • On what versions of Windows does SimpleIndex run?
  • License Site Update v9.2.50 and Earlier

Feature Cloud

Batch Scanning Aztec and QR Code</li> <li>Recognize 30 different 1D barcode formatsCode 39 Archive Email to PDF Bar Code Printing Automatic PDF Separation 2 of 5 Automatic Data Capture Bar Codes Bates Numbering Software Automatic Indexing Software Business Process Automation a generic barcode coversheet can be used to separate the scanned images into multi-page files Barcode Printing Barcode OCR 1-Click Processing Barcode Recognition Software Bar Code Scanning accessability Checkbox Recognition Barcode Reading Software

Online Support Options

Check our Wiki Help, Knowledge Base and Training Videos, or Contact Support if you still need Help

How to Buy

Solutions start at just $500! Buy SimpleIndex online or from an Authorized Dealer in your area.

Authorized Dealers

Authorized DealersSimpleIndex is a great addition to any system integrator's product line. Become an Authorized Dealer.

Get a Web Demo

Get a free online demo with a scanning specialist who can configure SimpleIndex on your computer remotely.
Sign up for a demo now!

Download a Trial

SimpleIndex Trial30-day trial downloads are available for all Simple Software applications.
Download Now!

SimpleIndex Applications

SimpleIndex Applications Packaged apps built with SimpleIndex.
SimpleInvoice for AP
Sales Tax Manager
Mortgage LoanStacker
MSDS and Patents
SimpleIndex

© 2025 Meta Enterprises, LLC | Knoxville, Tennessee | A Family Owned Company
© 2025 SimpleSoftware | Consulting Services in the Field of Software as a Service

TOP
SimpleIndex
Manage Cookie Consent
We use cookies to optimize our website and our service.
Functional cookies Always active
The technical storage or access is strictly necessary for the legitimate purpose of enabling the use of a specific service explicitly requested by the subscriber or user, or for the sole purpose of carrying out the transmission of a communication over an electronic communications network.
Preferences
The technical storage or access is necessary for the legitimate purpose of storing preferences that are not requested by the subscriber or user.
Statistics
The technical storage or access that is used exclusively for statistical purposes. The technical storage or access that is used exclusively for anonymous statistical purposes. Without a subpoena, voluntary compliance on the part of your Internet Service Provider, or additional records from a third party, information stored or retrieved for this purpose alone cannot usually be used to identify you.
Marketing
The technical storage or access is required to create user profiles to send advertising, or to track the user on a website or across several websites for similar marketing purposes.
  • Manage options
  • Manage services
  • Manage {vendor_count} vendors
  • Read more about these purposes
View preferences
  • {title}
  • {title}
  • {title}
});