Batch Scan To HTML using SimpleIndex
SimpleIndex is designed to enable a desktop scanner or network copier to quickly process multiple documents in batches, extract data from barcodes and text with OCR, then use that data to organize the files automatically, or export it to a database or XML file. It is a fast, easy and inexpensive way to do batch Scan To HTML.
With SimpleIndex you can:
Once configured, SimpleIndex is the easiest scanning solution available. It can scan documents, OCR them to searchable PDF files, read key values from the text, insert relevant bookmarks and automatically file the documents on your network and/or database with a single click! It is designed for organizations that need to scan many documents on an ongoing basis.
If you are looking for a desktop Scan To HTML to convert a few scanned documents to editable MS Word or PDF, please visit the Scan To HTML page at ScanStore or download our freeware Scan To HTML, SimpleOCR.
More on Scan To HTML
Scan To HTML
Find Scan To HTML here. FineReader, ReadIRIS, OmniPage, TextBridge & more. Reviews, expert advice, demo downloads and outsourced OCR services.
There are several OCR (Optical Character Recognition) software solutions available to convert scanned images to text, Word, Excel, HTML or searchable PDF. The differences between them can often be obscure, leaving many to wonder why some Scan To HTML cost about $100 while others cost $500 or more.
The main features that differentiate Scan To HTML are:
- Character recognition accuracy
- Page layout reconstruction accuracy
- Support for languages
- User interface design
- Output file formats (Word, Excel, PDF, eBook, etc.)
- OCR speed and support for multi-core CPUs
- Batch processing modes
- Advanced PDF encryption or compression
- Special features for niche projects
Because document types, OCR engines, project requirements and special features combine in so many ways, one engine may perform better with your particular documents than another. Use our handy OCR feature comparison chart to determine which OCR program best meets your requirements. If you prefer to try before you buy, ScanStore provides demo downloads for most Scan To HTML with your ScanStore User Account.
Scan To HTML Categories
The Scan To HTML guide is divided into the following categories:
Applications | OCR Servers | Mac OCR | PDF Converters | Personal
Hebrew/Arabic/Farsi OCR | Chinese/Japanese/Korean/Thai OCR
And The Winner Is...
The OCR experts at ScanStore have tested the latest versions of FineReader, OmniPage, ReadIRIS, CVision PDF Compressor, TextBridge and SimpleOCR and we consider ABBYY FineReader 11 the best overall value for business users, while ReadIRIS is the best Scan To HTML for under $150.
The key deciding factors were:
- User interface design
- Page layout reconstruction capabilities
- Extensive language support
- Engine stability when processing large files
- Availability and quality of technical support
Though other testing labs have ranked OmniPage's overall accuracy slightly higher, we find the difference is nearly negligible. All modern Scan To HTML software has very good accuracy, so we recommend choosing the one with the special features you need, such as ReadIRIS Corporate's CardIRIS, FineReader's camera OCR and screenshot reader, or OmniPage Pro's form data collection, auto-redaction and barcode filing capabilities.
Businesses with many documents to process should use our SimpleIndex batch document scanning software with the FineReader OCR engine to scan and OCR large batches of documents. Barcodes and OCR can also be used to sort and file documents into folders, databases or SharePoint.
FineReader Professional is a highly accurate and easy-to-use Scan To HTML that includes a host of features, including digital camera OCR, intelligent document layouts, image enhancement, barcode recognition, and command line integration. FineReader is our pick for Scan To HTML because its document layout retention will save you much time in reformatting documents you convert for editing.
FineReader Corporate Edition offers unique concurrent licensing that makes it possible for many users who need occasional use of OCR to share a small pool of active licenses. With accuracy comparable to OmniPage, superior technical support services, and a user interface that many users find preferable, we think FineReader Corporate is the best choice of Scan To HTML for business.
Affordable Scan To HTML for business and home users. ReadIRIS Pro provides a very accurate OCR recognition rate at a low cost, but still has some of the advanced features that higher priced professional Scan To HTML includes. The main limitation is that the Pro version is limited to documents under 50 pages.
Adds support for files over 50 pages, business card recognition, as well as automatic processing of hot folders.
OmniPage Standard is a low-cost version of OmniPage designed for personal users. It has the same recognition capabilities as OmniPage Pro, but does not offer advanced features like batch processing and forms recognition. Compare OmniPage Standard to Pro.
OmniPage Pro 18 has several unique features that make it stand out for a variety of applications. Some of these include auto-redaction, SharePoint integration, automatic filing with barcodes, PDF auto-bookmarking, form data collection and MFP support. Most of these new features are not available in the Standard edition. Compare OmniPage Standard to Pro.
PdfCompressor Desktop Edition (OCR) is a more economical version of PdfCompressor Professional (OCR), designed for lower-volume users. This version requires files to be processed individually and files must not exceed 100 pages. An excellent choice for someone who needs the power, but not the volume.
PdfCompressor produces the most efficient image documents possible for high volume scanning environments by combining highly accurate OCR, advanced file compression, and batch PDF conversion. PdfCompressor can compress scans by a factor of 10-100, enabling documents to be stored, transmitted, accessed, and hosted more efficiently and less expensively.
ReadIRIS Pro Scan To HTML now includes Arabic (PC version only), Farsi, and Hebrew character recognition in its base package. No special version or add-on is required.
Adds the ability to recognize files over 50 pages, business cards and monitor a hot folder to automatically process images in the background.
Enables Arabic and Farsi character recognition in the IRISDocument high-volume server OCR solution.
FineReader Professional 11 supports Hebrew character recognition and now has an optional version with additional Arabic character recognition.
FineReader Corporate also supports Hebrew character recognition and now has an optional version with additional Arabic character recognition.
Hebrew and Arabic languages are available as an add-on to the ABBYY Recognition Server high-volume OCR solution.
NovoVerus supports an exceptionally wide range of Roman, Asian, Cyrillic and Middle Eastern languages, including challenging global languages like Arabic, Persian (Farsi, Dari), Pashto, Hebrew, Urdu, Chinese (Simplified & Traditional), Korean, Russian, English and Spanish.
ABBYY PDF Transformer 3.0's intuitive, versatile, multilingual tool enables you to easily convert any type of PDF into editable formats with the original layout and formatting retained.
PdfCompressor Desktop Edition (OCR) is a more economical version of PdfCompressor Professional (OCR), designed for lower-volume users.
Using the latest in image compression technology PdfCompressor makes the most compact, Web-friendly PDF files available.
Nuance PDF Converter Professional is the complete PDF solution for business users, offering an unmatched combination of creation, editing and conversion features.
SimpleIndex combines ABBYY FineReader OCR technology with powerful pattern matching features to extract useful data from OCR text and use it to file documents automatically. Perfect for small to mid-sized businesses that need to digitize many documents at once. Also supports other labor-saving technologies like barcode recognition, zone OCR and database lookups.
OCR results can be saved to text, MS Word or searchable PDF and PDF/A files. Data can be saved to CSV (Excel), any SQL database, embedded in folders and filenames or used as SharePoint 2010 file metadata.
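To illustrate the general idea of pattern matching on OCR text, here is a minimal Python sketch. It is not SimpleIndex's actual implementation or configuration format; the field names, the regular expressions, the `archive` folder layout and the sample text are all assumptions made up for this example. It pulls two values out of recognized text, builds a folder and filename from them, and writes one CSV index row.

```python
import csv
import io
import re
from pathlib import PurePosixPath

# Hypothetical field patterns; real documents need patterns tuned to their layout.
FIELD_PATTERNS = {
    "invoice_number": re.compile(r"Invoice\s*(?:No\.?|#)\s*:?\s*(\d+)", re.IGNORECASE),
    "invoice_date": re.compile(r"Date\s*:?\s*(\d{4}-\d{2}-\d{2})", re.IGNORECASE),
}


def extract_fields(ocr_text):
    """Return the first match for each pattern, or an empty string if none."""
    fields = {}
    for name, pattern in FIELD_PATTERNS.items():
        match = pattern.search(ocr_text)
        fields[name] = match.group(1) if match else ""
    return fields


def target_path(fields, root="archive"):
    """Build a folder and filename from the extracted values (assumed layout)."""
    year = fields["invoice_date"][:4] or "unknown"
    name = fields["invoice_number"] or "unindexed"
    return str(PurePosixPath(root) / year / f"{name}.pdf")


def to_csv_row(fields, path):
    """Serialize one index record as a single CSV line."""
    buffer = io.StringIO()
    csv.writer(buffer, lineterminator="\n").writerow(
        [fields["invoice_number"], fields["invoice_date"], path]
    )
    return buffer.getvalue()


# Made-up OCR output standing in for a scanned invoice.
sample_text = "ACME Supply\nInvoice No: 10452\nDate: 2012-03-15\nTotal due: $1,250.00"
fields = extract_fields(sample_text)
path = target_path(fields)
row = to_csv_row(fields, path)
print(path)
print(row, end="")
```

Documents where a pattern finds nothing fall back to placeholder names in this sketch, so they can be set aside for manual indexing rather than filed under a wrong value.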
Affordable desktop and server licensing with no pay-per-click makes SimpleIndex the most cost effective software of its kind!
Innovative server-based Scan To HTML for performing centralized enterprise-wide OCR processing. Processor license allows anyone on the network to submit files for OCR. Complex XML job specifications can be submitted to control output. Support available for Arabic and Asian languages.
IRISDocument Server is a lower-cost solution than Recognition Server, but lacks some of its more advanced features and has slightly lower accuracy.
Several versions are available with varying monthly page processing limitations, letting you scale your solution to meet your budget requirements. Asian, Arabic and Hebrew language packs are also available.
Innovative server-based Scan To HTML. CVISION Maestro Recognition Server is engineered for industrial-strength, corporate-volume scanning and OCR needs. Maestro provides a flexible OCR solution delivered from a centralized server that integrates easily into an organization's existing document and imaging workflow, and lets users perform many image processing functions beyond OCR.
Designed for service bureaus and large scanning departments, PaperVision Capture OCR by Digitech brings an unprecedented level of efficiency and power to information capture. Work with everything, implement any custom process you want, and track any statistic you need. Provides index value population and document break insertion as an automated process.
FineReader Corporate includes Chinese, Japanese, and Thai character recognition in its base package. No special version or add-on is required.
ReadIRIS Pro Scan To HTML now includes Japanese, Traditional Chinese, Simplified Chinese and Korean character recognition in its base package. No special version or add-on is required.
ReadIRIS Corporate version is the same as Pro but with the ability to recognize files over 50 pages, business cards and monitor a hot folder to automatically process images in the background.
ReadIRIS Pro Scan To HTML for Mac now includes Japanese, Traditional Chinese, Simplified Chinese and Korean character recognition in its base package. No special version or add-on is required.
ReadIRIS Corporate version for Mac is the same as Pro but with the ability to recognize files over 50 pages, business cards and monitor a hot folder to automatically process images in the background.
Add-on that enables Chinese, Japanese and Korean character recognition in the IRISDocument server OCR solution.
Add-on license is available for ABBYY Recognition Server to add Chinese, Japanese & Korean (CJK) language support. Thai character recognition language pack is also available, but is sold separately from CJK.
ABBYY FineReader Express is an easy-to-use Scan To HTML for creating editable and searchable files from scanned documents, PDFs and digital camera images. The one-click conversion feature instantly turns paper documents into various electronic formats, including RTF, Excel, and searchable PDF.
OmniPage Scan To HTML for Macintosh users. This version does not use the latest OmniPage engine, but provides good accuracy nonetheless. Once again, Nuance does not make a demo download available.
Affordable Scan To HTML for Macintosh users using the latest version of the IRIS OCR engine.
Adds support for files over 50 pages, business card recognition, as well as automatic processing of hot folders.
The classic TextBridge engine is still available as an affordable alternative to high priced engines. It gives you an ultra-low cost, yet still accurate Scan To HTML without all the extra features. Allows you to output to web pages, Word documents and more while retaining the layout of the page, but does not include PDF
**NOT Compatible with Windows Vista**
Our own freeware OCR application provides acceptable accuracy for those who just need to convert a few pages and can't justify the cost of commercial OCR software. Developers can use the command-line and SDK versions to integrate SimpleOCR with their custom applications.
- Can SimpleIndex integrate with Microsoft SharePoint?
- I'm using full page OCR. The information is all appearing in the txt file but it is losing format about half way through. Data to the right is ending up at the end of the txt doc. Can this be fixed?
- How do I configure SimpleIndex to scan documents?
- How can I use barcodes or blank pages as separator pages to indicate document breaks?
- How do you configure OCR to read index information from MS Office or PDF documents?
- Can SimpleIndex read barcodes off of PDF files in a folder?
- Why are my barcodes not being recognized properly?
- How can I improve recognition rates for my OCR fields?
- I know nothing about databases. Can I still use the database and Retrieval Mode features?
- Some pages in my documents have unwanted barcodes that are being read. How can I exclude these from recognition?
Whether you are a small business trying to manage your paper, a government agency or non-profit trying to scan on a budget, or a multi-national corporation looking to distribute scanning throughout the enterprise, SimpleIndex is the perfect solution for scan to html.