Subscription free OCR refers to “on-premises optical character recognition software”, aka Sunshine Software. Why Sunshine? Because sky is clear and there are no clouds
On-premises OCR, or Optical Character Recognition, refers to a technology or software system that performs character recognition and text extraction from images or documents locally, within an organization’s own infrastructure, rather than relying on external servers or cloud-based services. In other words, the OCR process takes place on the organization’s own hardware or servers, rather than sending data to a third-party cloud service for processing.
Here are some key points about on-premises OCR:
- Local Processing: On-premises OCR software is installed and operates on the organization’s own servers or computers. This means that sensitive or confidential data doesn’t leave the organization’s premises, which can be a critical requirement for companies with strict data security and privacy regulations.
- Data Privacy and Security: Organizations that deal with highly sensitive information, such as medical records, legal documents, or financial data, often prefer on-premises OCR to maintain better control over their data and ensure compliance with privacy regulations.
- Network Independence: On-premises OCR doesn’t rely on an internet connection for processing, making it suitable for environments with limited or unreliable internet connectivity.
- Customization: On-premises OCR solutions can often be customized to meet specific business needs and integrated into existing workflows and systems.
- Cost Considerations: While on-premises OCR provides greater control and security, it typically requires more substantial upfront investments in hardware, software, and IT infrastructure, compared to cloud-based OCR services. Additionally, organizations are responsible for maintenance, updates, and support. However, many cloud solutions make you to sign contracts charging you annually. Because of that accumulated costs of cloud solutions are often more expensive than it seem. On-premise Sunshine solutions are offered for onetime fixed payment, thou. Here is a study on Total Cost of Ownership for OCR solutions exploring the costs.
- Scalability: Scalability with on-premises OCR may require additional investment in hardware and software licenses as the volume of OCR processing increases.
In contrast, cloud-based OCR services rely on external servers and infrastructure to process OCR tasks, offering scalability and reduced upfront costs, but they may raise concerns about data privacy and security, especially for organizations dealing with sensitive data. However, remember that cloud is just someones’ server that you do not have any control of.
The choice between on-premises and cloud-based OCR depends on an organization’s specific needs, regulatory requirements, and available resources. Some organizations may opt for a hybrid approach, where critical or sensitive data is processed on-premises, while non-sensitive data is processed in the cloud for efficiency and scalability.
SimpleInvoice Invoice Processing Solution
SimpleInvoice is a preconfigured solution that uses the OCR and dictionary matching functionality of the SimpleIndex scanning and indexing software to automatically capture key information from invoices needed for Accounts Payable processing.
SimpleInvoice requires minimal configuration to get started, and comes with everything you need to capture most common invoice styles.
Use SimpleInvoice to:
- Capture data from paper and electronic invoices in a single workflow
- Automatically receive and enter Accounts Payable data in your accounting software
- Create full-text searchable invoice files
- Create an organized filing system for archiving invoices
- Quickly find and view invoices based on vendor, date, invoice number, or full-text search
- Direct integration with QuickBooks on-premise using SimpleQB
- Works with RPA bots to integrate with QuickBooks Online and other accounting systems
Uses Templates, Not Training
Most data on an invoice matches common patterns like dates and total amounts. The one exception is the invoice number, which has a different format for every vendor.
Using the Template Autofill feature in SimpleIndex, you to spell out the specific OCR pattern of a vendor’s invoice number as a column in your Vendor database. When processing invoices, it first identifies the vendor, then searches for the matching pattern in the text to find the invoice number.
This solution is far simpler than the machine learning algorithms employed by enterprise invoice OCR systems, which is why SimpleIndex is a fraction of the cost. It’s also simpler than other template-based systems that require you to locate every field for every vendor.
Enterprise Accounts Payable Automation
If your AP workflow requires advanced features like line item capture, GL coding, PO matching, VAT calculation, complex approval workflows, or if you have thousands of vendors to process, then an enterprise invoice processing solution is more appropriate.
Don’t worry, we can help you out with that too!
Find Out More
SimpleInvoice is included for free with any SimpleIndex license. Download SimpleIndex Now!
Some initial setup is required, and we can help you out with that too. Our Professional Services department can have you up and running in just a couple of hours.
Check out SimpleQB or our AP Automation RPA Bot to see how we integrate with your accounting software to automate the entry of transaction data.
Please Contact Us to find out more about SimpleInvoice!
Learn More:
Full-Page OCR Indexing Demo
This sample job demonstrates the ability for SimpleIndex to convert scanned documents to searchable PDF files and extract index data from the OCR text. It also demonstrates the multi-user workflow capabilities.
Step 1 uses a full-page OCR process on each image.
Field data is extracted from the full-page text using template and dictionary matching algorithms.
This is done in Pre-Index mode to allow unattended processing.
Data is saved to a database so it can be reviewed and corrected in Step 2.
Step 2 uses Database Update mode to find images with missing index values and allow the user to manually enter the correct data.
Step 3 uses a SimpleSearch configuration to search and view the indexed images, including full text searches.
Find Out More
- Download or get an Online Demo
- Dynamic OCR Features in SimpleIndex
- Full-Page OCR Wiki Pages
- OCR Features and Settings Wiki Pages
- OCR Software Guide on SimpleOCR
Learn More:
FAQ Related to Full-Page OCR
PDF Text Processing Demo
This sample job demonstrates the PDF text processing capabilities of SimpleIndex by extracting the Document Number, Date, Document Type, Customer and Total from a number of documents without OCR, by processing the text layer of PDF files.
Computer-generated PDF files, such as those created using PDF printer drivers, already contain digitized text. SimpleIndex reads the text and performs Template and Dictionary Matching to locate and extract the correct data values from the text.
Since the existing text is being used, OCR is not performed. This makes processing much faster and 100% accurate, especially compared to solutions using zone OCR.
While this demo runs interactively, text processing jobs can run in unattended mode since the data does not need to be verified.
Full-Page OCR can also be used to get text from scanned PDF files with no existing text. SimpleIndex will also detect when a PDF file has existing text and only perform OCR on the documents that need it to improve performance.
Find Out More
- Download or get an Online Demo
- PDF Text Processing Features in SimpleIndex
- PDF Features and Settings Wiki Pages
- Full-Page OCR Wiki Pages
- OCR Features and Settings Wiki Pages
- OCR Software Guide on SimpleOCR
Learn More:
FAQ Related to PDF Text Processing
- I have a duplex scanner. How to set up SimpleIndex to scan two sided documents automatically?
- Features
- Patent ID and Title Extraction
- Take control of Sales Tax exemption forms
- Instant Integration With Any Application
- Affordable Document Management
- Indexing Solutions with Barcode Recognition
- Document Classification
Zone OCR with Template Matching
This video shows the Zone OCR Invoice Processing sample job. Zone OCR is the traditional method for extracting index data from printed text appearing in fixed locations on every page.
The video also shows how Zone OCR is enhanced with SimpleIndex‘s Template Matching and Dictionary Matching features, giving you much more margin for error than other solutions.
Find Out More
- Download or get an Online Demo
- Dynamic OCR Features in SimpleIndex
- OCR Features and Settings Wiki Pages
- OCR Software Guide on SimpleOCR
Learn More:
FAQ Related to Zone OCR
- Automatic Image Splitting
- Zone OCR and Dynamic OCR
- Language Pack for Standard/Tesseract OCR
- Languages Supported in SimpleSoftware OCR Engines
- Change the Dictionary Separator Value
- Change the OCR Font or Type
- Regular Expression (RegEx) - Syntax or Type
- I'm using full page OCR. The information is all appearing in the txt file but it is losing format about half way through. Data to the right is ending up at the end of the txt doc. Can this be fixed?
Compare Leading Solutions
SimpleIndex™
Kodak Capture Pro™
Kofax Express™
PaperVision™ Capture Desktop
Note: This video depicts PaperVision Capture Desktop, a now discontinued software that has since been replaced by the similarly functioning updated version of PaperFlow.
Office Gemini DiamondVision™
Testing Methods
The benchmark times were recorded using all available software shortcuts, and by performing data entry and user interactions as fast as possible. The same scanner and computer hardware was used for each test. Much care was taken to ensure that each application yielded the most accurate OCR results possible given the sample documents.
Unfortunately none our competitors could accurately capture the account number on all 10 pages. The extra time to correct these errors accounts for 15-30% of the difference in processing times. The difference in accuracy is due in large part to SimpleIndex‘s pattern matching OCR feature, which the other programs lack.
Keep in mind these videos were recording using the latest version available at the time this test was taken. Results may vary with with later versions.
Learn More:
- 1
- 2