Cloud OCR

From Simple Wiki
Revision as of 09:36, 29 April 2022 by Aaron (talk | contribs) (Created page with "SimpleIndex 10.1 adds Amazon Textract cloud-based OCR to enable the following features in SimpleIndex: * Highest accuracy of any available OCR engine * Recognition of both pr...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

SimpleIndex 10.1 adds Amazon Textract cloud-based OCR to enable the following features in SimpleIndex:

  • Highest accuracy of any available OCR engine
  • Recognition of both print and cursive handwriting
  • Automatic extraction of form field labels and values without templates
  • Automatic extraction of standard fields from Invoices and Receipts
  • Capture of line item data from Invoices
  • Convert documents to JSON with coordinates and location of all text

Requirements to Use Textract[edit | edit source]

Connect to Your AWS Account[edit | edit source]

Using Textract requires an AWS account, which will incur charges for any documents processed using the Textract OCR option.


AWSText Engine[edit | edit source]

AWSForms Engine[edit | edit source]

AWSInvoice Engine[edit | edit source]

VENDOR_NAME~AMERICAN WASTE MANAGEMENT SERVICES TOTAL~$372.00 RECEIVER_ADDRESS~BILL TO: P-8973 AMERICAN REFINING GROUP 77 NORTH KENDALL AVE BRADFORD PA 16701 INVOICE_RECEIPT_DATE~07/31/2021 INVOICE_RECEIPT_ID~210743 PAYMENT_TERMS~30 DAYS SUBTOTAL~$372.00 TAX~$0.00 LINE1EXPENSE_ROW~JULY2021 RENTAL 07/31/2021 7/1-/31/2021 31.00 $12.00DAY $372.00 $372.00