Cloud OCR
SimpleIndex 10.1 adds Amazon Textract cloud-based OCR to enable the following features in SimpleIndex:
- Highest accuracy of any available OCR engine
- Recognition of both print and cursive handwriting
- Automatic extraction of form field labels and values without templates
- Automatic extraction of standard fields from Invoices and Receipts
- Capture of line item data from Invoices
- Convert documents to JSON with coordinates and location of all text
Requirements to Use Textract[edit | edit source]
Connect to Your AWS Account[edit | edit source]
Using Textract requires an AWS account, which will incur charges for any documents processed using the Textract OCR option.
AWSText Engine[edit | edit source]
AWSForms Engine[edit | edit source]
AWSInvoice Engine[edit | edit source]
VENDOR_NAME~AMERICAN WASTE MANAGEMENT SERVICES TOTAL~$372.00 RECEIVER_ADDRESS~BILL TO: P-8973 AMERICAN REFINING GROUP 77 NORTH KENDALL AVE BRADFORD PA 16701 INVOICE_RECEIPT_DATE~07/31/2021 INVOICE_RECEIPT_ID~210743 PAYMENT_TERMS~30 DAYS SUBTOTAL~$372.00 TAX~$0.00 LINE1EXPENSE_ROW~JULY2021 RENTAL 07/31/2021 7/1-/31/2021 31.00 $12.00DAY $372.00 $372.00