Focusing on What Matters Most
Instead of running OCR on every page or parsing the entire document text, modern intelligent document processing systems skip irrelevant information and focus on the specific sections that contain the needed data. This avoids unnecessary OCR and full-text parsing that can slow down the entire process. If documents are born-digital, OCR can be skipped altogether, and the system can interpret the document and extract the needed data immediately.
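The routing described above can be sketched in a few lines. This is a minimal illustration, not how any particular product works: the `Page` record, the `is_relevant` check, and the `run_ocr` function are all hypothetical names supplied by the caller.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class Page:
    """Hypothetical page record; not tied to any real library."""
    number: int
    text_layer: Optional[str] = None  # embedded text if the page is born-digital
    image: Optional[bytes] = None     # raster data if the page was scanned


def extract_text(
    pages: List[Page],
    is_relevant: Callable[[Page], bool],
    run_ocr: Callable[[bytes], str],
) -> Dict[int, str]:
    """Return text for relevant pages only, calling OCR only when needed."""
    results: Dict[int, str] = {}
    for page in pages:
        if not is_relevant(page):
            continue  # irrelevant page: no OCR, no parsing
        if page.text_layer is not None:
            results[page.number] = page.text_layer  # born-digital: skip OCR
        elif page.image is not None:
            results[page.number] = run_ocr(page.image)  # scanned: OCR required
    return results
```

In this sketch, OCR cost is paid only for pages that are both relevant and scanned; everything else is either read directly from the text layer or skipped.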
By now, it should be clear that while OCR is an important step in intelligent document processing for scanned documents, it delivers only text, not interpretation. For born-digital documents, OCR is not required at all. Using document-based information within an automated transaction requires several further levels of capability to get from raw text to real structured data.
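To make the gap between text and structured data concrete, here is a small sketch that turns raw text into typed fields. The invoice layout, the field names, and the regular expressions are assumptions chosen for illustration; real systems typically need far more than pattern matching.

```python
import re
from typing import Optional, TypedDict


class InvoiceFields(TypedDict):
    invoice_number: Optional[str]
    total: Optional[float]


# Hypothetical patterns for one illustrative invoice layout.
INVOICE_NO = re.compile(r"Invoice\s*(?:No\.?|#)\s*:?\s*(\S+)", re.IGNORECASE)
# \b keeps "Subtotal" from matching as "Total".
TOTAL = re.compile(r"\bTotal\s*:?\s*\$?([\d,]+\.\d{2})", re.IGNORECASE)


def to_structured(text: str) -> InvoiceFields:
    """Map raw text (from OCR or a text layer) to typed fields."""
    number = INVOICE_NO.search(text)
    total = TOTAL.search(text)
    return {
        "invoice_number": number.group(1) if number else None,
        "total": float(total.group(1).replace(",", "")) if total else None,
    }
```

The input to `to_structured` is the same whether it came from OCR or directly from a digital file; the interpretation step is what produces values an automated transaction can act on.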