Document Identification, Separation, and Classification
The capability of a capture system to identify document types and distinguish the start and end of each document is crucial for large-scale processing. Although using document separators such as pages or barcodes is common, modern solutions offer features that eliminate the need for 'batch preparation'—the task of organizing documents before scanning.
Advanced systems can automate document classification. A system capable of automatically distinguishing between invoices or receipts reduces manual preparation. Employing machine learning that develops and improves document classification eliminates the need for a person or team to organize and prepare documents for the next processing step.
Automated classification empowers a business analyst to establish document types effortlessly by submitting samples of a specific document class to the system. This process teaches the system to recognize future ingested documents.
The software analyzes the document and identifies key characteristics important for determining the type during production. Additionally, systems can then create individual documents automatically from the stream of scanned images.
This enable individuals to import a batch of documents while removing the necessity of sorting or inserting separator pages. Additionally, the system identifies attributes that specify particular pages, such as the first or last page, while treating all other pages as those in-between.