| International Journal of Computer Applications |
| Foundation of Computer Science (FCS), NY, USA |
| Volume 187 - Number 90 |
| Year of Publication: 2026 |
| Authors: Homi Dhumal, Harsh Dixit, Manav Shah |
10.5120/ijca2026926589
|
Homi Dhumal, Harsh Dixit, Manav Shah . Vendor-Agnostic Invoice Processing Framework: Integrating OCR, Canonical Modeling, and Human-in-the-Loop Validation. International Journal of Computer Applications. 187, 90 ( Mar 2026), 30-35. DOI=10.5120/ijca2026926589
Automated invoice processing still faces ongoing unresolved difficulties that could be related to non-standardized format, inaccuracy of optical character recognition, and the need to maintain the financial integrity as well as the audit compliance. Current academic and commercial solutions do not fully address the issues and an integrated approach to ensure numerical inaccuracy, regulatory compliance, and auditing is not developed. To overcome this weakness, this paper proposes a validation-based pipeline of invoice processing, combining OCR extraction, canonical data modelling, and carefully organization human-in the-loop validation controls. It is a pipeline that normalizes the extracted fields to a vendor-neutral schema to ensure a seamless interoperability of enterprise resource planning and imposes arithmetic and accounting validation constraints. Experimental evaluation has shown improved retrieval of financial information and reducing numerical inconsistencies caused by OCR errors.