The only constraint of the service is that the PDF must be ‘application generated’, i.e. produced directly by an application rather than by scanning. Application-generated PDFs almost always carry the raw invoice data within the PDF itself. Our service reads the data directly from the PDF and maps it to an e-invoice structure.
If the PDF is generated by scanning a paper invoice, it contains only an image, or ‘photo’, of the invoice. Unfortunately, the only way to process an image PDF is to use OCR (Optical Character Recognition) technology, which by its very nature cannot guarantee data quality without a human operator to review the results. For image files, we offer our Cloud Capture service, which takes advantage of the market-leading OCR engine to capture the relevant invoice data.
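As a rough illustration of the difference between the two kinds of PDF, the sketch below guesses whether a file is application generated or an image PDF by looking for font and image markers in its raw bytes. It is not how our service works internally: the function name `classify_pdf` and its return labels are hypothetical, and the check is only a heuristic. It can be fooled by PDFs that store their objects in compressed streams, and by scanned PDFs that already carry an OCR text layer.

```python
import re


def classify_pdf(data: bytes) -> str:
    """Guess the kind of PDF from raw byte markers (hypothetical helper).

    Returns "application-generated" if a font resource is referenced,
    "image" if only image objects are found, otherwise "unknown".
    """
    # Text drawn by an application needs at least one font resource.
    has_font = re.search(rb"/Font\b", data) is not None
    # Scanned pages are normally stored as image XObjects.
    has_image = re.search(rb"/Subtype\s*/Image\b", data) is not None

    if has_font:
        return "application-generated"
    if has_image:
        return "image"
    # Markers may be hidden inside compressed object streams.
    return "unknown"
```

In practice the bytes would come from reading the file, for example `classify_pdf(open("invoice.pdf", "rb").read())`, where `invoice.pdf` is a placeholder name. A file reported as "image" would be a candidate for OCR rather than direct data extraction.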