ReceiptManager/receipt-parser-legacy

Support for PDF receipts

Open

#32 opened on Oct 26, 2019

View on GitHub
 (1 comment) (1 reaction) (0 assignees)Python (762 stars) (192 forks)batch import
enhancementhelp wanted

Description

Not sure if this use case is shared among others: I use Scanbot to scan my receipts as multi-page PDFs. Would be great if this tool could work on these pdfs.

Scanbot does a sort of OCR itself, but it doesn't seem to be that good, in the sense that it adds too much noise: a receipt contains so much text, and I'm only interested in the articles, price per article, to see price evolution across multiple weeks.

Contributor guide