Created by: duskybomb
Added invoice2data (A modular Python library to extract data from PDF invoices)
What is this Python project?
A modular Python library to support your accounting process.
- extracts text from PDF files using different techniques, like pdftotext, pdfminer or tesseract OCR.
- searches for regex in the result using a YAML-based template system
- saves results as CSV, JSON or XML or renames PDF files to match the content.
What's the difference between this Python project and similar ones?
I haven't come across any such project in python
--
Anyone who agrees with this pull request could vote for it by adding a