Looking for a dataset of electronic invoices with the following specs:
Type: Electronic invoices, not scanned documents; US invoices preferred
File type: PDF or JPG/PNG…
Quantity: At least 500 total invoices, preferably over 1,000
Additional details: The dataset needs to contain both correct and incorrect invoices. Incorrect invoices are ones that contain errors, inaccuracies, or other issues. Each file name should include a tag indicating whether the invoice is correct or incorrect. I'm not sure it's the best approach, but I would also be OK with two separate datasets: one of correct invoices and one of incorrect invoices.
I am also open to suggestions of sites or resources that have invoices for web scraping purposes.
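To make the naming requirement concrete, here is a rough sketch of how I would expect to read the labels back out of the file names. The exact convention is only an assumption on my part: the tag `correct` or `incorrect` appears as its own token in the name (for example `invoice_0001_correct.pdf`), and the function names are just placeholders. Any other consistent scheme would work too.

```python
import re
from pathlib import Path
from typing import Dict, Iterable, List, Optional

# Assumed (hypothetical) convention: "correct" or "incorrect" appears as a
# separate token in the file name, delimited by "_", "-", "." or the name edges.
_TAG = re.compile(r"(?:^|[_\-.])(incorrect|correct)(?:$|[_\-.])", re.IGNORECASE)


def label_from_name(filename: str) -> Optional[str]:
    """Return "correct", "incorrect", or None if the name carries no tag."""
    stem = Path(filename).stem  # drop the extension (.pdf, .jpg, .png)
    match = _TAG.search(stem)
    return match.group(1).lower() if match else None


def split_by_label(filenames: Iterable[str]) -> Dict[str, List[str]]:
    """Group file names into correct / incorrect / untagged buckets."""
    groups: Dict[str, List[str]] = {"correct": [], "incorrect": [], "untagged": []}
    for name in filenames:
        groups[label_from_name(name) or "untagged"].append(name)
    return groups
```

The tag is matched as a whole token on purpose, since a plain substring check for "correct" would also match every "incorrect" file.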
I am willing to provide additional details if it helps.
Thanks in advance!
submitted by /u/souley16