How to handle multi page invoices #279

DNFSiF · 2024-01-06T08:32:14Z

I am trying to use Donut with a custom invoice dataset to extract fields such as invoice numbers and totals.
The invoices can be single-page or multi-page, so the fields may be spread across different pages.

Does anyone have experience with multi-page invoices?
Should I merge the pages into a single image?
Should I train different models for different page counts?

Thanks for any advice! 😄

felixvor · 2024-02-02T20:29:47Z

You could increase the input dimensions and forward multiple pages as one image, but that does not scale well, and no hardware can realistically handle the compute for more than a few pages. What we did instead was assign each value we wanted to label to a page using fuzzy matching against the OCR text (for example with a library like rapidfuzz). If we find the value as a substring of a page's OCR text, we label that page for Donut training. Maybe that helps you, good luck!
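A minimal sketch of the page-assignment idea above, not the exact code used in that project. To stay dependency-free it swaps in the standard library's `difflib.SequenceMatcher` with a sliding window; with rapidfuzz installed, `rapidfuzz.fuzz.partial_ratio` (which scores 0 to 100) could play the same role. The helper names, the sample OCR text, and the 0.85 threshold are all assumptions made for illustration.

```python
from difflib import SequenceMatcher
from typing import List


def best_window_ratio(needle: str, haystack: str) -> float:
    """Best similarity (0..1) between needle and any equally long window of haystack."""
    needle, haystack = needle.lower(), haystack.lower()
    if not needle:
        return 0.0
    n = len(needle)
    if len(haystack) <= n:
        return SequenceMatcher(None, needle, haystack).ratio()
    # Slide a window of the needle's length over the OCR text and keep the best score.
    return max(
        SequenceMatcher(None, needle, haystack[i:i + n]).ratio()
        for i in range(len(haystack) - n + 1)
    )


def pages_containing(value: str, page_texts: List[str], threshold: float = 0.85) -> List[int]:
    """Indices of pages whose OCR text approximately contains the labeled value."""
    return [
        index
        for index, text in enumerate(page_texts)
        if best_window_ratio(value, text) >= threshold
    ]


# Hypothetical OCR output for a two-page invoice; note the OCR error "2O24".
page_texts = [
    "Invoice No: INV-2O24-001\nDate: 2024-01-06",
    "Total due: 1,250.00 EUR",
]
print(pages_containing("INV-2024-001", page_texts))
print(pages_containing("1,250.00", page_texts))
```

Each ground-truth field would then be attached only to the pages returned for it, so every training sample stays a single page. The sliding window is quadratic in the worst case, which is one reason a dedicated library is the more practical choice on long OCR text.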

balajiChundi · 2024-02-07T03:15:13Z

If you define your use case as "sending in multiple pages for each request", the model's max_position_embeddings (you might have to tune this parameter) might not be sufficient to fit all the information in a single response, and repeated text becomes more likely. Instead, you can build a model that predicts on a single page at a time and combine the predictions afterwards.

xdevfaheem · 2024-02-20T14:12:06Z

@balajiChundi can you elaborate a bit on what you mean?

balajiChundi · 2024-04-22T15:41:23Z

First, and preferred: get predictions from the model once per page (so twice for a two-page invoice). You can parallelize the per-page predictions for faster output. PS: This worked for me.
Second (didn't work for me): I concatenated the page images by stitching them vertically and trained the model on the result. The problem is that data preparation is clumsy and time-consuming, and it is hard to settle on a max_token value for the output, so I don't recommend this at all.
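A rough sketch of the first, per-page approach, under stated assumptions: `predict_page` is a hypothetical callable standing in for whatever runs the single-page Donut model and returns a dict of fields, and the merge rule (the first non-empty value for a field wins, in page order) is an illustrative choice rather than anything prescribed in this thread. A thread pool is used here only to show the parallel step; batching pages on a GPU would be another option.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, List, Optional, TypeVar

Page = TypeVar("Page")
PagePrediction = Dict[str, Optional[str]]


def merge_page_predictions(per_page: List[PagePrediction]) -> Dict[str, str]:
    """Combine per-page field dicts; the first non-empty value for a field wins."""
    merged: Dict[str, str] = {}
    for prediction in per_page:  # pages are visited in document order
        for field, value in prediction.items():
            if value and field not in merged:
                merged[field] = value
    return merged


def predict_invoice(
    pages: List[Page],
    predict_page: Callable[[Page], PagePrediction],  # hypothetical single-page model call
    max_workers: int = 4,
) -> Dict[str, str]:
    """Run the single-page predictor on every page in parallel, then merge the results."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        per_page = list(pool.map(predict_page, pages))  # map() preserves page order
    return merge_page_predictions(per_page)


# Stand-in predictor so the sketch runs without a model: a lookup of canned outputs.
fake_outputs = {
    "page-1.png": {"invoice_number": "INV-1", "total": ""},
    "page-2.png": {"invoice_number": None, "total": "99.00"},
}
print(predict_invoice(["page-1.png", "page-2.png"], fake_outputs.__getitem__))
```

For fields such as totals that usually sit on the last page, a "last non-empty value wins" rule might fit better; the merge function is the place to encode that kind of per-field policy.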
