Apply Transforms Field to Bounding Boxes #295

zkalson · 2024-03-31T00:59:44Z

I understand that Document AI performs preprocessing to correct for issues like skew in documents sent to the API, and that any transformations applied are provided in the response under the transforms field. It would be massively helpful to have a function I can call on the Document object to undo the preprocessing so that the bounding boxes are relative to the submitted document, not the preprocessed document.

I've spent a few weeks trying to implement this myself (and have a support request that's been in limbo for about a month and a half), and unfortunately haven't been able to make any progress. Admittedly, I have pretty limited experience with OpenCV, so I may be missing something.

Attached are images of a document I uploaded to GCP, the corresponding preprocessed image that GCP returns in the images field, and the output of the bounding boxes when I attempt to apply the transforms. If you look closely at the text layer, it doesn't match up with the original image.
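
For reference, here is a minimal sketch of the kind of helper being requested, using only NumPy. It is not part of the Document AI Toolbox. The function names (`decode_matrix`, `to_homogeneous`, `undo_transforms`) are hypothetical, and it rests on several assumptions that have not been confirmed against the API: that each entry in transforms is an OpenCV-style matrix (2x3 affine or 3x3 perspective) mapping original-image pixel coordinates to preprocessed-image pixel coordinates, that the matrices were applied in list order, and that the matrix payload is a single-channel `CV_32F` or `CV_64F` buffer.

```python
import numpy as np

# OpenCV single-channel type codes (assumption: the matrix uses one of these).
_CV_DTYPES = {5: np.float32, 6: np.float64}  # CV_32F, CV_64F


def decode_matrix(rows, cols, cv_type, data):
    """Decode a raw matrix payload (rows, cols, OpenCV type, bytes) into an array."""
    if cv_type not in _CV_DTYPES:
        raise ValueError(f"unsupported OpenCV type code: {cv_type}")
    return np.frombuffer(data, dtype=_CV_DTYPES[cv_type]).reshape(rows, cols)


def to_homogeneous(matrix):
    """Promote a 2x3 affine matrix to 3x3; pass a 3x3 matrix through unchanged."""
    m = np.asarray(matrix, dtype=np.float64)
    if m.shape == (2, 3):
        m = np.vstack([m, [0.0, 0.0, 1.0]])
    if m.shape != (3, 3):
        raise ValueError(f"expected a 2x3 or 3x3 matrix, got {m.shape}")
    return m


def undo_transforms(points, transforms):
    """Map (x, y) pixel points from the preprocessed image back to the original.

    Assumes each matrix maps original -> preprocessed and that the matrices
    were applied in list order, so they are inverted in reverse order.
    """
    pts = np.asarray(points, dtype=np.float64)
    homog = np.hstack([pts, np.ones((len(pts), 1))])
    for matrix in reversed(transforms):
        inverse = np.linalg.inv(to_homogeneous(matrix))
        homog = homog @ inverse.T
        homog = homog / homog[:, 2:3]  # renormalize for perspective matrices
    return homog[:, :2]
```

If the bounding boxes are taken from normalized vertices, they would first need to be scaled by the preprocessed image's width and height to get pixel coordinates, and the result divided by the original image's dimensions to normalize again. If the text layer still does not line up, the direction or ordering assumption above is the first thing to check.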

blunderbuss-gcf bot assigned nayaknishant Mar 31, 2024

zkalson mentioned this issue Apr 18, 2024

Convert Document AI Object to Preserve Layout Text? #159

Open
