Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

train.py file doesn’t exist #5

Open
ssatz opened this issue Apr 27, 2024 · 5 comments
Open

train.py file doesn’t exist #5

ssatz opened this issue Apr 27, 2024 · 5 comments

Comments

@ssatz
Copy link

ssatz commented Apr 27, 2024

Hi, thanks for the repo. I am trying to train the pub1m table structure on yolo8. I couldn’t find the train.py file.
Label table row accuracy is no at all good. Any suggestions

@ssatz ssatz changed the title train.py file doesn’t exisr train.py file doesn’t exist Apr 27, 2024
@whn09
Copy link
Owner

whn09 commented May 27, 2024

I used Yolov5 to train the model, but I think Yolov8 is similar

@ssatz
Copy link
Author

ssatz commented May 27, 2024

hi thanks this has been solved. I looked into wrong Yolo version.
I think you didn't train the Fintab Dataset?

@whn09
Copy link
Owner

whn09 commented May 28, 2024

Yes, I didn't train the Fintab Dataset. But maybe you can merge the dataset with PubTables-1M. I also recommend you to set Yolo size to at least 800 to get best performance.

@ssatz
Copy link
Author

ssatz commented May 28, 2024

Yes I just trained for fintab with 640 size and table detection is not good and getting short rows. I will try it with 800.

@ssatz
Copy link
Author

ssatz commented Jun 25, 2024

I trained both Fintab and PubTables- 1m and normal structure the results are good and i think need to train PaddleOCR for text detection.

Here is the Yolo format data https://huggingface.co/datasets/Codeplug/pub-fintab-yolov
Metrics:

image

image

For training the dataset we have used Runpod

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants