Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve inference speed on sagemaker serverless while preserving accuracy #106

Closed
rbavery opened this issue Mar 27, 2023 · 1 comment
Closed

Comments

@rbavery
Copy link
Contributor

rbavery commented Mar 27, 2023

Goal: get to 2-3 seconds per image.

according to yolov5 benchmarks and our past tests, inference speeds are possible, but with potential requirements to compromise on image size: ultralytics/yolov5#6613

PR: ultralytics/yolov5#6613

@nathanielrindlaub
Copy link
Member

We got to ~3.5 seconds per image with the ONNX-compiled model deployed on Sagemaker Serverless - so chalking that up to a win for now (see #105). Nice job @rbavery thanks for your persistence on this!!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants