Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Model Cant be load and stuck at optimize #742

Open
1 task done
ImannKamal opened this issue Jun 26, 2024 · 5 comments
Open
1 task done

Model Cant be load and stuck at optimize #742

ImannKamal opened this issue Jun 26, 2024 · 5 comments
Assignees
Labels
bug Something isn't working

Comments

@ImannKamal
Copy link

Search before asking

  • I have searched the HUB issues and found no similar bug report.

HUB Component

Models

Bug

Model Cant be load and stuck at optimize, I try to wait for the first time it stuck at optimize and the system ask me to train it again at epoch 99 but when I train it again it still have the warning unable to load to best model.
model

Environment

No response

Minimal Reproducible Example

No response

Additional

No response

@ImannKamal ImannKamal added the bug Something isn't working label Jun 26, 2024
Copy link

👋 Hello @ImannKamal, thank you for raising an issue about Ultralytics HUB 🚀! Please visit our HUB Docs to learn more:

  • Quickstart. Start training and deploying YOLO models with HUB in seconds.
  • Datasets: Preparing and Uploading. Learn how to prepare and upload your datasets to HUB in YOLO format.
  • Projects: Creating and Managing. Group your models into projects for improved organization.
  • Models: Training and Exporting. Train YOLOv5 and YOLOv8 models on your custom datasets and export them to various formats for deployment.
  • Integrations. Explore different integration options for your trained models, such as TensorFlow, ONNX, OpenVINO, CoreML, and PaddlePaddle.
  • Ultralytics HUB App. Learn about the Ultralytics App for iOS and Android, which allows you to run models directly on your mobile device.
    • iOS. Learn about YOLO CoreML models accelerated on Apple's Neural Engine on iPhones and iPads.
    • Android. Explore TFLite acceleration on mobile devices.
  • Inference API. Understand how to use the Inference API for running your trained models in the cloud to generate predictions.

If this is a 🐛 Bug Report, please provide screenshots and steps to reproduce your problem to help us get started working on a fix.

If this is a ❓ Question, please provide as much information as possible, including dataset, model, environment details etc. so that we might provide the most helpful response.

We try to respond to all issues as promptly as possible. Thank you for your patience!

@ImannKamal
Copy link
Author

fail
this is the third time i try to training the model and it still stuck and cant deploy the model

@ultralytics ultralytics deleted a comment from pderrenger Jun 26, 2024
@sergiuwaxmann
Copy link
Member

@ImannKamal Hello!
Can you share the model ID with me so I can investigate this further?

@sergiuwaxmann sergiuwaxmann self-assigned this Jun 26, 2024
@ImannKamal
Copy link
Author

Member
https://hub.ultralytics.com/models/T16PM3Aeq2HdX8C10t0u
For your reference, Im training the model using google colab. So, i think all the package is up to date because the hub auto update right?

@sergiuwaxmann
Copy link
Member

@ImannKamal I just checked and it looks like Ultralytics HUB didn't receive the final weights, reason why it shows "Disconnected" - most likely because of the internet connection. It looks like the last checkpoint is for epoch 99 (which is the final epoch).
Maybe you can try resuming the training? However, I am not sure if ultralytics will simply optimize the weights or try to resume which might not be possible as you are resuming the training from the final epoch.
I logged this bug and our team will work on fixing it as fast as possible.
Thank you for understanding!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants