`ultralytics 8.1.39` add YOLO-World training #9268

Laughing-q · 2024-03-24T13:08:35Z

Usage:

from ultralytics import YOLOWorld

model = YOLOWorld("yolov8s-worldv2.pt")
model.train(data="coco128.yaml")

TODO:

LVIS docs
Verify WorldTrainerFromScratch

🛠️ PR Summary

_{Made with ❤️ by Ultralytics Actions}

🌟 Summary

Enhancements in LVIS dataset support, model training updates, and technical refinements across the Ultralytics framework.

📊 Key Changes

Added support and detailed documentation for the LVIS dataset.
Updated CLIP model installation to use the Ultralytics repository.
Introduced refactorings in model training for the YOLO-World architecture, including enhanced CLIP model support.
Refined data augmentation processes, particularly for multi-modal (image + text) training scenarios.
Made improvements in dataset management, including better cache file handling and support for additional datasets.
Adjustments in model and dataset configurations for more accurate training results.

🎯 Purpose & Impact

Enhanced Dataset Support: Including LVIS enhances the model's ability to train on a wider range of object categories, improving versatility and accuracy.
Model Training and Evaluation Improvements: Updates in model training, especially with the introduction of YOLO-World-related features, pave the way for more advanced multi-modal learning capabilities.
Robust Data Handling: Improved dataset caching and configurations streamline the data preparation process, making model training more efficient.
Technical Refinements: General code enhancements contribute to a more robust, efficient, and flexible machine learning pipeline.

Signed-off-by: Glenn Jocher <glenn.jocher@ultralytics.com>

glenn-jocher · 2024-03-29T01:28:28Z

@Laughing-q hey I looked through this, this is really nice and comprehensive, including added docs and reference section pages.

I added a few missing docstrings and updated the val.py class_map (I think it needs to be zero-indexed), but that's the only changes I've made. Are there are sections you're not sure about, or is this ready to merge to main now?

Laughing-q · 2024-03-29T01:53:14Z

@glenn-jocher Thanks for reviewing! I updated the class_map to start from index 1 because that's what LVIS data needs when I intended to save json and manually validated mAP by using their API. Then I figured that for user cases they might not really care about whether the index starts from 0 or 1 so I eventually pushed the update. Do you think we should keep the index starting from 0? then probably create a class_map for LVIS just like we did for COCO?

EDIT: This reminds me we probably want do the same thing to LVIS evaluation as well, introduce a is_lvis variable maybe and do evaluation by using LVIS api when it's True.

Laughing-q · 2024-03-29T01:57:00Z

@glenn-jocher rest of the PR I think is all good, but since you were asking, I feel like it's better to hold it for another several hours and let me check if there're any places to improve, since it's a PR adding more than 2000 lines(1200 lines are from lvis.yaml though..). What do you think?

Laughing-q · 2024-03-29T07:59:12Z

@glenn-jocher ok I've added lvis api support to evaluate the final results just like we did for coco 9b2ecaf.
Also I add an extra +1 for category_id for lvis dataset while saving predictions in json format so it can be evaluated correctly e76a479.
I think the PR is all set! :)

glenn-jocher · 2024-03-31T14:30:42Z

@Laughing-q PR merged!!

Signed-off-by: Glenn Jocher <glenn.jocher@ultralytics.com> Co-authored-by: UltralyticsAssistant <web@ultralytics.com> Co-authored-by: Glenn Jocher <glenn.jocher@ultralytics.com>

ccl-private · 2024-06-28T09:18:31Z

@Laughing-q Hi, can you help me take a look at this issue? #13793

Laughing-q added 30 commits February 20, 2024 16:34

move functions

05685eb

add YOLOMultiModalDataset

0df488d

fix

9c68108

fix

2d85967

add trainer

3572bd7

move text_model to trainer

6a91d9f

update param init&& fix augmentation

d2a4ad2

fix ema

dda2831

fix validate

348a3bd

update

f993469

add Grounding dataset

18af796

fix

9051d07

add YOLOConcatDataset

7174002

fix

1ae599a

fix

334aa8c

fix

83ec3ed

fix

eb823ec

add lvis.yaml

5bc77cd

update

9aa1840

update dataset.py

3569c29

fix

5e33e71

update docstring

e76d6af

update

d90e5a7

add WorldTrainerFromScratch

b3de099

clean up

2e9b3ec

fix lvis.yaml

e7d4059

update WorldTrainerFromScratch

a40dabf

Merge branch 'main' into yolo-world-training

8530b9a

update

8d53066

update

1e4d6f5

glenn-jocher added 2 commits March 29, 2024 02:22

Update val.py

10c8e1b

Improved stop epochs robustness

086514e

Signed-off-by: Glenn Jocher <glenn.jocher@ultralytics.com>

glenn-jocher and others added 2 commits March 29, 2024 03:21

Merge branch 'main' into yolo-world-training

132a071

update minival

29afd43

Laughing-q force-pushed the yolo-world-training branch from 1c17e33 to 9b2ecaf Compare March 29, 2024 06:28

Laughing-q added 5 commits March 29, 2024 14:29

add lvis evaluation

9b2ecaf

fix

e76a479

update tests

c64d52d

attempt to fix final_eval

1585655

update val.py

d1e2f34

glenn-jocher added 6 commits March 31, 2024 00:13

Merge branch 'main' into yolo-world-training

549d986

Merge branch 'main' into yolo-world-training

21d1af3

Merge branch 'main' into yolo-world-training

0d67755

Merge branch 'main' into yolo-world-training

8c3291b

Merge branch 'main' into yolo-world-training

a6cb665

Merge branch 'main' into yolo-world-training

b10aef0

glenn-jocher changed the title ~~YOLO-World: Add training support~~ ultralytics 8.1.39 add YOLO-World training support Mar 31, 2024

Update __init__.py

fb99ef1

glenn-jocher changed the title ~~ultralytics 8.1.39 add YOLO-World training support~~ ultralytics 8.1.39 add YOLO-World training Mar 31, 2024

glenn-jocher merged commit e9187c1 into main Mar 31, 2024
13 checks passed

glenn-jocher deleted the yolo-world-training branch March 31, 2024 14:30

glenn-jocher removed the TODO Items that needs completing label Mar 31, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`ultralytics 8.1.39` add YOLO-World training #9268

`ultralytics 8.1.39` add YOLO-World training #9268

Laughing-q commented Mar 24, 2024 •

edited by github-actions bot

Loading

glenn-jocher commented Mar 29, 2024 •

edited

Loading

Laughing-q commented Mar 29, 2024 •

edited

Loading

Laughing-q commented Mar 29, 2024

Laughing-q commented Mar 29, 2024 •

edited

Loading

glenn-jocher commented Mar 31, 2024

ccl-private commented Jun 28, 2024

ultralytics 8.1.39 add YOLO-World training #9268

ultralytics 8.1.39 add YOLO-World training #9268

Conversation

Laughing-q commented Mar 24, 2024 • edited by github-actions bot Loading

🛠️ PR Summary

🌟 Summary

📊 Key Changes

🎯 Purpose & Impact

glenn-jocher commented Mar 29, 2024 • edited Loading

Laughing-q commented Mar 29, 2024 • edited Loading

Laughing-q commented Mar 29, 2024

Laughing-q commented Mar 29, 2024 • edited Loading

glenn-jocher commented Mar 31, 2024

ccl-private commented Jun 28, 2024

`ultralytics 8.1.39` add YOLO-World training #9268

`ultralytics 8.1.39` add YOLO-World training #9268

Laughing-q commented Mar 24, 2024 •

edited by github-actions bot

Loading

glenn-jocher commented Mar 29, 2024 •

edited

Loading

Laughing-q commented Mar 29, 2024 •

edited

Loading

Laughing-q commented Mar 29, 2024 •

edited

Loading