Pip package creation #3357

SkalskiP · 2021-05-26T17:25:55Z

🛠️ PR Summary

_{Made with ❤️ by Ultralytics Actions}

🌟 Summary

Enhancements to Ultralytics' YOLOv5 Docker, GitHub CI workflows, and Python compatibility checks.

📊 Key Changes

📁 Refactored the Python package structure to accommodate setup.py for easier installs and imports.
🐍 Expanded Python version testing (3.7, 3.8, 3.9) in CI workflows for improved compatibility checks.
📦 Introduced a new CI workflow specifically for CPU-based testing across different Python versions.
🛠 Git-ignore rules and Docker ignore rules were updated to reflect the new package structure.
🐳 Docker build improvements, adjusting paths within the container.
✅ The addition of setup.py makes it possible to install YOLOv5 as a local package using pip install -e ..

🎯 Purpose & Impact

These changes are geared towards making YOLOv5 more versatile and developer-friendly:
- Easier to check for compatibility with multiple Python versions.
- Simplifying the process of deploying on various environments with Docker.
- Facilitating the ease of local development and testing through package installation.
Users can look forward to a more robust, versatile tool, with streamlined contributions and deployment processes.
Contributors can enjoy an improved testing suite that covers a wider range of scenarios.

glenn-jocher · 2021-05-27T17:00:14Z

@sheromon hi I wanted to invite you to contribute to this pip package PR. I know you made some good progress in #2886, but we wanted to start a new PR with a few changes. I'd like to get you on the author list for the change as well though since you did some great work in #2886!

One issue we're having is deserializing the currently saved models (saved with the current dir structure) when loading them in the new PR structure. Is this a problem you had in your PR?

sheromon · 2021-05-27T21:31:11Z

Haha, thanks, @glenn-jocher. I'm not worried about getting credit, I just thought it could be helpful to have the yolov5 namespace. Unfortunately, I don't have any datapoints that would be helpful. I didn't cross-use models created with code using two different directory structures, if that makes sense. I think that's what you're talking about.

fcakyon · 2021-05-27T21:35:36Z

@SkalskiP yolov5_in_syspath contextmanager is a direct copy from my repo: https://github.com/fcakyon/yolov5-pip/blob/8249d6188f40cbe709167c37f2f6a063ad5d6c2f/yolov5/utils/general.py#L696

you could have at least give credit to the original author :)

fcakyon · 2021-05-27T21:36:52Z

@glenn-jocher i have fixed that issue with yolov5_in_syspath contextmanager: https://github.com/fcakyon/yolov5-pip/blob/8249d6188f40cbe709167c37f2f6a063ad5d6c2f/yolov5/utils/general.py#L696

sheromon · 2021-05-27T21:53:07Z

FWIW, in my ideal world, what's in @fcakyon's yolov5-pip repo would get combined with what's in ultralytics/yolov5, but if there are issues with that (and I think that's what I heard), then oh well. Also in my ideal world, I can get in shape by eating Nutella crepes for breakfast, lunch, and dinner, so yeah.

SkalskiP · 2021-05-28T07:05:45Z

Hi @fcakyon I'm very sorry that you felt that way. This is just a PR draft for now, where we are trying to consider potential solutions and test them. It is very possible that the target solution will be different. But I have added credits and a link to the original repository.

And yes @sheromon and @fcakyon. We are concerned about models that were serialized in the old directory structure but will now be deserialized in the new one. For some reason, we need to use it like that:

def attempt_load(weights, map_location=None, inplace=True):    
    with yolov5_in_syspath():        
        from models.yolo import Detect, Model

So we actually need to import from models.yolo import Detect, Model not from yolov5.models.yolo import Detect, Model and that it is slightly worrying.

glenn-jocher · 2021-05-28T13:58:54Z

@SkalskiP I looked through this a bit. detect.py fails because our example images directory was deleted:
Exception: ERROR: /Users/glennjocher/PycharmProjects/yolov5/yolov5/data/images does not exist

Do you know of a way to revert the two deleted files? If not we may need to start a new PR, otherwise the git package will store the newly added files separately than the deleted ones, growing our git download size which we don't want.

EDIT: Once I replace the deleted images detect.py works again:

EDIT2: I see bus.jpg is pretty large, 473kb, I wonder if we might want to pass it through tinyjpg. Inference results may be slightly worsened, but the filesize would reduce significantly. Unfortunately the git package would only grow though as mentioned before... but the pip package would be reduced right?

glenn-jocher · 2021-05-28T14:10:59Z

@SkalskiP if I comment out the with statement and change imports to:
from yolov5.models.yolo import Detect, Model

I don't get a serialization error, I get a model difference error:
AttributeError: 'Detect' object has no attribute 'inplace'

This is due to the PR Detect and Model modules having different types than the v5.0 release checkpoints in https://github.com/ultralytics/yolov5/releases/tag/v5.0

One option for fixing this would be load and re-save all official v5.0 models in a new v5.1 release. detect.py has this checkpoint update capability with the --update flag:

python detect.py --update

yolov5/detect.py

Lines 179 to 182 in ba6f3f9

    
           if opt.update:  # update all models (to fix SourceChangeWarning) 
        
               for opt.weights in ['yolov5s.pt', 'yolov5m.pt', 'yolov5l.pt', 'yolov5x.pt']: 
        
                   detect(opt=opt) 
        
                   strip_optimizer(opt.weights)

glenn-jocher · 2021-05-28T14:56:03Z

@SkalskiP I've cleaned up the PR a bit, removed some un-needed changes, and applied a few fixes. Everything seems to work correctly now with the sys path fix being required in only one location (attempt_load() in experimental.py).

I tested train, test, detect, hubconf, all work. It looks like we have significant conflicts built-up over the week of master updates though, and we need to figure out how best to revert the data/images directory deletion if possible.

EDIT: Perhaps rather than fix the conflicts would it make more sense to start a new PR with the same changes, retaining the data/images directory?

SkalskiP · 2021-05-28T15:21:06Z

Thanks, @glenn-jocher I'll take a look at those changes right now. Just from reading your comments:

data directory was not deleted, it was moved inside ytolov5 directory.
let me try to fix those conflicts

…ebase_to_prepare_for_pip_package_creation' into feature/restructuring_yolov5_codebase_to_prepare_for_pip_package_creation

fcakyon · 2021-05-29T14:50:19Z

@SkalskiP to entry points to work, you need to copy all changes from #3382

SkalskiP · 2021-05-29T14:58:44Z

@fcakyon doing that right now :) btw @fcakyon those are very good proposals <3

fcakyon · 2021-05-29T15:01:42Z

@SkalskiP i have a bit of experience with pip packaging, glad to be helpful here :)

SkalskiP · 2021-05-29T15:02:48Z

@glenn-jocher and @fcakyon my opinion is:

That we should not suggest users to install our project using requirements.txt. They should use setup.py. If they'll use pip package then great they use setup.py by default. If they clone repo they should do:

git clone https://github.com/ultralytics/yolov5
pip install -e .

I added script entry points suggested by @fcakyon. But even without them, there should not be a problem with running them both from yolov5 and yolov5/yolov5. There are ways to do it without sys.path.append, which - in my opinion - is unacceptable. Right now users can do it like that:

# from yolov5
python -m yolov5.detect --source yolov5/data/images/bus.jpg

# from yolov5/yolov5
python -m detect --source data/images/bus.jpg

# from yolov5
yolov5_detect --source yolov5/data/images/bus.jpg

# from yolov5/yolov5
yolov5_detect --source data/images/bus.jpg

# from yolov5
python yolov5/detect.py --source yolov5/data/images/bus.jpg

# from yolov5/yolov5
python detect.py --source data/images/bus.jpg

pip install -e . does much more than just install dependencies. For example, it does all sys operation for you.

SkalskiP · 2021-05-29T15:04:12Z

@glenn-jocher I cleaned up and removed those sys.path.append as well as is_pip function. All in all, we have support for 3.6.2-3.9 and we can run our script from multiple locations.

SkalskiP · 2021-05-29T16:59:20Z

@glenn-jocher, @fcakyon what do you thank? Are there are any more changes we should do? :)

glenn-jocher · 2021-05-29T18:55:56Z

@SkalskiP I see what you say in #3357 (comment), but in the real world users are going to do what they like, not what we tell them, so they will pip install -r requirements.txt as they've always done and then proceed directly to trying to run detect.py etc., which will shortly be followed by them raising a bug report when they see errors they don't understand, and then I'll have to spend my time explaining these breaking changes one by one to everyone:

…prepare_for_pip_package_creation

SkalskiP · 2021-05-29T22:14:47Z

I merged develop into feature/restructuring_yolov5_codebase_to_prepare_for_pip_package_creation so we are up to date with changes
@glenn-jocher I realize that you are the creator of this repository and can only advise you. But my opinion is as follows:

Remove requirements.txt from the repository and stick to setup.py. requirements.txt is redundant and will create confusion. (no requirements.txt -> no pip install -r requirements.txt)
Update README.md with proper instructions.
Stay away from sys.path.append if it's not necessary. (serialization - necessary; people do not know how to use python - not necessary)

glenn-jocher · 2021-05-30T19:01:42Z

@SkalskiP @fcakyon thanks for the updates guys!

I'm not sure what the best technical approach is here, but in terms of UX, we require minimization of breaking changes as an absolute priority. Remember we're not launching a new product here, we're updating a mid-lifecycle product, so this is not the appropriate moment for drastic changes to well established user workflows.

Keep in mind there are countless tutorials and videos on YouTube, Medium, Reddit, etc. about how to train and deploy YOLOv5 using universally established python norms like pip install -r requirements.txt and running simple commands like python train.py, so I'm very adverse to changes that would render all of these tutorials outdated or incorrect suddenly.

Our demographic also skews heavily towards novices in the field who may just be getting started with python, we don't want to create any barriers to entry or possible pain points, instead we want things to "just work" as Jobs used to say.

glenn-jocher · 2021-05-30T19:15:56Z

@SkalskiP @fcakyon maybe another option is to push as many updates as makes sense (such as #3382 for train, test, detect, export) to develop to better align the repo with the pip package requirements without actually taking the final step.

This might allow for easier maintenance for @fcakyon and give us some time to explore minimal-impact solutions with the least breaking changes. Then once we find the best solution the final pip PR would be trivial or at least less complicated.

SkalskiP · 2021-05-30T21:07:00Z

@glenn-jocher I understand all your concerns. However, in my opinion, pip package code distribution is an obvious step forward. This would allow many users to incorporate yolov5 into their projects much more easily.

Giving up this significant improvement for such (in my opinion) minor reasons as adding an extra level of directory depth and potentially (but not mandatorily) a different installation method is a mistake. As you know, yolov5 has quite a bit of technology debt, and if we don't have the ability and space to change anything in the future for similar reasons as we have today, it's hard to see how we can implement the plans we discussed.

I also do not quite understand the "just work" argument, as using setup.py basically guarantees stuff will work.

Regardless, if you want to take the "baby steps", then I would say, keep requirements.txt, use sys.path.append hack but don't cancel the pip feature all the way.

Let me know what you decided :)

SkalskiP · 2021-05-30T21:20:30Z

I took the time and I looked through some open source repositories and here is what I found:

PyTorchLightning/pytorch-lightning use setup.py + requirements.txt
qubvel/segmentation_models use setup.py + requirements.txt
huggingface/transformers use setup.py
huggingface/datasets use setup.py
facebookresearch/detectron2 use setup.py

I found more examples, but I think these few already illustrate quite well that pip package is a must-have and certainly a step in the right direction.

…prepare_for_pip_package_creation

…ebase_to_prepare_for_pip_package_creation' into feature/restructuring_yolov5_codebase_to_prepare_for_pip_package_creation # Conflicts: # .github/workflows/ci-testing-requirements-install.yml

SkalskiP · 2021-06-01T21:05:24Z

@glenn-jocher:

I added the latest changes from develop
I have implemented the changes we discussed. We have a separate CI for the scenario with installation via requirements.txt and setup.py. If you can take a peek and let me know if you have any more concerns.

…prepare_for_pip_package_creation # Conflicts: # requirements.txt # yolov5/utils/general.py

github-actions · 2021-09-06T00:13:22Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions YOLOv5 🚀 and Vision AI ⭐.

SkalskiP and others added 5 commits May 26, 2021 17:11

initial commit

91258b6

update

e4c40ae

detect.py is working

fccd101

add setup.py description

050a1a7

W&B comment

a6fd9f0

glenn-jocher changed the title ~~feature/restructuring_yolov5_codebase_to_prepare_for_pip_package_creation~~ Pip package creation May 27, 2021

add credits for yolov5_in_syspath

4c40c99

glenn-jocher added 2 commits May 28, 2021 15:49

check_requirements() fix

09a4c84

revert check_requirements() fix

76de106

glenn-jocher added 4 commits May 28, 2021 16:33

reverting changes to datasets.py (unnecessary)

617193a

reverting changes to detect.py (unnecessary)

e5242fb

cleanup

591f541

zidane.jpg dir fix

6c86bb7

glenn-jocher marked this pull request as ready for review May 28, 2021 14:57

check_requirements() path fix

389cb60

SkalskiP added 5 commits May 28, 2021 17:38

after merge of develop

7a27be0

bring back files that were deleted by accident

f8b759c

update ci-testing.yml

e32084c

update ci-testing.yml

b7ed3db

Merge remote-tracking branch 'origin/feature/restructuring_yolov5_cod…

cbe543f

…ebase_to_prepare_for_pip_package_creation' into feature/restructuring_yolov5_codebase_to_prepare_for_pip_package_creation

remove is_pip

d1c778b

add main to scripts

74acbe8

Merge branch 'develop' into feature/restructuring_yolov5_codebase_to_…

29fcc59

…prepare_for_pip_package_creation

SkalskiP requested a review from glenn-jocher May 29, 2021 22:20

SkalskiP added 5 commits June 1, 2021 18:18

Merge branch 'develop' into feature/restructuring_yolov5_codebase_to_…

1f186d8

…prepare_for_pip_package_creation

making sure that both new and old version of scripts will work

c1c51e2

making sure that both new and old version of scripts will work

074647a

Merge remote-tracking branch 'origin/feature/restructuring_yolov5_cod…

df50307

…ebase_to_prepare_for_pip_package_creation' into feature/restructuring_yolov5_codebase_to_prepare_for_pip_package_creation # Conflicts: # .github/workflows/ci-testing-requirements-install.yml

making sure that both new and old version of scripts will work

b58ebe5

SkalskiP added 2 commits June 1, 2021 23:14

fix setup.py

178a2bd

Merge branch 'develop' into feature/restructuring_yolov5_codebase_to_…

b2ad42e

…prepare_for_pip_package_creation # Conflicts: # requirements.txt # yolov5/utils/general.py

Base automatically changed from develop to master June 8, 2021 08:22

github-actions bot added the Stale label Sep 6, 2021

github-actions bot closed this Sep 12, 2021

glenn-jocher deleted the feature/restructuring_yolov5_codebase_to_prepare_for_pip_package_creation branch September 18, 2021 13:04

glenn-jocher mentioned this pull request Dec 6, 2021

Pip package #5897

Closed

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pip package creation #3357

Pip package creation #3357

SkalskiP commented May 26, 2021 •

edited by UltralyticsAssistant

Loading

glenn-jocher commented May 27, 2021

sheromon commented May 27, 2021

fcakyon commented May 27, 2021

fcakyon commented May 27, 2021

sheromon commented May 27, 2021

SkalskiP commented May 28, 2021

glenn-jocher commented May 28, 2021 •

edited

Loading

glenn-jocher commented May 28, 2021 •

edited

Loading

glenn-jocher commented May 28, 2021 •

edited

Loading

SkalskiP commented May 28, 2021

fcakyon commented May 29, 2021 •

edited

Loading

SkalskiP commented May 29, 2021

fcakyon commented May 29, 2021

SkalskiP commented May 29, 2021

SkalskiP commented May 29, 2021

SkalskiP commented May 29, 2021

glenn-jocher commented May 29, 2021 •

edited

Loading

SkalskiP commented May 29, 2021

glenn-jocher commented May 30, 2021

glenn-jocher commented May 30, 2021 •

edited

Loading

SkalskiP commented May 30, 2021

SkalskiP commented May 30, 2021

SkalskiP commented Jun 1, 2021

github-actions bot commented Sep 6, 2021

Pip package creation #3357

Pip package creation #3357

Conversation

SkalskiP commented May 26, 2021 • edited by UltralyticsAssistant Loading

🛠️ PR Summary

🌟 Summary

📊 Key Changes

🎯 Purpose & Impact

glenn-jocher commented May 27, 2021

sheromon commented May 27, 2021

fcakyon commented May 27, 2021

fcakyon commented May 27, 2021

sheromon commented May 27, 2021

SkalskiP commented May 28, 2021

glenn-jocher commented May 28, 2021 • edited Loading

glenn-jocher commented May 28, 2021 • edited Loading

glenn-jocher commented May 28, 2021 • edited Loading

SkalskiP commented May 28, 2021

fcakyon commented May 29, 2021 • edited Loading

SkalskiP commented May 29, 2021

fcakyon commented May 29, 2021

SkalskiP commented May 29, 2021

SkalskiP commented May 29, 2021

SkalskiP commented May 29, 2021

glenn-jocher commented May 29, 2021 • edited Loading

SkalskiP commented May 29, 2021

glenn-jocher commented May 30, 2021

glenn-jocher commented May 30, 2021 • edited Loading

SkalskiP commented May 30, 2021

SkalskiP commented May 30, 2021

SkalskiP commented Jun 1, 2021

github-actions bot commented Sep 6, 2021

SkalskiP commented May 26, 2021 •

edited by UltralyticsAssistant

Loading

glenn-jocher commented May 28, 2021 •

edited

Loading

glenn-jocher commented May 28, 2021 •

edited

Loading

glenn-jocher commented May 28, 2021 •

edited

Loading

fcakyon commented May 29, 2021 •

edited

Loading

glenn-jocher commented May 29, 2021 •

edited

Loading

glenn-jocher commented May 30, 2021 •

edited

Loading