Accelerate video inference #9487

mucunwuxian · 2022-09-19T06:38:29Z

@glenn-jocher
I hope this PR finds you well.
I adjusted the video-reading code to use 'cap.grab()' and 'cap.retrieve()'.
It's a small fix.

🛠️ PR Summary

Made with ❤️ by Ultralytics Actions

🌟 Summary

Improved video frame retrieval in data loading process for YOLOv5.

📊 Key Changes

Replaced a single cap.read() call with cap.grab() called in a loop, followed by cap.retrieve() for video frames.
The number of times cap.grab() is called is determined by self.vid_stride, ensuring that frames are skipped according to the stride specified.
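The change described above can be sketched as follows. This is a minimal illustration, not the merged code: `read_with_stride` and `FakeCapture` are hypothetical names, the early exit on a failed `grab()` is an added assumption, and `FakeCapture` only mimics the `grab()`/`retrieve()` return shapes of `cv2.VideoCapture` so the sketch runs without OpenCV.

```python
def read_with_stride(cap, vid_stride=1):
    """Advance vid_stride frames with grab(), then return only the last one."""
    for _ in range(vid_stride):
        if not cap.grab():  # grab() moves to the next frame; no image is returned
            return False, None
    return cap.retrieve()  # retrieve() returns the most recently grabbed frame


class FakeCapture:
    """Hypothetical stand-in for cv2.VideoCapture (frame indices instead of images)."""

    def __init__(self, n_frames):
        self.n_frames = n_frames
        self.pos = -1  # index of the most recently grabbed frame

    def grab(self):
        if self.pos + 1 >= self.n_frames:
            return False  # end of video
        self.pos += 1
        return True

    def retrieve(self):
        return True, self.pos  # a real capture would return an image array here


cap = FakeCapture(n_frames=10)
frames = []
while True:
    ok, frame = read_with_stride(cap, vid_stride=4)
    if not ok:
        break
    frames.append(frame)
print(frames)  # → [3, 7]
```

With a real video, `cap` would be a `cv2.VideoCapture` and `frame` an image array; only one frame per stride is returned to the caller.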

🎯 Purpose & Impact

🎯 The update optimizes video processing by skipping over frames more efficiently when a stride is used.
✨ This can lead to better memory management and potentially faster data loading, especially when processing videos with high frame rates.
🚀 Users can expect more consistent performance when training YOLOv5 on video datasets with frame skipping.


github-actions

👋 Hello @mucunwuxian, thank you for submitting a YOLOv5 🚀 PR! To allow your work to be integrated as seamlessly as possible, we advise you to:

✅ Verify your PR is up-to-date with ultralytics/yolov5 master branch. If your PR is behind you can update your code by clicking the 'Update branch' button or by running git pull and git merge master locally.
✅ Verify all YOLOv5 Continuous Integration (CI) checks are passing.
✅ Reduce changes to the absolute minimum required for your bug fix or feature addition. "It is not daily increase but daily decrease, hack away the unessential. The closer to the source, the less wastage there is." — Bruce Lee

glenn-jocher · 2022-09-19T09:53:40Z

@mucunwuxian thanks for the PR! Do you have any before and after results/profiling?

mucunwuxian · 2022-09-19T11:12:32Z

@glenn-jocher
Thank you!
I don't have any profiling results yet, but it feels about 10 times faster.
I think it is faster because moving the read cursor this way is more efficient.
If you want results/profiling, I can prepare them.

glenn-jocher · 2022-09-19T11:30:46Z

@mucunwuxian oh that's strange. Are you sure it's not just faster because it's skipping more frames?

glenn-jocher · 2022-09-19T11:37:11Z

@mucunwuxian I tested and it is much faster! 4.1s vs 6.6s on M2 CPU, probably even faster on GPU.

glenn-jocher · 2022-09-19T11:38:31Z

@mucunwuxian I thought .grab() actually loaded the data though and was much slower than .retrieve(). Shouldn't we retrieve multiple frames before grabbing the one we want for the fastest speed?

EDIT: this is backwards.

mucunwuxian · 2022-09-19T11:40:03Z

@glenn-jocher
In my case, I ran detect.py with --vid_stride 4 both before and after the change.
In that comparison, it felt about 10 times faster.
Perhaps self.cap.set(cv2.CAP_PROP_POS_FRAMES, self.vid_stride * (self.frame + 1)) # read at vid_stride is slow,
especially if the video has a large number of frames.

Give me a minute.
I'll measure it now.
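A comparison of this kind could be measured along the following lines. This is a rough sketch under stated assumptions: `read_by_seek` assumes the pre-PR code called `cap.read()` and then the `cap.set(...)` line quoted above, `read_by_grab` and `time_reader` are hypothetical helper names, and `cap` is any object exposing the `read`/`set`/`grab`/`retrieve` methods of `cv2.VideoCapture`.

```python
import time

CAP_PROP_POS_FRAMES = 1  # numeric value of cv2.CAP_PROP_POS_FRAMES


def read_by_seek(cap, frame_idx, vid_stride):
    """Assumed pre-PR pattern: decode one frame, then seek to the next strided position."""
    ok, im = cap.read()
    cap.set(CAP_PROP_POS_FRAMES, vid_stride * (frame_idx + 1))
    return ok, im


def read_by_grab(cap, frame_idx, vid_stride):
    """Grab-based pattern: grab vid_stride frames, retrieve only the last one."""
    for _ in range(vid_stride):
        if not cap.grab():
            return False, None  # end of video
    return cap.retrieve()


def time_reader(reader, cap, vid_stride, max_reads):
    """Return (successful reads, elapsed seconds) for up to max_reads strided reads."""
    start = time.perf_counter()
    reads = 0
    while reads < max_reads:
        ok, _ = reader(cap, reads, vid_stride)
        if not ok:
            break
        reads += 1
    return reads, time.perf_counter() - start
```

With OpenCV installed, one would open the same file twice with `cv2.VideoCapture` and call `time_reader` once per reader with the same `vid_stride`; the elapsed times will depend on the codec, the backend, and the hardware.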

glenn-jocher · 2022-09-19T11:41:42Z

@mucunwuxian I think we want the same behavior as the streamloader here, where it grabs every frame and only retrieves frames when n % vid_stride == 0:

yolov5/utils/dataloaders.py

Lines 336 to 340 in 868c0e9

    while cap.isOpened() and n < f:
        n += 1
        cap.grab()  # .read() = .grab() followed by .retrieve()
        if n % self.vid_stride == 0:
            success, im = cap.retrieve()

Signed-off-by: Glenn Jocher <glenn.jocher@ultralytics.com>

glenn-jocher · 2022-09-19T11:56:29Z

@mucunwuxian ok, I've aligned the behavior with LoadStreams. Same speed, slightly shorter code.

mucunwuxian · 2022-09-19T11:56:30Z

@glenn-jocher
I agree with you!

glenn-jocher · 2022-09-19T12:01:55Z

@mucunwuxian PR is merged. Thank you for your contributions to YOLOv5 🚀 and Vision AI ⭐

mucunwuxian · 2022-09-19T12:02:19Z

@glenn-jocher
OK!

By the way, I did it the same way at the beginning.
e6d9ced

So it's essentially correct, but the output differs from that of the original code.

But if you, the author, are okay with it, then I think it's okay. :-D

glenn-jocher · 2022-09-19T12:03:29Z

@mucunwuxian oh got it!

Yes, this way may skip the first few frames rather than always using the first frame, but this is the behavior of LoadStreams, so now they are both aligned.
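The difference in which frames get used can be illustrated with plain index arithmetic. This is a sketch under assumptions, not code from the repository: indices are 0-based, the seek in the original code is assumed to land exactly on the requested frame, and both function names are hypothetical.

```python
def frames_original(n_frames, vid_stride):
    """Seek-based reader: use frame 0, then every vid_stride-th frame after it."""
    return list(range(0, n_frames, vid_stride))


def frames_aligned(n_frames, vid_stride):
    """Grab-based reader: grab vid_stride frames, use the last one of each group."""
    return list(range(vid_stride - 1, n_frames, vid_stride))


print(frames_original(12, 4))  # → [0, 4, 8]
print(frames_aligned(12, 4))   # → [3, 7, 11]
```

With `vid_stride=1` both functions return every frame, so the two behaviors only diverge when frames are actually skipped.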

mucunwuxian · 2022-09-19T12:05:29Z

@glenn-jocher
Great!
That’s very helpful, I can learn a lot from you!

glenn-jocher · 2023-11-15T09:50:47Z

@mucunwuxian I appreciate your kind words! The YOLO community and the Ultralytics team have been invaluable in this endeavor. Feel free to reach out if you have any further questions or contributions.

mucunwuxian added 2 commits September 19, 2022 15:19

The following code is slow, "self.cap.set(cv2.CAP_PROP_POS_FRAMES, self.vid_stride * (self.frame + 1)) # read at vid_stride".

e6d9ced

adjust...

26a2a1e

github-actions bot reviewed Sep 19, 2022

Merge branch 'master' into master

6e07cf6

Merge branch 'master' into master

313b3ea

Update dataloaders.py

8b04788

Signed-off-by: Glenn Jocher <glenn.jocher@ultralytics.com>

glenn-jocher merged commit 1164069 into ultralytics:master Sep 19, 2022

Hojland mentioned this pull request Oct 17, 2022

feat/bump Go-Autonomous/yolov5#15

Merged
