Data Augmentation update #1263

torzdf · 2022-08-21T17:41:09Z

Updates the data augmentation pipeline to clean up some bugs and add some optimizations.

Bugs fixed:

- Preview needed to be refreshed twice to update the images
- Multiple threads could read/write to the same memory pointer, leading to image corruption
- Warp/warp to landmarks behaved differently depending on training image size
- Masks were sometimes slightly misaligned

Optimizations:

- Minor optimizations across the whole augmentation pipeline
- Move more processing into the caching stage (slows down caching, but speeds up subsequent iterations)
- General refactoring to make the code more maintainable
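
The caching trade-off can be sketched roughly as follows: pay for the preprocessing once, when an image first enters the cache, and reuse the result on every later iteration. This is an illustration under assumed names (`PreprocessCache`, `center_crop`, and the file name are hypothetical), not the cache module from this PR.

```python
import numpy as np


class PreprocessCache:
    """Store the preprocessed result per key so the work happens only once."""

    def __init__(self, preprocess):
        self._preprocess = preprocess
        self._store = {}
        self.misses = 0  # number of times preprocessing actually ran

    def get(self, key, loader):
        if key not in self._store:
            # Slow path: load and preprocess on first access only.
            self.misses += 1
            self._store[key] = self._preprocess(loader())
        return self._store[key]


def center_crop(image, size=4):
    """Stand-in for the expensive work moved into the caching stage."""
    top = (image.shape[0] - size) // 2
    left = (image.shape[1] - size) // 2
    return image[top:top + size, left:left + size].copy()


def load():
    """Stand-in for reading an 8x8 image from disk."""
    return np.arange(64, dtype="uint8").reshape(8, 8)


cache = PreprocessCache(center_crop)
a = cache.get("face_0001.png", load)  # first access: runs center_crop
b = cache.get("face_0001.png", load)  # later accesses: served from the cache
```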

Some numbers:
Test data (1043 A images, 1229 B images). 1000 iterations

- 512px training image size, 64px model output size, 64 batch size: ~65% faster (including caching)
- 684px training image size, 512px model output size, 8 batch size: ~15% faster (including caching)

Summary of updates:

lib.detected_face
- Subclass Masks for Landmark based masks
- Add training mask property + methods to DetectedFace
lib.training_training
- subclass TrainingDataGenerator for training and preview data
- Split cache into own module
- Reduce thread count to 1 to prevent image corruption + data re-use
- Process on largest model input/output size rather than stored image size
- Size and crop masks during caching stage
- Implement ring buffer for data flow
- Fix preview reload bug
augmentation
- typing
- switch color aug order
- better initialization
- Fix warp + landmark warp to correctly apply at different image scales
- Slightly improved warp caching
- Don't store whether image is_preview. Handle all data as training images implicitly
plugins.trainer: Typing and fixes to work with training data refactor
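
As a rough illustration of the ring buffer idea mentioned in the summary, the sketch below preallocates a fixed number of batch arrays and hands them out in rotation, so a batch is only overwritten after the pointer has cycled through every other slot. The `RingBuffer` class, its parameters, and the shapes are assumptions for illustration and may differ from what this PR implements.

```python
import numpy as np


class RingBuffer:
    """Fixed pool of preallocated batch arrays, handed out in rotation."""

    def __init__(self, batch_shape, slots=2, dtype="float32"):
        # Allocate every slot once, up front, so nothing is allocated per iteration.
        self._slots = [np.zeros(batch_shape, dtype=dtype) for _ in range(slots)]
        self._index = 0

    def __call__(self):
        """Return the next slot and advance, wrapping around at the end."""
        slot = self._slots[self._index]
        self._index = (self._index + 1) % len(self._slots)
        return slot


ring = RingBuffer(batch_shape=(8, 64, 64, 3), slots=2)
first = ring()
first[...] = 1.0   # producer fills slot 0
second = ring()
second[...] = 2.0  # producer fills slot 1; slot 0 is still intact
third = ring()     # wraps around: same memory as `first`
```

The number of slots bounds how long a consumer may hold a batch before it is reused, so it has to cover the number of batches that can be in flight at once.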


torzdf merged commit 2beceff into deepfakes:staging Aug 21, 2022

torzdf deleted the aug branch August 21, 2022 18:01

vlccdl mentioned this pull request Aug 22, 2022

tools-manual:ValueError: The truth value of an array with more than one element is ambiguous. Use a.any() or a.all() #1264

Closed

torzdf mentioned this pull request Aug 29, 2022

Legacy face centering is not working correctly #1163

Closed
