
amp & channels_last #50

Closed
xoiga123 opened this issue Jul 8, 2022 · 0 comments · Fixed by #68

xoiga123 commented Jul 8, 2022

channels_last:

While PyTorch operators expect all tensors to be in channels-first (NCHW) dimension order, they support 3 output memory formats:

- Contiguous: tensor memory is in the same order as the tensor's dimensions.
- ChannelsLast: irrespective of the dimension order, the 2d (image) tensor is laid out as an HWC or NHWC (N: batch, H: height, W: width, C: channels) tensor in memory. The dimensions could be permuted in any order.
- ChannelsLast3d: the same idea for 3d (video) tensors, laid out as a THWC or NTHWC tensor in memory.
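A minimal sketch of opting in (a bare `Conv2d` stands in for the real model here); converting both the module and the input is what lets the format propagate through supported ops:

```python
import torch
import torch.nn as nn

# A single conv layer is enough to see the memory format propagate.
conv = nn.Conv2d(3, 16, kernel_size=3, padding=1)

# Convert both the module's weights and the input to channels_last (NHWC layout).
conv = conv.to(memory_format=torch.channels_last)
x = torch.randn(8, 3, 32, 32).to(memory_format=torch.channels_last)

y = conv(x)
# The dimension order is still NCHW; only the underlying strides changed.
print(y.shape)                                             # torch.Size([8, 16, 32, 32])
print(y.is_contiguous(memory_format=torch.channels_last))  # True
```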

amp:
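For reference, the standard `torch.cuda.amp` recipe is roughly the following: `autocast` for the forward pass, `GradScaler` to avoid gradient underflow in the half-precision backward pass. The tiny model, loss, and fake loader below are placeholders just so the sketch runs (CUDA device required):

```python
import torch
import torch.nn as nn

# Minimal stand-ins; any model/optimizer/loader works the same way.
model = nn.Conv2d(3, 16, 3, padding=1).cuda()
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
criterion = nn.MSELoss()
loader = [(torch.randn(4, 3, 32, 32), torch.randn(4, 16, 32, 32))]

scaler = torch.cuda.amp.GradScaler()

for inputs, targets in loader:
    optimizer.zero_grad(set_to_none=True)
    # Run the forward pass in mixed precision.
    with torch.cuda.amp.autocast():
        outputs = model(inputs.cuda(non_blocking=True))
        loss = criterion(outputs, targets.cuda(non_blocking=True))
    scaler.scale(loss).backward()  # backward on the scaled loss
    scaler.step(optimizer)         # unscales grads; skips the step on inf/nan
    scaler.update()                # adjusts the loss scale for the next step
```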

extra:

prefetch:

Going to train from scratch to see what's good, with a working log this time.
UPDATE 12/07/2022: Seems like the bottleneck is in data loading, which takes an unholy amount of time even though I cached everything in RAM. Currently profiling CPU & GPU and trying out this dataloader, which allegedly actually does prefetch.
UPDATE: It all makes sense now: PyTorch's DataLoader only prefetches batches within the current running epoch. For the next epoch, there is apparently no prefetch whatsoever.
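For context on that last point: with `num_workers > 0`, the worker processes are torn down when an epoch's iterator is exhausted and respawned for the next epoch, so the prefetch queue starts cold each time. A sketch of the flags that soften this, assuming a toy `TensorDataset` as a stand-in (`persistent_workers` needs PyTorch >= 1.7):

```python
import torch
from torch.utils.data import DataLoader, TensorDataset

# Toy in-memory dataset; the point here is the loader flags.
dataset = TensorDataset(torch.randn(1024, 3, 32, 32),
                        torch.randint(0, 10, (1024,)))

loader = DataLoader(
    dataset,
    batch_size=64,
    shuffle=True,
    num_workers=4,
    pin_memory=True,          # faster, async host-to-GPU copies
    prefetch_factor=2,        # batches fetched ahead per worker (default 2)
    persistent_workers=True,  # keep workers alive across epochs instead of
                              # tearing them down at every epoch boundary
)
```

Note that `persistent_workers=True` removes the per-epoch worker startup cost; it does not pre-build the next epoch's batches before that epoch's iterator is created, which matches the observation above.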

@xoiga123 xoiga123 added the enhancement New feature or request label Jul 8, 2022
@xoiga123 xoiga123 self-assigned this Jul 8, 2022
@xoiga123 xoiga123 linked a pull request Oct 1, 2022 that will close this issue