reproduce_pytorch_lightning_memory_issues

This repository reproduces memory issues I am seeing with a custom Dataset when using PyTorch Lightning for multi-GPU training with DDP.

When the dataset stores its items as a NumPy array, memory usage grows with the number of data loader workers (num_workers). When the same items are stored as PyTorch tensors instead, memory usage is much lower.

For example, compare these two runs:

python minimal.py --num_workers 10
# Uses around 5GB of RAM
python minimal.py --numpy --num_workers 10
# Same amount of data, but using numpy, now takes >30GB of RAM -> more than 6 times the amount
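
As a rough illustration of the two storage modes being compared, the sketch below builds the same random data once as a NumPy array and once as a PyTorch tensor and wraps it in a DataLoader. It is not the code from minimal.py: the names ToyDataset and make_loader, the data sizes, and the batch size are all hypothetical, and the sketch leaves out PyTorch Lightning and DDP entirely. It only shows where the container type differs; it does not measure memory.

```python
import numpy as np
import torch
from torch.utils.data import DataLoader, Dataset


class ToyDataset(Dataset):
    """Hypothetical stand-in for the dataset in minimal.py (assumption)."""

    def __init__(self, num_items=1000, item_size=128, use_numpy=False):
        data = np.random.rand(num_items, item_size).astype(np.float32)
        # The only difference between the two modes is the container type.
        self.data = data if use_numpy else torch.from_numpy(data)

    def __len__(self):
        return len(self.data)

    def __getitem__(self, idx):
        item = self.data[idx]
        # Return a tensor in both modes so batches look the same downstream.
        if isinstance(item, np.ndarray):
            return torch.from_numpy(item)
        return item


def make_loader(use_numpy, num_workers=0, batch_size=32):
    """Build a DataLoader over the toy data in the chosen storage mode."""
    dataset = ToyDataset(use_numpy=use_numpy)
    return DataLoader(dataset, batch_size=batch_size, num_workers=num_workers)
```

Calling make_loader(use_numpy=True, num_workers=10) and make_loader(use_numpy=False, num_workers=10) while watching process memory would mirror the comparison above at a much smaller scale. Whether the gap shows up outside a DDP run is not something this sketch establishes.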

mpaepper/reproduce_pytorch_lightning_memory_issues

Folders and files

Latest commit

History

Repository files navigation

reproduce_pytorch_lightning_memory_issues

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages