Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Simple fix for memory leak on GPU0 #1349

Merged
merged 9 commits into from
Apr 3, 2020
Merged

Conversation

williamFalcon
Copy link
Contributor

@williamFalcon williamFalcon commented Apr 2, 2020

What does this PR do?

Fixes #958 #1094

PR review

Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in Github issues there's a high chance it will not be merged.

Did you have fun?

Make sure you had fun coding 🙃

@williamFalcon williamFalcon changed the title Shubhamagarwal92 master [WIP] Shubhamagarwal92 master Apr 2, 2020
@mergify mergify bot requested a review from a team April 2, 2020 19:04
@Borda
Copy link
Member

Borda commented Apr 2, 2020

What PR does this replicate?

@williamFalcon
Copy link
Contributor Author

@Borda what do we do about this TRAINs test?

@williamFalcon williamFalcon added the discussion In a discussion stage label Apr 3, 2020
@Borda
Copy link
Member

Borda commented Apr 3, 2020

@Borda what do we do about this TRAINs test?

I do not like the randomness in failing, it is happening eg with some new versions...

@williamFalcon williamFalcon merged commit 16f4cc9 into master Apr 3, 2020
@Borda Borda deleted the shubhamagarwal92-master branch April 3, 2020 22:47
alexeykarnachev pushed a commit to alexeykarnachev/pytorch-lightning that referenced this pull request Apr 4, 2020
* SA: for Lightning-AI#958: set torch cuda device when finding root

* SA: for Lightning-AI#958: removing root gpu hack in trainer/evaluation_loop

* SA: setting torch cuda device

* comment line too long

* check if root gpu exists or available

* Incorporating suggestions on Lightning-AI#1094

* since root gpu returns none instead of -1 for cpu

* undo changes

* fixed dp memory thing

Co-authored-by: Shubham Agarwal <shubhamagarwal92@gmail.com>
@Borda Borda added bug Something isn't working and removed discussion In a discussion stage labels Apr 4, 2020
@Borda Borda added this to the 0.7.2 milestone Apr 4, 2020
@Borda Borda changed the title Shubhamagarwal92 master Simple fix for memory leak on GPU0 Apr 4, 2020
tullie pushed a commit to tullie/pytorch-lightning that referenced this pull request Jun 7, 2020
* SA: for Lightning-AI#958: set torch cuda device when finding root

* SA: for Lightning-AI#958: removing root gpu hack in trainer/evaluation_loop

* SA: setting torch cuda device

* comment line too long

* check if root gpu exists or available

* Incorporating suggestions on Lightning-AI#1094

* since root gpu returns none instead of -1 for cpu

* undo changes

* fixed dp memory thing

Co-authored-by: Shubham Agarwal <shubhamagarwal92@gmail.com>
@Borda Borda modified the milestones: v0.7., v0.7.x Apr 18, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Process runs on more GPUs than specified
3 participants