Bugfix/torchtext include lengths #2689

Merged

Conversation

@thschaaf (Contributor) commented Jul 24, 2020

What does this PR do?

It fixes a bug when using torchtext and torchtext.data.Field with include_lengths=True that arises when transferring data to the GPU.
It adds tests to check that batches created by torchtext with include_lengths=True and include_lengths=False are processed correctly by Trainer.fit().

The fix checks whether the data is a Tensor, tuple, or list before sending it to the device. If it is a tuple or list, it iterates over the elements and sends each one to the device. (The implementation has since changed; it now applies move_data_to_device recursively.)
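
A minimal sketch of the recursive transfer logic described above (illustrative only, not the verbatim PR diff; the actual helper in pytorch_lightning/utilities/apply_func.py differs in detail):

```python
import torch


def move_data_to_device(batch, device):
    """Recursively move tensors in (possibly nested) tuples/lists to ``device``."""
    # Plain tensors are transferred directly.
    if isinstance(batch, torch.Tensor):
        return batch.to(device)
    # Tuples and lists -- e.g. the (data, lengths) pair that
    # torchtext.data.Field(include_lengths=True) stores on a Batch --
    # are traversed element by element, preserving the container type.
    if isinstance(batch, (tuple, list)):
        return type(batch)(move_data_to_device(x, device) for x in batch)
    # Anything else (ints, strings, ...) is returned unchanged.
    return batch
```

Called on the (data, lengths) tuple that include_lengths=True produces, this returns a tuple with both tensors on the target device.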

Fixes #2688

Before submitting

  • Was this discussed/approved via a GitHub issue? (no need for typos and docs improvements)
  • Did you read the contributor guideline, Pull Request section?
  • Did you make sure your PR does only one thing, instead of bundling different changes together? Otherwise, we ask you to create a separate PR for every change.
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?
  • Did you verify new and existing tests pass locally with your changes?
  • If you made a notable change (that affects users), did you update the CHANGELOG?

PR review

Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in GitHub issues, there's a high chance it will not be merged.

Did you have fun?

Make sure you had fun coding 🙃

mergify bot requested a review from a team July 24, 2020 19:38
@pep8speaks commented Jul 24, 2020

Hello @thschaaf! Thanks for updating this PR.

There are currently no PEP 8 issues detected in this Pull Request. Cheers! 🍻

Comment last updated at 2020-07-30 01:53:43 UTC

@awaelchli (Member) commented

@thschaaf Thanks for the PR. How much value do you see in this support? I was told by a torchtext maintainer that they will drop the Batch class from torchtext moving forward.
pytorch/text#861
We recently added support for the Batch class in PL because of a request, but at that time I was not aware of its legacy state.
What do you think about this?

@thschaaf (Contributor, Author) commented

> @thschaaf Thanks for the PR. How much value do you see in this support? I was told by a torchtext maintainer that they will drop the Batch class from torchtext moving forward.
> pytorch/text#861
> We recently added support for the Batch class in PL because of a request, but at that time I was not aware of its legacy state.
> What do you think about this?

@awaelchli Hopefully, when torchtext removes the Batch class, they do it without breaking too much of people's code in a substantial way. There is enough value in supporting this until torchtext actually changes its implementation. In my case the torchtext Batch object is used behind the scenes, and it caused my Skip-Thought model training to fail on GPUs. I am sure others will run into similar issues, and I was quite happy that the change to PyTorch Lightning is quite compact. With this change, RNN training works on a GPU.

What do you think about the added tests? They are agnostic to the underlying Batch class, only making sure that the torchtext.data.Field parameter include_lengths=True is exercised (see the sketch below). They might be useful in the future, even after the code dealing with the torchtext Batch class is removed.
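
A hypothetical sketch of such a check, written against the legacy torchtext.data API that was current at the time (the test name and toy dataset are illustrative, not the PR's actual test code):

```python
import torch
import torchtext


def test_batch_with_include_lengths():
    # Build a tiny dataset through the legacy torchtext.data API.
    text_field = torchtext.data.Field(sequential=True, include_lengths=True)
    fields = [("text", text_field)]
    examples = [
        torchtext.data.Example.fromlist(["a b c"], fields),
        torchtext.data.Example.fromlist(["d e"], fields),
    ]
    dataset = torchtext.data.Dataset(examples, fields)
    text_field.build_vocab(dataset)

    batch = next(iter(torchtext.data.Iterator(dataset, batch_size=2)))
    # With include_lengths=True, batch.text is a (data, lengths) tuple --
    # exactly the nested structure the device-transfer fix must handle.
    data, lengths = batch.text
    assert isinstance(data, torch.Tensor)
    assert isinstance(lengths, torch.Tensor)
```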

The change torchtext is planning seems sensible, but many people may use PyTorch Lightning together with torchtext before it lands. It is not intuitive when a model trains on the CPU but throws such an exception on the GPU.

Of course having the PR changes become part of PL would make my Skip-Thought model train faster and my life easier.

@Borda added the bug (Something isn't working) label Jul 25, 2020
Outdated review threads on CHANGELOG.md and tests/utilities/test_apply_func_torchtext.py were resolved.
@Borda requested review from justusschock, SkafteNicki and a team July 25, 2020 09:00
Co-authored-by: Jirka Borovec <Borda@users.noreply.github.com>
@thschaaf requested review from Borda and removed request for a team July 26, 2020 16:02
mergify bot requested a review from a team July 26, 2020 16:02
Outdated review threads on pytorch_lightning/utilities/apply_func.py and tests/utilities/test_apply_func_torchtext.py were resolved.
mergify bot requested a review from a team July 26, 2020 17:47
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
@thschaaf requested review from awaelchli and removed request for a team July 26, 2020 19:29
mergify bot requested a review from a team July 26, 2020 19:30
@thschaaf (Contributor, Author) commented Jul 29, 2020

ci/circleci: TPU-tests is constantly failing with ERROR: (gcloud.auth.activate-service-account) Could not read json file /home/circleci/gcloud-service-key.json: No JSON object could be decoded.
This seems unrelated to the code changes of this PR. How can that be fixed?

> the issue is that somehow it is running in your CircleCI env, which is missing the GKE credentials...

@Borda Thanks! For some reason, which I don't remember exactly, I was following the pytorch-lightning project. Simply unfollowing the project did the trick.

@thschaaf changed the title Bugfix/torchtext include lengths [wip] Bugfix/torchtext include lengths Jul 29, 2020
@thschaaf changed the title [wip] Bugfix/torchtext include lengths Bugfix/torchtext include lengths Jul 29, 2020
@thschaaf changed the title Bugfix/torchtext include lengths [wip] Bugfix/torchtext include lengths Jul 29, 2020
@thschaaf changed the title [wip] Bugfix/torchtext include lengths Bugfix/torchtext include lengths Jul 29, 2020
@awaelchli (Member) left a comment

looks good, thanks for the fix @thschaaf

tests/utilities/test_apply_func_torchtext.py (review thread resolved)
mergify bot requested a review from a team July 29, 2020 19:17
mergify bot requested a review from a team July 29, 2020 19:31
tests/utilities/test_apply_func_torchtext.py (outdated review thread, resolved)
mergify bot requested a review from a team July 29, 2020 21:49
@Borda added the ready (PRs ready to be merged) label Jul 29, 2020
mergify bot commented Jul 29, 2020

This pull request is now in conflict... :(

@williamFalcon merged commit a6719f0 into Lightning-AI:master Jul 31, 2020
Labels: bug (Something isn't working), ready (PRs ready to be merged)

Successfully merging this pull request may close these issues:

Training on GPU failed with Torchtext when using include_lengths=True in torchtext.data.Field (#2688)