Fixing tests #936

Merged 10 commits into master on Feb 25, 2020
Conversation

@Borda Borda (Member) commented Feb 25, 2020

What does this PR do?

Fixes #918; some changes were not properly propagated.

PR review

Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in GitHub issues, there's a high chance it will not be merged.

Did you have fun?

Make sure you had fun coding 🙃

@Borda Borda added the bug (Something isn't working) and need fix labels Feb 25, 2020
@Borda Borda added the priority: 0 (High priority task) label Feb 25, 2020
@williamFalcon (Contributor)

@Borda wait for #938.

This PR makes many more changes than just fixing the tests. Please factor out into other PRs so we see the changes clearly.

This PR breaks a lot of GPU tests

@@ -177,14 +177,19 @@ def reset_train_dataloader(self, model):
         self.is_iterable_train_dataloader = (
             EXIST_ITER_DATASET and isinstance(self.train_dataloader.dataset, IterableDataset)
         )
-        if self.is_iterable_train_dataloader and not isinstance(self.val_check_interval, int):
+        if self.is_iterable_dataloader(self.train_dataloader) and not isinstance(self.val_check_interval, int):
Contributor:
This seems wrong. It needs to be self.is_iterable_train_dataloader, as defined on line 177.

Contributor:

Or remove line 177.

Borda (Member Author):

And what if you train with another dataloader? This checks whether it was updated...
https://github.com/PyTorchLightning/pytorch-lightning/blob/master/pytorch_lightning/trainer/trainer.py#L857

Contributor:

I don't understand what you mean.

Borda (Member Author):

At what point is/was it determined whether the train dataset is iterable?

Borda (Member Author):

OK, found it, and it would be fine as it was... shall I revert it?
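
For readers following the thread, here is a minimal sketch of the two variants being debated. The names is_iterable_train_dataloader and is_iterable_dataloader come from the diff above; the simplified trainer class around them is assumed purely for illustration and is not the actual pytorch-lightning Trainer:

from torch.utils.data import IterableDataset

class TrainerSketch:
    # illustrative stand-in, not the real Trainer

    def is_iterable_dataloader(self, dataloader):
        # helper variant (the new line in the diff): re-inspects whichever
        # dataloader it is given, every time it is called
        return isinstance(dataloader.dataset, IterableDataset)

    def reset_train_dataloader(self, model):
        self.train_dataloader = model.train_dataloader()
        # attribute variant (the line "defined on 177"): the result is cached
        # once, at the moment the train dataloader is (re)set
        self.is_iterable_train_dataloader = self.is_iterable_dataloader(self.train_dataloader)

Whether to read the cached attribute or call the helper again on the current dataloader is exactly the question raised in the comments above.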

@williamFalcon (Contributor)

@Borda the refactors are nice, so maybe this PR is about refactoring?

@Borda Borda changed the title from "Fixing tests" to "[blocked by #938] Fixing tests" Feb 25, 2020
@Borda Borda (Member Author) commented Feb 25, 2020

@Borda the refactors are nice, so maybe this PR is about refactoring?

Sure, I wanted to do just a simple cleanup in #934, but I found that it was crashing, so I opened this to try to fix it... I see that it is still failing, so this is WIP and shall be done/merged after #938.

         # when testing make sure user defined a test step
-        if test and not (self.is_overriden('test_step') or self.is_overriden('test_end')):
+        if test_mode and not (self.is_overriden('test_step') or self.is_overriden('test_end')):
Contributor:

We should make sure, after we merge #938 into master, that we update the PR here so that auto-merge doesn't reset this line again.

@Borda Borda removed the priority: 0 (High priority task) label Feb 25, 2020
@williamFalcon (Contributor)

@Borda unblocked. Rebase so we can merge?

@Borda Borda (Member Author) commented Feb 25, 2020

There was something wrong; your tests were passing just by caching luck...
I do not have a very good feeling about prepare_data, since the data is not available until you bind the model to the trainer... nasty fix in the Template 🤖

@Borda Borda changed the title from "[blocked by #938] Fixing tests" to "Fixing tests" Feb 25, 2020
@williamFalcon (Contributor)

@Borda the tests passed on GPUs without any caching...
prepare_data is needed for TPU/DDP; there were a lot of issues without it.

@Borda Borda (Member Author) commented Feb 25, 2020

The tests are set up to use the same MNIST cache; once you download it, it is used for all tests...
You can simply check this by running just tests.test_trainer.test_trainer_max_steps_and_epochs on a blank repo; if you run all tests, some other test like tests.test_trainer.test_resume_from_checkpoint_epoch_restored does the download first...

@williamFalcon (Contributor)

prepare_data goes beyond tests... it's used for DDP and TPU.
If in the tests we happen to cache stuff, that's great, but the overall framework needs prepare_data.

@Borda Borda (Member Author) commented Feb 25, 2020

prepare_data goes beyond tests... it's used for DDP and TPU.
If in the tests we happen to cache stuff that's great. But the overall framework needs prepare_data

It seems that we are talking about slightly different things...

  • shall we agree that tests shall always pass, regardless of the order they run in
  • and that each test shall pass when run on a blank system (except under specific running conditions)?

The case here was that tests.test_trainer.test_trainer_max_steps_and_epochs created a model instance from a Template and then asked for info about the dataset before the model was bound to the trainer... the template method __dataloader then asked for the MNIST dataset without downloading it, and the dataset does not exist in a clean environment because this is its first usage (note that the other test models use our shortened TestingMNIST dataset, which has a different cache folder).

I'm really surprised that it was passing on your side lol :]
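
To make the ordering issue concrete, here is a minimal sketch with assumed paths and class names; only MNIST, prepare_data, and the download-once pattern come from the discussion itself, the rest is illustrative:

from torch.utils.data import DataLoader
from torchvision import transforms
from torchvision.datasets import MNIST

class TemplateSketch:
    # illustrative stand-in for the Template model discussed above

    def prepare_data(self):
        # the trainer calls this once the model is bound to it;
        # this is the only place the download happens
        MNIST('./datasets', train=True, download=True)

    def train_dataloader(self):
        # download=False: relies on prepare_data having already run, so calling
        # this on a blank machine before fitting raises "Dataset not found"
        dataset = MNIST('./datasets', train=True, download=False,
                        transform=transforms.ToTensor())
        return DataLoader(dataset, batch_size=32)

A test that builds such a model and asks for its dataloader before the trainer has run prepare_data therefore only passes when an earlier test has already left MNIST in the cache folder.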

@williamFalcon (Contributor)

sure.

@Borda Borda requested a review from a team February 25, 2020 15:56
@Borda Borda added the ready (PRs ready to be merged) label and removed the need fix label Feb 25, 2020
@williamFalcon williamFalcon merged commit 5dd2afe into Lightning-AI:master Feb 25, 2020
@Borda Borda deleted the fixing branch February 25, 2020 18:13
tullie pushed a commit to tullie/pytorch-lightning that referenced this pull request Apr 3, 2020
* abs import
* rename test model
* update trainer
* revert test_step check
* move tags
* fix test_step
* clean tests
* fix template
* update dataset path
* fix parent order