Fix bugs in scale_batch_size #2523

Closed · wants to merge 5 commits into from

Conversation

@ghost ghost commented Jul 6, 2020

Fixes #2484

  • Set model.batch_size from model.hparams.batch_size if it is not already defined
  • Changed init_val for batch_size to 0 so that the search starts from the user-defined batch_size instead of always starting at 2

@pep8speaks

pep8speaks commented Jul 6, 2020

Hello @x2-l! Thanks for updating this PR.

There are currently no PEP 8 issues detected in this Pull Request. Cheers! 🍻

Comment last updated at 2020-07-06 13:07:55 UTC

@mergify mergify bot requested a review from a team July 6, 2020 05:58
@codecov

codecov bot commented Jul 6, 2020

Codecov Report

Merging #2523 into master will increase coverage by 1%.
The diff coverage is 33%.

@@          Coverage Diff           @@
##           master   #2523   +/-   ##
======================================
+ Coverage      88%     89%   +1%     
======================================
  Files          69      69           
  Lines        5628    5641   +13     
======================================
+ Hits         4963    5017   +54     
+ Misses        665     624   -41     

Member

@Borda Borda left a comment

LGTM
@SkafteNicki mind check?

@Borda Borda added the bug Something isn't working label Jul 6, 2020
@mergify mergify bot requested a review from a team July 6, 2020 07:26
@SkafteNicki
Member

I am not sure this will solve your problem. Since only model.batch_size gets updated during the scaling, if you have defined your dataloaders as DataLoader(dataset, batch_size=self.hparams.batch_size), this will have no effect.
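
For context, the pattern being described looks roughly like the following LightningModule sketch (illustrative only; the class, the train_dataset attribute, and the hparams are assumptions, not code from this PR):

import pytorch_lightning as pl
from torch.utils.data import DataLoader

class MyModel(pl.LightningModule):
    def train_dataloader(self):
        # batch_size is captured when the loader is constructed, so later
        # changes to model.batch_size alone do not affect this loader
        return DataLoader(self.train_dataset, batch_size=self.hparams.batch_size)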

@ghost
Author

ghost commented Jul 6, 2020

This should be handled by the user, no? It's the same as enabling auto_lr_find=True while the user sets Optimizer(self.parameters(), lr=self.hparams.learning_rate). Or maybe scale_batch_size should consider overwriting DataLoader.batch_size when it is called?

@justusschock
Member

Maybe we also need to check if self.hparams is a mapping, because then you can't use setattr.

@SkafteNicki
Member

@x2-l DataLoader.batch_size cannot be overwritten after the loader is initialized; I already tried that because it would be the easiest solution.
@justusschock agreed. In PR #1998 I have implemented three counterparts to setattr, hasattr and getattr that search for the attribute in this order:

  1. attribute of the model, i.e. model.batch_size
  2. key of model.hparams, i.e. model.hparams['batch_size']
  3. attribute of model.hparams, i.e. model.hparams.batch_size
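
A minimal sketch of the lookup order just described (illustrative only, not the actual implementation from PR #1998; the function name and the Mapping handling are assumptions):

from collections.abc import Mapping
from typing import Any

def sketch_lightning_setattr(model: Any, name: str, value: Any) -> None:
    # 1. attribute of the model itself, e.g. model.batch_size
    if hasattr(model, name):
        setattr(model, name, value)
    # 2. key of model.hparams when hparams is dict-like, e.g. model.hparams['batch_size']
    elif hasattr(model, 'hparams') and isinstance(model.hparams, Mapping) and name in model.hparams:
        model.hparams[name] = value
    # 3. attribute of model.hparams, e.g. model.hparams.batch_size
    elif hasattr(model, 'hparams') and hasattr(model.hparams, name):
        setattr(model.hparams, name, value)
    else:
        raise AttributeError(f'{name} not found on the model or in model.hparams')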

@ghost
Author

ghost commented Jul 6, 2020

@SkafteNicki what about reinitializing the DataLoader with the new batch_size, like:

def replace_batch_size(self, dataloader, batch_size):
    # collect the loader's public attributes, which mirror its constructor
    # arguments, skipping the batch size we want to replace
    skip_keys = ['batch_size']
    dl_args = {
        k: v for k, v in dataloader.__dict__.items()
        if not k.startswith('_') and k not in skip_keys
    }

    # re-instantiate the same DataLoader class with the new batch size
    dl_args['batch_size'] = batch_size
    dataloader = type(dataloader)(**dl_args)
    return dataloader

@SkafteNicki
Member

@x2-l agree that should work

@awaelchli
Member

awaelchli commented Jul 6, 2020

@x2-l I think there is already code like this for replacing the sampler/shuffle attributes. One could factor this out and create a general function
replace_dataloader_attr(dataloader: DataLoader, name: str, new_value: Any) like you proposed and avoid duplicated code.
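
A minimal sketch of such a general helper, building on the re-instantiation idea above (illustrative only; the argument whitelist and the omission of sampler handling are assumptions, not existing Lightning code):

from typing import Any
from torch.utils.data import DataLoader

def replace_dataloader_attr(dataloader: DataLoader, name: str, new_value: Any) -> DataLoader:
    # attributes that map one-to-one onto DataLoader.__init__ arguments and do
    # not conflict with each other
    keep = ('dataset', 'batch_size', 'num_workers', 'collate_fn',
            'pin_memory', 'drop_last', 'timeout', 'worker_init_fn')
    dl_args = {k: getattr(dataloader, k) for k in keep if hasattr(dataloader, k)}
    dl_args[name] = new_value
    # sampler/batch_sampler handling is intentionally omitted here; a real
    # implementation must treat those as mutually exclusive with batch_size and shuffle
    return type(dataloader)(**dl_args)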

@Borda
Member

Borda commented Jul 6, 2020

I agree that this is quite a monkey patch; it would be better to support 'hparams.batch_size' as the batch_arg_name.

@ghost
Author

ghost commented Jul 6, 2020

@SkafteNicki @Borda should I still update this, or can it be updated together with PR #1998?

@ghost
Author

ghost commented Jul 6, 2020

On second thought, why not just raise a warning that model.batch_size will be used when both model.batch_size and model.hparams.batch_size are set? If only model.hparams.batch_size is set, the batch_size attribute will be created on the model.

The scale_batch_size function would then update both parameters so that they stay consistent no matter which one is used in the rest of the code. The same can be done for the learning rate. When these functions are called, the original values are just initial values that are replaced after the search. This also avoids the undesired scenario where self.hparams.batch_size and self.hparams.lr are used to define the dataloaders and optimizers but are never updated with the searched values.

@Borda @SkafteNicki @awaelchli thoughts?
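
A minimal sketch of the behaviour proposed above, using plain warnings.warn (illustrative only; the helper name is made up, and attribute-style hparams are assumed, so dict-style hparams would need the mapping-aware lookup sketched earlier):

import warnings

def sync_batch_arg(model, new_value, batch_arg_name='batch_size'):
    on_model = hasattr(model, batch_arg_name)
    on_hparams = hasattr(model, 'hparams') and hasattr(model.hparams, batch_arg_name)

    if on_model and on_hparams:
        warnings.warn(
            f'Both model.{batch_arg_name} and model.hparams.{batch_arg_name} are set; '
            f'model.{batch_arg_name} is used as the initial value for the search.'
        )

    # write the searched value back to both locations so the rest of the
    # user code sees a consistent value
    setattr(model, batch_arg_name, new_value)
    if on_hparams:
        setattr(model.hparams, batch_arg_name, new_value)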

@justusschock
Member

While I see your point, we should be careful about that, e.g. when you re-instantiate your model and overwrite the wrong parameter, you introduce unwanted behaviour somewhere else.

I think I'd prefer not to implicitly introduce competing new arguments if it's not necessary.

@Borda
Member

Borda commented Jul 6, 2020

I would say, do this patch/fix here because it may take some time before PR #1998 lands...
Also, this may go into a 0.8.x release, whereas PR #1998 is aimed at 0.9.

@Borda Borda added the discussion In a discussion stage label Jul 6, 2020
Enable scaling of batch size using hparams and remove setting the attribute on the model. If both are specified, a warning is raised.
@ghost
Author

ghost commented Jul 6, 2020

I enabled scaling of the batch size using model.hparams.batch_arg_name and removed setting the batch_arg_name attribute on the model. If both are specified, a warning is raised telling the user that model.batch_arg_name will be used as the initial value for the search; if that is undesired, the user can stop and change things accordingly. I think that is enough for the bug fix without introducing anything new?

@Borda Borda changed the title Fix bugs in scale_batch_size [blocked by #2223] Fix bugs in scale_batch_size Jul 15, 2020
@Borda
Member

Borda commented Jul 15, 2020

It seems to take parts from #2223, so let's merge this after it...

@mdgoldberg

Hello! Will this and/or #2223 also fix auto_lr_find? It has the same issue.

@mergify
Contributor

mergify bot commented Jul 29, 2020

This pull request is now in conflict... :(

@Borda Borda changed the title [blocked by #2223] Fix bugs in scale_batch_size Fix bugs in scale_batch_size Jul 29, 2020
@Borda
Member

Borda commented Jul 29, 2020

@ghost mind finishing this one, as #2223 has been merged...

@Borda Borda added this to the 0.9.0 milestone Aug 6, 2020
@awaelchli
Member

Hey! Recently a similar PR/issue was fixed by @SkafteNicki in #2821. Could we use the lightning_attr API he added there to implement these checks here? They look very similar.
We also need to keep compatibility with datamodules in mind, but perhaps that should be done in a different PR :)

@SkafteNicki
Member

Agree with @awaelchli that the first issue addressed in this PR is the same issue the learning rate finder had. If you could exchange all hasattr, getattr, setattr calls with the corresponding functions from this file https://github.com/PyTorchLightning/pytorch-lightning/blob/master/pytorch_lightning/utilities/parsing.py#L145-L195, that would be great :)
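
For illustration, the swap might look roughly like this (a sketch only; the wrapper function and its arguments are made up, and the exact helper signatures should be confirmed against the linked parsing.py):

from pytorch_lightning.utilities.parsing import (
    lightning_hasattr,
    lightning_getattr,
    lightning_setattr,
)

def read_and_store_batch_size(model, batch_arg_name='batch_size', new_size=None):
    # unlike plain hasattr/getattr/setattr, these helpers also look at
    # model.hparams['batch_size'] and model.hparams.batch_size
    if not lightning_hasattr(model, batch_arg_name):
        raise ValueError(f'model has no attribute or hparam named {batch_arg_name!r}')
    current = lightning_getattr(model, batch_arg_name)
    if new_size is not None:
        lightning_setattr(model, batch_arg_name, new_size)
    return current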

@awaelchli
Member

It looks like this user and their repository completely vanished from github. Should we copy these changes over into a new PR and continue as we discussed?

@Borda
Member

Borda commented Aug 17, 2020

It looks like this user and their repository completely vanished from github. Should we copy these changes over into a new PR and continue as we discussed?

he behaves as his nick says :D
I would say, check whether we can push/rebase this PR; if yes, let's continue. If not, let's make the same branch in our repo, merge this PR into the new branch, and continue there...

@awaelchli awaelchli changed the base branch from master to bugfix/batch-size-scaler-attr August 19, 2020 10:41
williamFalcon added a commit that referenced this pull request Aug 19, 2020
…2523) (#3043)

* lightning attr fix

* revert refactor

* create test

* separate test

* changelog update

* tests

* revert

* Update pytorch_lightning/trainer/training_tricks.py

Co-authored-by: William Falcon <waf2107@columbia.edu>
@edenlightning edenlightning modified the milestones: 0.9.0, 0.9.x Aug 20, 2020
@Borda Borda closed this Aug 20, 2020
@Borda Borda modified the milestones: 0.9.x, 0.9.0 Aug 20, 2020
Labels
bug (Something isn't working) · discussion (In a discussion stage)
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Trainer.scale_batch_size requires model.batch_size instead of model.hparams.batch_size
7 participants