
[WIP] Hypertuner class #3160

Closed
SkafteNicki wants to merge 9 commits

Conversation

SkafteNicki
Member

What does this PR do?

This is a redo of PR #1998. The last PR was too big to land because it was a major refactoring, so I will try to split it into more manageable pieces.

This PR is basically how the core structure of the HyperTuner class should look. It does not implement any functionality yet. After this, I plan on 3 follow-up PRs, one for each of the features that the HyperTuner class will include (a rough usage sketch follows the list):

  • LR finder (deprecate from trainer, add more tests, get it working in DDP mode, add support for DataModule)
  • Batch size finder (deprecate from trainer, somewhat stable at this point, add support for DataModule)
  • New: num-workers searcher (Let's add a suggested_num_workers() method? #2196) (already have some code, just need to polish it for the new API)
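
For illustration only, here is a minimal sketch of how the proposed interface might be used. The import path, the class constructor, and the method names (lr_find, scale_batch_size, num_workers_search) are assumptions based on the three features listed above, not a settled API, and LitModel is a hypothetical stand-in for any LightningModule:

```python
from pytorch_lightning import Trainer

# Hypothetical sketch: the HyperTuner class and the import path below are
# assumptions based on the feature list above, not the final API.
from pytorch_lightning.tuner import HyperTuner  # assumed import path

model = LitModel()        # any LightningModule exposing `lr` and `batch_size`
trainer = Trainer(max_epochs=10)

tuner = HyperTuner(trainer)
lr_finder = tuner.lr_find(model)                # learning-rate finder
new_batch_size = tuner.scale_batch_size(model)  # batch-size finder
num_workers = tuner.num_workers_search(model)   # proposed num-workers searcher (#2196)
```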

Tagging for input: @Borda , @awaelchli , @justusschock

Question: We have informed users that no more API changes would happen from v0.9 until v1.0. This is a somewhat big API change, so to keep that promise, I see two options:

  • Wait with this until after v1.0
  • Let the HyperTuner class be a feature of bolts, and deprecate the lr finder and batch scaler in lightning

Before submitting

  • Was this discussed/approved via a GitHub issue? (no need for typos and docs improvements)
  • Did you read the contributor guideline, Pull Request section?
  • Did you make sure your PR does only one thing, instead of bundling different changes together? Otherwise, we ask you to create a separate PR for every change.
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?
  • Did you verify new and existing tests pass locally with your changes?
  • If you made a notable change (that affects users), did you update the CHANGELOG?

PR review

Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in GitHub issues, there's a high chance it will not be merged.

Did you have fun?

Make sure you had fun coding 🙃

@mergify mergify bot requested a review from a team August 25, 2020 14:56
@williamFalcon
Contributor

williamFalcon commented Aug 25, 2020

@SkafteNicki please wait a week... we need to finish refactors.

We can decide then whether it goes in before or after 0.9, and whether it stays here or in bolts.

@SkafteNicki
Member Author

@williamFalcon completely fine with me, whatever makes most sense for the project :]

@Borda Borda requested review from Borda and awaelchli August 25, 2020 15:50
@Borda Borda added this to the 0.9.x milestone Aug 25, 2020
@Borda Borda added the feature Is an improvement or enhancement label Aug 25, 2020
@rohitgr7
Contributor

@SkafteNicki does it support user-defined callbacks when calling lr_find or auto_scale_batch_size? As of now, it drops them.
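
For context, a minimal sketch of the scenario being asked about, assuming the 0.9-era API where lr_find and scale_batch_size are Trainer methods; LitModel is a hypothetical stand-in and the calls below are not verified against this PR's branch:

```python
from pytorch_lightning import Callback, Trainer

class MyCallback(Callback):
    """User-defined callback; the question is whether its hooks fire during tuning."""
    def on_train_start(self, trainer, pl_module):
        print("on_train_start called")

model = LitModel()  # hypothetical stand-in for any LightningModule
trainer = Trainer(callbacks=[MyCallback()])

# The tuning routines run their own short fit loops; per the comment above,
# user callbacks passed to the Trainer are currently dropped in those loops.
lr_finder = trainer.lr_find(model)           # 0.9-era learning-rate finder
new_size = trainer.scale_batch_size(model)   # 0.9-era batch-size scaler
```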

@SkafteNicki
Member Author

Based on the discussion, the tuner class is being dropped for now; instead, we will refactor the tuner methods into a trainer.tune() method. Closing this.

@SkafteNicki SkafteNicki closed this Sep 1, 2020
@Borda
Member

Borda commented Sep 1, 2020

Based on the discussion, the tuner class is being dropped for now; instead, we will refactor the tuner methods into a trainer.tune() method. Closing this.

Anyway, GREAT work on this PR! ❤️

@jcsagar

jcsagar commented Sep 7, 2020

So with the new DataModule, is there no way to tune batch_size? I was really excited by the HyperTuner class in this thread, so I'm a little bummed this is closed for now.

For my current workflow, I tried trainer.tune(lightningmodule, datamodule) with auto_scale_batch_size. It tries to scale the batch size, (incorrectly) succeeds on every attempt, and ends up at a batch size of 2^26 (after 25 trials), which it then tries to fit with and fails. I looked into why this was happening and realized that the datamodule isn't actually looked at by the scale_batch_size function in training_tricks.py.
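
Roughly the workflow described above, as a hedged sketch: LitModel and LitData are hypothetical stand-ins (the datamodule owns the dataloaders and exposes a batch_size attribute), and the Trainer flags follow the 0.9-era API:

```python
from pytorch_lightning import Trainer

model = LitModel()           # hypothetical LightningModule
dm = LitData(batch_size=32)  # hypothetical LightningDataModule owning the dataloaders

trainer = Trainer(auto_scale_batch_size=True)
trainer.tune(model, datamodule=dm)
# Before the fixes in #3266 / #3271, scale_batch_size() in training_tricks.py
# ignored the datamodule, so every trial "succeeded" and the search climbed to
# a batch size of 2**26 before the actual fit failed.
trainer.fit(model, datamodule=dm)
```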

@awaelchli
Member

@jcsagar Fixed in #3266 and #3271

@jcsagar

jcsagar commented Sep 7, 2020

@awaelchli Updated to master and it works well. Great stuff, thanks!!!

Labels: feature, refactor