
Save exclude_from_weight_decay in config for LAMB #2619

Merged
merged 1 commit into from
Dec 14, 2021

Conversation

leondgarse
Contributor

Before:

```py
import tensorflow_addons as tfa
from tensorflow import keras

mm = keras.models.Sequential([keras.layers.Input([32, 32, 3]), keras.layers.Flatten(), keras.layers.Dense(10)])
mm.compile(optimizer=tfa.optimizers.LAMB(learning_rate=0.1, weight_decay_rate=0.02, exclude_from_weight_decay=['/gamma', '/beta']))
mm.save('aa.h5')
bb = keras.models.load_model('aa.h5')
print(bb.optimizer.exclude_from_weight_decay, bb.optimizer.exclude_from_layer_adaptation)
# None None
```

After:

```py
import tensorflow_addons as tfa
from tensorflow import keras

mm = keras.models.Sequential([keras.layers.Input([32, 32, 3]), keras.layers.Flatten(), keras.layers.Dense(10)])
mm.compile(optimizer=tfa.optimizers.LAMB(learning_rate=0.1, weight_decay_rate=0.02, exclude_from_weight_decay=['/gamma', '/beta']))
mm.save('aa.h5')
bb = keras.models.load_model('aa.h5')
print(bb.optimizer.exclude_from_weight_decay, bb.optimizer.exclude_from_layer_adaptation)
# ['/gamma', '/beta'] ['/gamma', '/beta']
```
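The underlying issue is a standard Keras serialization pattern: anything not returned by the optimizer's `get_config()` is lost on `model.save()` / `load_model()`. The following is a minimal illustrative sketch of that pattern (not the actual tensorflow_addons source; `LambLike` is a hypothetical stand-in class):

```python
# Illustrative sketch, assuming a LAMB-like optimizer: the fix amounts to
# including exclude_from_weight_decay / exclude_from_layer_adaptation in
# get_config(), so a config roundtrip (what save/load does) restores them.
class LambLike:
    def __init__(self, learning_rate=0.001, weight_decay_rate=0.0,
                 exclude_from_weight_decay=None,
                 exclude_from_layer_adaptation=None):
        self.learning_rate = learning_rate
        self.weight_decay_rate = weight_decay_rate
        self.exclude_from_weight_decay = exclude_from_weight_decay
        # LAMB falls back to the weight-decay exclude list when no
        # separate layer-adaptation list is given.
        self.exclude_from_layer_adaptation = (
            exclude_from_layer_adaptation
            if exclude_from_layer_adaptation is not None
            else exclude_from_weight_decay)

    def get_config(self):
        # Before the fix, the two exclude_* entries were missing here,
        # so a reloaded model saw them as None.
        return {
            "learning_rate": self.learning_rate,
            "weight_decay_rate": self.weight_decay_rate,
            "exclude_from_weight_decay": self.exclude_from_weight_decay,
            "exclude_from_layer_adaptation": self.exclude_from_layer_adaptation,
        }

    @classmethod
    def from_config(cls, config):
        return cls(**config)


opt = LambLike(learning_rate=0.1, weight_decay_rate=0.02,
               exclude_from_weight_decay=["/gamma", "/beta"])
restored = LambLike.from_config(opt.get_config())
print(restored.exclude_from_weight_decay)  # ['/gamma', '/beta']
```

With the exclude lists in the config dict, the `from_config` roundtrip reproduces the optimizer exactly, which is what `keras.models.load_model` relies on.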

Description

Brief Description of the PR:

Saves `exclude_from_weight_decay` (and the derived `exclude_from_layer_adaptation`) in the LAMB optimizer's config so they survive `model.save()` and `keras.models.load_model()`.

Fixes # (issue)

Type of change

Checklist:

  • I've properly formatted my code according to the guidelines
    • By running Black + Flake8
    • By running pre-commit hooks
  • This PR addresses an already submitted issue for TensorFlow Addons
  • I have made corresponding changes to the documentation
  • I have added tests that prove my fix is effective or that my feature works
  • This PR contains modifications to C++ custom-ops

How Has This Been Tested?

If you're adding a bugfix or new feature please describe the tests that you ran to verify your changes:

Before:
```py
import tensorflow_addons as tfa
from tensorflow import keras

mm = keras.models.Sequential([keras.layers.Input([32, 32, 3]), keras.layers.Flatten(), keras.layers.Dense(10)])
mm.compile(optimizer=tfa.optimizers.LAMB(learning_rate=0.1, weight_decay_rate=0.02, exclude_from_weight_decay=['/gamma', '/beta']))
mm.save('aa.h5')
bb = keras.models.load_model('aa.h5')
print(bb.optimizer.exclude_from_weight_decay, bb.optimizer.exclude_from_layer_adaptation)
# None None
```
After:
```py
import tensorflow_addons as tfa
from tensorflow import keras

mm = keras.models.Sequential([keras.layers.Input([32, 32, 3]), keras.layers.Flatten(), keras.layers.Dense(10)])
mm.compile(optimizer=tfa.optimizers.LAMB(learning_rate=0.1, weight_decay_rate=0.02, exclude_from_weight_decay=['/gamma', '/beta']))
mm.save('aa.h5')
bb = keras.models.load_model('aa.h5')
print(bb.optimizer.exclude_from_weight_decay, bb.optimizer.exclude_from_layer_adaptation)
# ['/gamma', '/beta'] ['/gamma', '/beta']
```
@bot-of-gabrieldemarmiesse

@junjiek

You are an owner of some files modified in this pull request.
Would you kindly review the changes whenever you have the time?
Thank you very much.

@fsx950223 fsx950223 merged commit b303f23 into tensorflow:master Dec 14, 2021
4 participants