Check differentiability of custom loss function before training #17753

Conversation

Frightera
Contributor

Feature request was made in keras-team/tf-keras#52.

A very common mistake is for users to define custom loss functions or classes that are not differentiable, which leads to None gradients during fitting. This check makes the problem much easier for users to interpret.

Example usage:

import tensorflow as tf

# Toy data (would be passed to model.fit once compile succeeds).
X = tf.constant([-7.0, -4.0, -1.0])
y = tf.constant([3.0, 6.0, 9.0])

model = tf.keras.Sequential([
    tf.keras.layers.Dense(1, input_shape=(1,))
])

# tf.round is registered as non-differentiable in TensorFlow, so gradients
# through this loss come back as None.
class customloss(tf.keras.losses.Loss):
    def call(self, y_true, y_pred):
        return tf.round(tf.square(y_true - y_pred))

model.compile(loss=customloss(),
              optimizer="adam",
              metrics=["mae"])

Raises:

ValueError: The provided loss function (<__main__.customloss object at 0x7f49c5dffeb0>) is not differentiable.
Training requires a differentiable loss function. Please review your loss function or consider using a standard
differentiable loss function. You can disable the differentiability check by setting
'experimental_check_loss_differentiability=False' in 'model.compile()'.

You can see other usage examples in this gist.
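For context, the check boils down to a dry run of the loss on dummy tensors to see whether gradients come back as None. A minimal sketch of that idea (illustrative only; check_loss_differentiability is a hypothetical helper name, not the implementation in this PR):

import tensorflow as tf

def check_loss_differentiability(loss_fn, output_shape=(2, 1)):
    # Dry-run the loss on dummy tensors and check whether the gradient
    # with respect to the predictions is None.
    y_true = tf.zeros(output_shape)
    y_pred = tf.Variable(tf.zeros(output_shape))
    with tf.GradientTape() as tape:
        loss_value = loss_fn(y_true, y_pred)
    return tape.gradient(loss_value, y_pred) is not None

# check_loss_differentiability(customloss())                        -> False
# check_loss_differentiability(tf.keras.losses.MeanSquaredError())  -> True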

@gbaned gbaned added this to Assigned Reviewer in PR Queue via automation Apr 4, 2023
@gbaned gbaned requested a review from qlzh727 April 4, 2023 05:41
@google-ml-butler google-ml-butler bot added the keras-team-review-pending (Pending review by a Keras team member) label Apr 4, 2023
@qlzh727 qlzh727 requested a review from fchollet April 4, 2023 18:07
@haifeng-jin
Contributor

Will continue the discussion on issue keras-team/tf-keras#52 to re-scope this PR.

@haifeng-jin haifeng-jin removed the keras-team-review-pending (Pending review by a Keras team member) label Apr 6, 2023
@Frightera
Contributor Author

@haifeng-jin Can you take a look at it again? This now supports custom layers as well.

While checking custom layers (which can raise arbitrary errors during the check), it uses nested try-except blocks, which may not be best practice.
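For reference, the probe looks roughly like this (an illustrative sketch only; probe_gradients_through_model and its arguments are hypothetical, not the code in this PR):

import tensorflow as tf

def probe_gradients_through_model(model, loss_fn, dummy_input):
    try:
        with tf.GradientTape() as tape:
            y_pred = model(dummy_input, training=True)
            try:
                loss_value = loss_fn(tf.zeros_like(y_pred), y_pred)
            except Exception:
                return None  # loss failed on dummy data; skip the check
        grads = tape.gradient(loss_value, model.trainable_variables)
        return all(g is not None for g in grads)
    except Exception:
        return None  # a custom layer raised during the dry run; skip the check

Returning None instead of raising keeps model.compile() working whenever the dry run itself cannot be completed, so the check only ever reports a definite pass or fail.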

@gbaned gbaned requested a review from haifeng-jin May 4, 2023 08:23
@gbaned gbaned added the keras-team-review-pending (Pending review by a Keras team member) label May 4, 2023
@divyashreepathihalli divyashreepathihalli removed the keras-team-review-pending (Pending review by a Keras team member) label May 5, 2023
@Frightera
Contributor Author

Hi @haifeng-jin,

Is there an update on this? Every couple of weeks I see questions on Stack Overflow from users new to Keras who try to use a non-differentiable loss function.

@haifeng-jin
Contributor

Need a review from @qlzh727, since I do not have enough knowledge to review this PR.

@haifeng-jin haifeng-jin assigned qlzh727 and unassigned haifeng-jin May 17, 2023
@sachinprasadhs
Collaborator

Hello, thank you for submitting a pull request.

We're currently in the process of migrating the new Keras 3 code base from keras-team/keras-core to keras-team/keras.
Consequently, merging this PR is not possible at the moment. After the migration is successfully completed, feel free to reopen this PR at keras-team/keras if you believe it remains relevant to the Keras 3 code base. If this PR instead fixes a bug or security issue in legacy tf.keras, you can reopen it at keras-team/tf-keras, which hosts the TensorFlow-only, legacy version of Keras.

PR Queue automation moved this from Assigned Reviewer to Closed/Rejected Sep 19, 2023