Add support for all output activations #310

Merged
5 commits merged into albermax:master on May 5, 2023

Conversation

@Rubinjo (Contributor) commented on Feb 15, 2023

I'm working on a binary classification problem, so my output layer uses a sigmoid activation instead of softmax. To cover such cases, I have generalized the model_wo_softmax function to accept any kind of activation, passed by name as an argument.

Here are the old and new calls:

innvestigate.model_wo_softmax(model)
innvestigate.model_wo_output_activation(model, "softmax")

I have also updated all examples and documentation that reference this function, so everything now uses the new call.
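
For illustration, here is a minimal sketch of the proposed two-argument call on a binary classifier. The toy model and its shapes are hypothetical; only model_wo_output_activation(model, "sigmoid") mirrors the signature proposed above:

```python
import tensorflow as tf
import innvestigate

# Hypothetical binary classifier whose output layer uses sigmoid.
model = tf.keras.Sequential([
    tf.keras.layers.Dense(16, activation="relu", input_shape=(8,)),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])

# Proposed call at this stage of the PR: the output activation to
# strip is passed by name.
model_wo_act = innvestigate.model_wo_output_activation(model, "sigmoid")
```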

@adrhill (Collaborator) commented on Feb 27, 2023

Hi @Rubinjo, thanks for the contribution.
Adding a model_wo_output_activation function sounds OK to me; however, this PR needs three changes:

  1. We can't remove model_wo_softmax, since this would break our users' existing code. We recently had a major breaking release, and this minor change is not worth another one.
  2. I don't see a reason to update the readme and all notebooks. iNNvestigate is a package for use with classifiers, and these most commonly use softmax output activations. If this documentation update were necessary, I would suggest doing it in a separate PR to keep the diff small.
  3. If model_wo_output_activation is supposed to remove any activation function, an implementation that doesn't require the name of the activation function as an argument would be more elegant (see the sketch after this list).
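
For point 3, one name-free approach could swap the final layer's activation for the identity and rebuild the model. A minimal sketch, assuming a TF-Keras model whose output activation is fused into its last layer (e.g. Dense(..., activation="sigmoid")) rather than applied by a separate Activation layer; this is not iNNvestigate's actual implementation:

```python
import tensorflow as tf

def model_wo_output_activation(model):
    # Sketch: return a copy of `model` with its output activation
    # replaced by the identity, whatever that activation is.
    last = model.layers[-1]
    original = last.activation
    # Temporarily swap in the identity; clone_model re-instantiates
    # each layer from its config, so the change carries over.
    last.activation = tf.keras.activations.linear
    stripped = tf.keras.models.clone_model(model)
    stripped.set_weights(model.get_weights())
    # Restore the original model so it is left untouched.
    last.activation = original
    return stripped
```

With something like this, the caller never has to name the activation being removed.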

@Rubinjo (Contributor, Author) commented on Feb 27, 2023

Yeah, it is indeed a pretty drastic change. I mainly made it this way to avoid duplicating model_wo_softmax into multiple similar functions. I can also implement it by refactoring only the pre_softmax_tensors method; then your first point is still satisfied, and no updates to the readme and notebooks are needed.

@adrhill (Collaborator) commented on Feb 27, 2023

I would leave model_wo_softmax as is and add a model_wo_output_activation(model) function that doesn't require a string. All other changes besides added tests should be reverted.

@Rubinjo (Contributor, Author) commented on Feb 27, 2023

Yeah, I agree. Preserving model_wo_softmax and adding a model_wo_output_activation(model) should be the end result.

@Rubinjo (Contributor, Author) commented on Feb 28, 2023

I have adjusted pre_softmax_tensors so that model_wo_softmax keeps working without any change for the user, and added a model_wo_output_activation(model) that builds on the same helper.

Now all three points from your earlier message should be satisfied.
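
For reference, the resulting public API on a hypothetical sigmoid model; model_wo_softmax stays unchanged and the new function takes no activation name:

```python
import tensorflow as tf
import innvestigate

# Hypothetical sigmoid-output model, as in the motivating use case.
model = tf.keras.Sequential([
    tf.keras.layers.Dense(16, activation="relu", input_shape=(8,)),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])

# Existing entry point, unchanged (expects a softmax output):
#   innvestigate.model_wo_softmax(softmax_model)
# New entry point from this PR: no activation name required.
model_wo_act = innvestigate.model_wo_output_activation(model)
```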

@adrhill (Collaborator) commented on May 5, 2023

Looks good to me. Sorry for the delay! :)

@adrhill merged commit 4c0f355 into albermax:master on May 5, 2023
@adrhill mentioned this pull request on Jul 21, 2023