feature_importance for multiinput models with data as a list of arrays #142

jmaspons · 2022-03-19T19:00:29Z

Datasets can be 2d or 3d arrays

…ets. Datasets can be 2d or 3d arrays

hbaniecki · 2022-03-20T11:18:09Z

Hi @jmaspons, thanks for this contribution. I will try to review it next week.

Could you provide some code examples (use case) for this functionality? Ideally, it would also be later incorporated into the package tests.
Can you tell me how it relates to Support 3d arrays as input data for feature_importance #141?

jmaspons · 2022-03-21T10:43:47Z

Hello,

You can find a test script with dummy data at https://gist.github.com/jmaspons/0199ef922571bafe5eaac1a056963a83 (it requires keras, abind, DALEX and data.table packages). The patch implements feature_importance for models with more than one input datasets as 2D and 3D arrays. It can be useful for time series data (3D to a RNN) with some static variables (2D). DALEX::explainer doesn't support this kind of data input, so no changes to feature_importance.explainer

#141 implements feature_importance for a single input model. I should add some changes to that PR for the variable_groups and the autogenerated variables following this patch which I tested much more cases.

The feature_importance.default and feature_importance.multiinput

In order to add tests, do you think it's acceptable to add all the dependencies or skip some by saving some data in the package?

hbaniecki · 2022-03-21T14:11:08Z

For starters, we should use underscore notation for function parameters instead of camelCase, e.g. perm_dim.

In order to add tests, do you think it's acceptable to add all the dependencies or skip some by saving some data in the package?

All such dependencies should be added to suggests; we wouldn't want more dependencies in imports. It would be nice to have some tests; they can run on generated data.

Static variables can be also categorical Requires ModelOriented/ingredients#142

No need in an internal function

Implements support for 3D arrays in a list of inputs and ... to predict function Waiting for ModelOriented/ingredients#142 and ModelOriented/ingredients#143

pbiecek · 2023-01-13T20:56:59Z

let's not have data.table and keras as dependencies

jmaspons · 2023-01-15T12:26:19Z

let's not have data.table and keras as dependencies

I'll find alternatives implementations for the data.table part. For the keras tests, is it ok to make it conditional and add keras in the suggested packages section?

pbiecek · 2023-03-15T21:18:27Z

we are trying to have DALEX as light as possible and keras is quite heavy package
so maybe a valid solution would be to move this function to DALEXtra?
(it has some heavy dependencies)

feature_importance for multiinput models with data as a list of datas…

ecaa5a9

…ets. Datasets can be 2d or 3d arrays

hbaniecki added the feature 💡 New feature or request label Mar 20, 2022

jmaspons added 2 commits March 21, 2022 11:44

Improve variable_groups names + code style

54f24ff

update docs

602de07

jmaspons added 3 commits March 21, 2022 15:42

CamelCase -> snake_case

c521d42

Update docs with updated parameter names

fd3ef8d

Tests for feature_importance.multiinput

a69bbfd

jmaspons added a commit to jmaspons/MLTools that referenced this pull request Mar 22, 2022

Time series with static variables working

91b760b

Static variables can be also categorical Requires ModelOriented/ingredients#142

jmaspons added 2 commits April 1, 2022 17:21

Fix call to feature_importance.multiinput by passing N=N

41922e1

Remove deprecated parameter

89b83e4

No need in an internal function

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feature_importance for multiinput models with data as a list of arrays #142

feature_importance for multiinput models with data as a list of arrays #142

jmaspons commented Mar 19, 2022

hbaniecki commented Mar 20, 2022 •

edited

Loading

jmaspons commented Mar 21, 2022

hbaniecki commented Mar 21, 2022

pbiecek commented Jan 13, 2023

jmaspons commented Jan 15, 2023

pbiecek commented Mar 15, 2023

feature_importance for multiinput models with data as a list of arrays #142

Are you sure you want to change the base?

feature_importance for multiinput models with data as a list of arrays #142

Conversation

jmaspons commented Mar 19, 2022

hbaniecki commented Mar 20, 2022 • edited Loading

jmaspons commented Mar 21, 2022

hbaniecki commented Mar 21, 2022

pbiecek commented Jan 13, 2023

jmaspons commented Jan 15, 2023

pbiecek commented Mar 15, 2023

hbaniecki commented Mar 20, 2022 •

edited

Loading