
560 improve results of lime timeseries notebooks #589

Merged 13 commits into main on May 23, 2023

Conversation

geek-yang
Member

We fine-tuned the parameters of LIME timeseries to improve the results in the notebooks (lime_timeseries_coffee.ipynb and lime_timeseries_weather.ipynb).

Note that because of the absence of strategic segmentation and multi-channel masking, the results and visualizations are not perfect. But they are much improved compared to before (at least good enough to show during the SURF event).

@review-notebook-app
Check out this pull request on ReviewNB to see visual diffs and provide feedback on the Jupyter notebooks.

@geek-yang geek-yang linked an issue May 11, 2023 that may be closed by this pull request
@geek-yang geek-yang marked this pull request as ready for review May 15, 2023 07:10
@geek-yang geek-yang requested a review from stefsmeets May 15, 2023 07:10
@stefsmeets (Contributor) left a comment

Nice work! I noticed that you use explain_timeseries in the notebooks, but calling it with method='lime' versus method='rise' gives different output types. The interface is not consistent. Is this something that can be fixed in this PR, or should it be a new issue?

Lime gives back an explanation instance:

>>> lime_exp = dianna.explain_timeseries(
>>>     run_expert_model, 
>>>     timeseries_data=data_extreme,
>>>     method='lime', 
>>>     labels=[0,1], 
>>>     class_names=["summer", "winter"],
>>>     num_features=len(data_extreme),
>>>     num_samples=10000,
>>>     num_slices=len(data_extreme), 
>>>     distance_method='euclidean',
>>>     mask_type=input_train_mean)
>>> lime_exp
<lime.explanation.Explanation at 0x7f8dbf2dfe20>

Rise gives back a numpy array:

>>> rise_exp = dianna.explain_timeseries(
>>>     run_expert_model, 
>>>     timeseries_data=data_extreme,
>>>     method='rise', 
>>>     labels=[0,1], 
>>>     p_keep=0.1,
>>>     n_masks=10000, 
>>>     mask_type=input_train_mean)
>>> rise_exp
array([[[7.700e-02],
        [3.000e-03],
        ...
        [4.000e-03]],

       [[9.650e-01],
        [9.820e-01],
        ...
        [1.053e+00]]])
>>> rise_exp.shape
(2, 28, 1)
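Until the interface is unified, a caller could paper over the difference with a small helper. This is a hypothetical sketch, not part of dianna; it assumes the LIME Explanation object exposes its scores in local_exp as a dict mapping each label to (segment_index, weight) pairs, which is LIME's standard layout:

```python
import numpy as np

def to_scores_array(explanation, labels, num_features):
    """Return a (len(labels), num_features) numpy array, whether
    explain_timeseries gave back a RISE-style array or a LIME
    Explanation object. Hypothetical helper for illustration."""
    if isinstance(explanation, np.ndarray):
        # RISE path: flatten a trailing channel axis if present.
        return explanation.reshape(len(labels), -1)
    # LIME path: local_exp maps label -> [(segment_index, weight), ...]
    scores = np.zeros((len(labels), num_features))
    for row, label in enumerate(labels):
        for idx, weight in explanation.local_exp[label]:
            scores[row, idx] = weight
    return scores
```

With this, downstream plotting code can treat both explainers identically.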

@geek-yang
Member Author

Nice work! I noticed that you use explain_timeseries in the notebooks, but calling it with method='lime' versus method='rise' gives different output types. The interface is not consistent. Is this something that can be fixed in this PR, or should it be a new issue?

Hi @stefsmeets, thanks a lot for your review and quick feedback 😄. Let me take a quick look at the interface; that's a bit unexpected. I will try to fix it in this PR, if possible.


This is because we use the lime_base function from the original implementation of LIME to compute the scores, and it returns an explainer object that stores the LIME scores in explainer.local_exp. It is good that you raised this point. I think we can simply return only explainer.local_exp, converted to a numpy array, to be consistent with RISE. Let me fix this then.
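The proposed fix could look roughly like the following. This is a sketch under the assumption that explainer.local_exp is LIME's usual dict mapping each label to a list of (feature_index, weight) pairs; the helper name and the example weights are made up for illustration:

```python
import numpy as np

def local_exp_to_array(local_exp, labels, num_features):
    """Flatten LIME's explainer.local_exp into a dense numpy array of
    shape (len(labels), num_features), mirroring RISE's array output."""
    scores = np.zeros((len(labels), num_features))
    for row, label in enumerate(labels):
        for feature_idx, weight in local_exp[label]:
            scores[row, feature_idx] = weight
    return scores

# Made-up weights, just to show the shape of the result:
local_exp = {0: [(2, 0.31), (0, -0.12)], 1: [(1, -0.40), (0, 0.09)]}
print(local_exp_to_array(local_exp, labels=[0, 1], num_features=3).shape)
# -> (2, 3)
```

Features that LIME did not assign a weight stay at zero, so the array always has a fixed, label-by-feature shape like RISE's.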

@geek-yang geek-yang requested a review from stefsmeets May 17, 2023 12:46
@geek-yang
Member Author

geek-yang commented May 17, 2023

@stefsmeets I just updated the return value of the LIME timeseries explainer: it is now an array, consistent with RISE.

I also updated the notebook (lime_timeseries_weather.ipynb) to use the explain_timeseries interface. The results from LIME timeseries differ from those of RISE, which is expected (not because of the interface difference, but for several other reasons, e.g. the algorithms are different, and a segmentation strategy is still absent, see #546).

Just take another look and let me know if you have more comments, thanks @stefsmeets !

@stefsmeets (Contributor) left a comment

Thanks, looks good to me! 🚀

@stefsmeets stefsmeets merged commit 0155a5a into main May 23, 2023
22 checks passed
Project board automation (Sprint 30 - DIANNA 1.0.0 release including timeseries tutorials) moved this from Ready for review to Done May 23, 2023
@stefsmeets stefsmeets deleted the 560-improve-results-notebook-LIME-timeseries branch May 23, 2023 11:29
Successfully merging this pull request may close these issues.

Improve results in tutorial notebooks for LIME timeseries
2 participants