514 add channel masking #554

cwmeijer · 2023-04-13T08:11:25Z

Adds the ability to channel certain channels.

We can now

mask channels
mask time steps
make a combination of the above

We cannot do any completely random masking, which is fine I think, but could be discussed.
Note that I left some code duplication in the code for these different masking operations. I will change one of the 2 duplicates when I implement #546.

geek-yang

I left my comments below. Thanks for your effort @cwmeijer . I think we can have a discussion about it. I have some questions regarding the implementation. Just need to clarify a few things. Thanks in advance.

geek-yang · 2023-04-24T14:19:44Z

dianna/utils/maskers.py

+    return np.concatenate([time_step_masks, channel_masks, number_of_combined_masks], axis=0)
+
+
+def generate_channel_masks(input_data: np.ndarray, number_of_masks: int, p_keep: float):


This is the duplication part you mentioned in the top post I think, just a reminder for ourselves, that it will be solved in PR #562, I will leave a comment there as well.

geek-yang · 2023-04-24T15:50:52Z

dianna/utils/maskers.py

+    number_of_channel_masks = number_of_masks // 3
+    number_of_time_step_masks = number_of_channel_masks
+    number_of_combined_masks = number_of_masks - number_of_time_step_masks - number_of_channel_masks


This is quite an interesting way to implement the masking for multi-channels. I can understand that by doing this we can have a very balanced masking array, which contains masking for entire channels, masking for certain time steps across channels and a mixture of them. But I have a feeling that this makes it a bit too complex.

What I have in mind are two simple ways:

Simple flatten the input data and mask them brute-forcely

Loop through all channels and treat them individually (mask each channel separately and concatenate them)

I can imagine that the current implementation makes the segmentation very tricky to code. Let's have a chat about it. It could be possible that I misunderstand something.

But thanks a lot for the effort! This also provides more insight about what we want.

geek-yang · 2023-04-24T15:52:05Z

tests/methods/test_maskers.py

+    assert np.any(result)
+    assert np.any(~result)


Smart check!

geek-yang

Thanks for the explanation @cwmeijer. To clarify a bit, this implementation tends to mask the timestep across all channels, a complete channel and a mixing of these two. This way we can have it more structured and it also allows us to implement the same strategy as we have for image masking based on wave function.

I checked the code and all the tests. They look very good to me. Nice work @cwmeijer! I think we can merge this now and continue working on the segmentation.

But since the algorithm is not so straight forward, we need to add more documentation/docstrings/examples to explain it. I will create a new issue for this and we also add this for the smart masking strategy based on wave functions for the images, which was not added before.

The purely random masking is a low-hanging fruit which can be another option to this. We can implement it later when working on Lime implementation for timeseries. I will create another issue for that.

Overall, nice work and thanks for all the strategical and smart thinking behind all the codes @cwmeijer 👍 .

cwmeijer added 3 commits April 12, 2023 11:32

add channel masking (refs #514)

dda0b97

add combined masks (refs #514)

ec17ba9

Fix univariate case for masking with channel mask support (refs #514)

3bd7f76

cwmeijer changed the title ~~add channel masking (refs #514)~~ 514 add channel masking Apr 13, 2023

cwmeijer added 2 commits April 13, 2023 15:56

fix linter issues

3aa167e

remove unused import

f7b8616

cwmeijer marked this pull request as ready for review April 13, 2023 14:01

formatted imports

289275f

cwmeijer requested a review from geek-yang April 18, 2023 08:51

geek-yang reviewed Apr 24, 2023

View reviewed changes

geek-yang mentioned this pull request Apr 25, 2023

546 masking time step segmentation #562

Merged

geek-yang approved these changes Apr 25, 2023

View reviewed changes

cwmeijer merged commit 3f97c30 into main Apr 25, 2023

cwmeijer deleted the 514-channel-masking branch April 25, 2023 09:47

This was referenced Apr 25, 2023

Add documentation/docstrings/examples of strategic masking based on wave function #564

Closed

Add naive random masking strategy for masking timeseries #565

Open

geek-yang linked an issue Apr 25, 2023 that may be closed by this pull request

add channel masking #514

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

514 add channel masking #554

514 add channel masking #554

cwmeijer commented Apr 13, 2023 •

edited

Loading

geek-yang left a comment

geek-yang Apr 24, 2023

geek-yang Apr 24, 2023

geek-yang Apr 24, 2023

geek-yang left a comment

		return np.concatenate([time_step_masks, channel_masks, number_of_combined_masks], axis=0)


		def generate_channel_masks(input_data: np.ndarray, number_of_masks: int, p_keep: float):

514 add channel masking #554

514 add channel masking #554

Conversation

cwmeijer commented Apr 13, 2023 • edited Loading

geek-yang left a comment

Choose a reason for hiding this comment

geek-yang Apr 24, 2023

Choose a reason for hiding this comment

geek-yang Apr 24, 2023

Choose a reason for hiding this comment

geek-yang Apr 24, 2023

Choose a reason for hiding this comment

geek-yang left a comment

Choose a reason for hiding this comment

cwmeijer commented Apr 13, 2023 •

edited

Loading