
Fix empty data loading in Insights #728

Closed · wants to merge 1 commit

Conversation

@Reubend (Contributor) commented Aug 3, 2021:

Previously, once Captum Insights had shown every batch in the dataset, it would stop showing any data on subsequent fetches. This change makes it recycle some batches, so that the user can keep clicking "Fetch Data" to see the effect that their settings have on the results.

This was reported in #686
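
A minimal sketch of the recycling idea described above (not the actual patch; the class and attribute names are invented for illustration):

class BatchServer:
    """Serve fresh batches while the dataset iterator lasts, then
    replay a small cache so "Fetch Data" keeps returning results."""

    def __init__(self, dataset, cache_size=4):
        self._dataset_iter = iter(dataset)
        self._batch_cache = []        # keeps only the most recent batches
        self._cache_size = cache_size
        self._replay_index = 0

    def next_batch(self):
        try:
            batch = next(self._dataset_iter)
        except StopIteration:
            # Dataset exhausted: recycle the cached batches round-robin
            # (assumes at least one batch was fetched before exhaustion).
            batch = self._batch_cache[self._replay_index % len(self._batch_cache)]
            self._replay_index += 1
            return batch
        # Remember the batch, but bound memory by keeping only a few of them.
        self._batch_cache.append(batch)
        if len(self._batch_cache) > self._cache_size:
            self._batch_cache.pop(0)
        return batch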

@facebook-github-bot (Contributor):

@Reubend has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.


@@ -439,7 +441,22 @@ def _calculate_vis_output(
        return results if results else None

    def _get_outputs(self) -> List[Tuple[List[VisualizationOutput], SampleCache]]:
        batch_data = next(self._dataset_iter)
        # If we run out of new betches, then we need to
Contributor (review comment on the line above):

betches -> batches ?

@NarineK (Contributor) commented Aug 11, 2021:

LGTM! @bilalsal, @edward-io, do you have any comments on this issue?

@@ -199,6 +200,7 @@ class scores.
        self._outputs: List[VisualizationOutput] = []
        self._config = FilterConfig(prediction="all", classes=[], num_examples=4)
        self._dataset_iter = iter(dataset)
Contributor (review comment on the line above):

Wouldn't your PR be simpler if you constructed this as cycle(iter(dataset)), since it seems you're effectively reimplementing that?

@Reubend (Contributor, Author) replied:

It would be much simpler that way, but my concern is memory usage. Per the docs, it seems that cycle

    may require significant auxiliary storage (depending on the length of the iterable)

which suggests to me that it stores every element it has seen and then loops over them again. If the dataset is huge, that could be a problem, because every batch would end up held in memory. That's why I made the cache store only a few batches, rather than cycling over the entire dataset.

However, if you don't think memory is an issue, I could do it that way instead. It would probably only matter if somebody requested a lot of attributions.
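
For context, an illustration (not code from the PR) of why itertools.cycle can be costly: it keeps an internal copy of every element it yields, so after one pass over a batch iterator the whole dataset is held in memory.

from itertools import cycle

def batches():
    for i in range(3):
        yield f"batch_{i}"  # stand-in for a real data batch

looped = cycle(batches())
print([next(looped) for _ in range(7)])
# ['batch_0', 'batch_1', 'batch_2', 'batch_0', 'batch_1', 'batch_2', 'batch_0']
# cycle() saved a copy of all three elements after the first pass; with large
# tensors as batches, that saved copy amounts to the entire dataset.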

Contributor replied:

Makes sense, thanks for clarifying. Probably best to assume memory will be an issue.

@edward-io (Contributor) left a comment:

Looks good to me!

@facebook-github-bot (Contributor):

@Reubend merged this pull request in e222265.
