
Initial Release of Captum for Model Interpretability, Attribution and Debugging


We just released our first version of the PyTorch Captum library for model interpretability!

Highlights

This first release, v0.1.0, supports a number of gradient-based attribution algorithms as well as Captum Insights, a visualization tool for model debugging and understanding.

Attribution Algorithms

The following general-purpose gradient-based attribution algorithms are provided. They can be applied to any type of PyTorch model and input features, including image, text, and multimodal; a minimal usage sketch follows the list below.

  1. Attribution of the model's output with respect to the input features

    1. Saliency
    2. InputXGradient
    3. IntegratedGradients
    4. DeepLift
    5. DeepLiftShap
    6. GradientShap
  2. Attribution of the model's output with respect to the layers of the model

    1. LayerActivation
    2. LayerGradientXActivation
    3. LayerConductance
    4. InternalInfluence
  3. Attribution of neurons with respect to the input features

    1. NeuronGradient
    2. NeuronIntegratedGradients
    3. NeuronConductance
  4. Attribution Algorithm + noisy sampling

    1. NoiseTunnel
      NoiseTunnel reduces noise in the attributions produced by other attribution algorithms by aggregating attributions computed over multiple noisy copies of the input, using techniques such as smoothgrad, smoothgrad_sq, and vargrad.
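As a minimal sketch of how these four families fit together: the toy model, the choice of layer, and the neuron index below are purely illustrative, and keyword argument names may differ slightly between Captum versions.

```python
import torch
import torch.nn as nn
from captum.attr import (
    IntegratedGradients,
    LayerConductance,
    NeuronConductance,
    NoiseTunnel,
)

# A toy classifier standing in for any PyTorch model.
model = nn.Sequential(nn.Linear(8, 16), nn.ReLU(), nn.Linear(16, 3))
model.eval()

inputs = torch.randn(4, 8)

# 1. Output with respect to the input features.
ig = IntegratedGradients(model)
input_attr = ig.attribute(inputs, baselines=torch.zeros_like(inputs), target=0)

# 2. Output with respect to a layer (here the first Linear layer).
lc = LayerConductance(model, model[0])
layer_attr = lc.attribute(inputs, target=0)

# 3. A single neuron with respect to the input features (neuron 3 of that layer).
nc = NeuronConductance(model, model[0])
neuron_attr = nc.attribute(inputs, (3,), target=0)

# 4. Smoothing any algorithm's attributions by wrapping it in NoiseTunnel.
nt = NoiseTunnel(ig)
smooth_attr = nt.attribute(inputs, nt_type="smoothgrad", target=0)
```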

Batch and Data Parallel Optimizations

Some of the algorithms, such as integrated gradients, expand input tensors internally, so we want to make sure we can scale those tensors and our forward/backward computations efficiently. For that reason, the attribute methods accept an internal_batch_size argument: the expanded tensors are chunked into pieces of that size, forward and backward passes are run for each chunk separately, and the results are combined after computing gradients.
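For example, a sketch of how this looks for integrated gradients (the toy model and the specific values of n_steps and internal_batch_size are illustrative):

```python
import torch
import torch.nn as nn
from captum.attr import IntegratedGradients

model = nn.Sequential(nn.Linear(8, 16), nn.ReLU(), nn.Linear(16, 3))
inputs = torch.randn(4, 8)

ig = IntegratedGradients(model)
# With n_steps=200 the internally expanded tensor has 4 * 200 = 800 rows;
# internal_batch_size=100 runs the forward/backward passes in chunks of
# 100 rows and combines the gradients afterwards.
attributions = ig.attribute(
    inputs, target=0, n_steps=200, internal_batch_size=100
)
```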

The algorithms that support batched optimization are:

  1. IntegratedGradients
  2. LayerConductance
  3. InternalInfluence
  4. NeuronConductance

PyTorch data parallel models are also supported across all Captum algorithms, allowing users to take advantage of multiple GPUs when applying interpretability algorithms.
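A hedged sketch, assuming a machine with one or more CUDA GPUs; the model is simply wrapped in nn.DataParallel before being handed to an algorithm, and device placement details depend on your setup:

```python
import torch
import torch.nn as nn
from captum.attr import IntegratedGradients

model = nn.Sequential(nn.Linear(8, 16), nn.ReLU(), nn.Linear(16, 3)).cuda()
dp_model = nn.DataParallel(model)  # replicates the model across visible GPUs

ig = IntegratedGradients(dp_model)
# The forward/backward passes inside attribute() are distributed by DataParallel.
attributions = ig.attribute(torch.randn(32, 8).cuda(), target=0)
```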

More details on these algorithms can be found on our website at captum.ai/docs/algorithms.

Captum Insights

Captum Insights provides these algorithms in an interactive, Jupyter-notebook-based tool for model debugging and understanding. It can be embedded within a notebook or run as a standalone application.
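A minimal sketch of embedding Insights in a notebook, assuming an image classification setup. The toy model, class list, and batch generator are stand-ins, and the import paths follow the early tutorials (in later releases the features module moved, e.g. to captum.insights.attr_vis.features):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F
from captum.insights import AttributionVisualizer, Batch
from captum.insights.features import ImageFeature

# Toy stand-ins: a tiny image classifier and a generator of labeled batches.
model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 10))

def data_iterator():
    while True:
        yield Batch(inputs=torch.randn(8, 3, 32, 32),
                    labels=torch.randint(0, 10, (8,)))

visualizer = AttributionVisualizer(
    models=[model],
    score_func=lambda out: F.softmax(out, dim=1),
    classes=[str(i) for i in range(10)],
    features=[ImageFeature("Photo", baseline_transforms=[], input_transforms=[])],
    dataset=data_iterator(),
)

visualizer.render()  # embeds the interactive tool inside a Jupyter notebook
```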

Features:

  1. Visualize attribution across sampled data for classification models
  2. Multimodal support, combining text, image, and general features in a single model
  3. Filtering and debugging specific sets of classes and misclassified examples
  4. Jupyter notebook support for easy model and dataset modification

Insights is built with standard web technologies including JavaScript, CSS, React, Yarn and Flask.