Existing algorithms for explaining the output of image classifiers use different definitions of explanations and a variety of techniques to find them. However, none of the existing tools use a principled approach based on formal definitions of cause and explanation.
In this paper we present a novel black-box approach to computing explanations grounded in the theory of ...