3d4c0a618d0acd7921493e4f30395c22-Paper-Conference.pdf
–Neural Information Processing Systems
Giventextual explanations, our proposed framework uses agenerative model conditioned on textual input to create data points representing the explanations. Bycomparing theneuron'sresponse tothese generated data points and control data points, we can estimate the quality of the explanation.
Neural Information Processing Systems
Feb-11-2026, 14:25:49 GMT