V-CECE: Visual Counterfactual Explanations via Conceptual Edits
–Neural Information Processing Systems
Recent black-box counterfactual generation frameworks fail to take into account the semantic content of the proposed edits, while relying heavily on training to guide the generation process. We propose a novel, plug-and-play black-box counterfactual generation framework, which suggests step-by-step edits based on theoretical guarantees of optimal edits to produce human-level counterfactual explanations with zero training. Our framework utilizes a pre-trained image editing diffusion model, and operates without access to the internals of the classifier, leading to an explainable counterfactual generation process.
Neural Information Processing Systems
Jun-12-2026, 15:36:49 GMT
- Technology: