EnCoBo: Energy-Guided Concept Bottlenecks for Interpretable Generation
Kim, Sangwon, Lee, Kyoungoh, Dong, Jeyoun, Ahn, Jung Hwan, Kim, Kwang-Ju
–arXiv.org Artificial Intelligence
Concept Bottleneck Models (CBMs) provide interpretable decision-making through explicit, human-understandable concepts. However, existing generative CBMs often rely on auxiliary visual cues at the bottleneck, which undermines interpretability and intervention capabilities. We propose EnCoBo, a post-hoc concept bottleneck for generative models that eliminates auxiliary cues by constraining all representations to flow solely through explicit concepts. Unlike autoencoder-based approaches that inherently rely on black-box decoders, EnCoBo leverages a decoder-free, energy-based framework that directly guides generation in the latent space. Guided by diffusion-scheduled energy functions, EnCoBo supports robust post-hoc interventions-such as concept composition and negation-across arbitrary concepts. Experiments on CelebA-HQ and CUB datasets showed that EnCoBo improved concept-level human intervention and interpretability while maintaining competitive visual quality.
arXiv.org Artificial Intelligence
Sep-19-2025
- Country:
- Asia > South Korea
- North America > United States
- California (0.04)
- Genre:
- Research Report (0.50)
- Technology: