JEDI: The Force of Jensen-Shannon Divergence in Disentangling Diffusion Models

Eric Tillmann Bill, Enis Simsar, Thomas Hofmann

Jul-24-2025–arXiv.org Artificial Intelligence

We introduce JEDI, a test-time adaptation method that enhances subject separation and compositional alignment in diffusion models without requiring retraining or external supervision. JEDI operates by minimizing semantic entanglement in attention maps using a novel Jensen-Shannon divergence-based objective. To improve efficiency, we leverage adversarial optimization, which reduces the number of update steps required. JEDI is model-agnostic and applicable to architectures such as Stable Diffusion 1.5 and 3.5, consistently improving prompt alignment and disentanglement in complex scenes. Additionally, JEDI provides a lightweight, CLIP-free disentanglement score derived from internal attention distributions, offering a principled benchmark for compositional alignment under test-time conditions. Code and results are available at https://ericbill21.github.io/JEDI/.
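To make the central quantity concrete, the following is a minimal sketch of the standard (generalized, uniformly weighted) Jensen-Shannon divergence applied to attention maps that have been normalized into probability distributions. It is not the authors' objective, whose exact form is not given in this abstract. The function names `normalize`, `kl`, and `jsd`, the `eps` smoothing constant, and the toy 2x2 maps are all assumptions made for illustration.

```python
import numpy as np


def normalize(attn, eps=1e-12):
    """Flatten a non-negative attention map into a probability vector.

    eps is an assumed smoothing constant that keeps the logarithms finite.
    """
    a = np.asarray(attn, dtype=np.float64).ravel()
    a = np.clip(a, 0.0, None) + eps
    return a / a.sum()


def kl(p, q):
    """Kullback-Leibler divergence KL(p || q) in nats."""
    return float(np.sum(p * np.log(p / q)))


def jsd(maps):
    """Generalized Jensen-Shannon divergence with uniform weights.

    Computed as the mean KL divergence from each distribution to the
    average (mixture) of all distributions.
    """
    dists = np.stack([normalize(m) for m in maps])
    mixture = dists.mean(axis=0)
    return float(np.mean([kl(p, mixture) for p in dists]))


# Toy attention maps for two hypothetical subject tokens.
cat_map = np.array([[0.9, 0.1], [0.0, 0.0]])
dog_map = np.array([[0.0, 0.0], [0.1, 0.9]])
print(jsd([cat_map, dog_map]))  # high when the maps barely overlap
print(jsd([cat_map, cat_map]))  # near zero when the maps coincide
```

For two distributions the value lies between 0 (identical maps) and log 2 (non-overlapping maps), so under this reading a larger divergence between the maps of different subjects would correspond to less entanglement. How JEDI groups tokens, weights terms, or aggregates across layers and timesteps is left open here.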

artificial intelligence, machine learning, optimization problem


Country:
- North America (0.28)
- Europe > Switzerland (0.28)

Genre:
- Research Report (0.50)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Optimization (0.46)
  - Machine Learning > Neural Networks (0.39)
