Appendix: Energy-Based Cross Attention for Bayesian Context 388 Update in T ext-to-Image Diffusion Models 389 A Proof of Theorem 1 390 Theorem 1. For the energy functions 391 E(Q; K) = α 2 diag(KK