Generalized Contrastive Divergence: Joint Training of Energy-Based Model and Diffusion Model through Inverse Reinforcement Learning

Yoon, Sangwoong, Kwon, Dohyun, Hwang, Himchan, Noh, Yung-Kyun, Park, Frank C.

Dec-6-2023–arXiv.org Artificial Intelligence

In GCD, the joint training of EBM and a diffusion model is formulated as a minimax problem, which reaches an equilibrium when both models converge to the data distribution. The minimax learning with GCD bears interesting equivalence to inverse reinforcement learning, where the energy corresponds to a negative reward, the diffusion model is a policy, and the real data is expert demonstrations. We present preliminary yet promising results showing that joint training is beneficial for both EBM and a diffusion model. GCD enables EBM training without MCMC while improving the sample quality of a diffusion model.

diffusion model, machine learning, reinforcement learning, (12 more...)

arXiv.org Artificial Intelligence

Dec-6-2023

arXiv.org PDF

Add feedback

Country:
- Europe (0.28)
- North America > United States (0.28)

Genre:
- Research Report > New Finding (0.34)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Reinforcement Learning (1.00)
  - Statistical Learning (1.00)