Fine-tuning of diffusion models via stochastic control: entropy regularization and beyond

Mar-12-2024–arXiv.org Artificial Intelligence

This paper aims to develop and provide a rigorous treatment to the problem of entropy regularized fine-tuning in the context of continuous-time diffusion models, which was recently proposed by Uehara et al. (arXiv:2402.15194, 2024). The idea is to use stochastic control for sample generation, where the entropy regularizer is introduced to mitigate reward collapse. We also show how the analysis can be extended to fine-tuning involving a general $f$-divergence regularizer.

diffusion model, noise, pre, (16 more...)

arXiv.org Artificial Intelligence

Mar-12-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - New York (0.04)
  - Illinois (0.04)
- Europe
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
  - Netherlands > South Holland
    - Dordrecht (0.04)
  - Italy > Calabria
    - Catanzaro Province > Catanzaro (0.04)
  - France > Auvergne-Rhône-Alpes
    - Isère > Grenoble (0.04)

Genre:
- Research Report (0.50)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Machine Learning > Neural Networks (0.93)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found