Probability Functional Descent: A Unifying Perspective on GANs, Variational Inference, and Reinforcement Learning

Casey Chu, Jose Blanchet, Peter Glynn

Jan-30-2019, arXiv.org Machine Learning

The goal of this paper is to provide a unifying view of a wide range of problems of interest in machine learning by framing them as the minimization of functionals defined on the space of probability measures. In particular, we show that generative adversarial networks, variational inference, and actor-critic methods in reinforcement learning can all be seen through the lens of our framework. We then discuss a generic optimization algorithm for our formulation, called probability functional descent (PFD), and show how this algorithm recovers existing methods developed independently in the settings mentioned earlier.
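As a rough illustration of the kind of procedure the abstract describes, the sketch below minimizes a functional of a probability distribution on a toy problem. It is a minimal sketch under stated assumptions, not the paper's implementation: the sample space is finite, the functional is J(mu) = KL(mu || p) for a fixed target p, the distribution is parameterized by softmax logits, and the influence function (the first-order derivative of J with respect to mu, which for this KL functional is log(mu/p) up to an additive constant) is computed exactly rather than estimated. The function names, the target distribution, the step size, and the iteration count are all hypothetical choices made for the example.

```python
import math


def softmax(theta):
    """Map unconstrained logits to a probability vector."""
    m = max(theta)
    exps = [math.exp(t - m) for t in theta]
    z = sum(exps)
    return [e / z for e in exps]


def kl(mu, p):
    """Toy functional J(mu) = KL(mu || p) on a finite sample space."""
    return sum(m * math.log(m / q) for m, q in zip(mu, p) if m > 0)


def influence_function(mu, p):
    """First-order derivative of KL(. || p) at mu, up to an additive constant.

    Assumption for this toy: computed in closed form instead of being estimated.
    """
    return [math.log(m / q) for m, q in zip(mu, p)]


def descent_step(theta, psi, lr):
    """One gradient step on the linearized objective E_{x ~ mu_theta}[psi(x)].

    psi is held fixed during the step; the gradient with respect to the
    softmax logits is mu_k * (psi_k - E_mu[psi]).
    """
    mu = softmax(theta)
    mean_psi = sum(m * s for m, s in zip(mu, psi))
    grad = [m * (s - mean_psi) for m, s in zip(mu, psi)]
    return [t - lr * g for t, g in zip(theta, grad)]


def pfd(p, steps=2000, lr=0.5):
    """Alternate between computing the influence function and a descent step."""
    theta = [0.0] * len(p)  # start from the uniform distribution
    history = []
    for _ in range(steps):
        mu = softmax(theta)
        history.append(kl(mu, p))
        psi = influence_function(mu, p)
        theta = descent_step(theta, psi, lr)
    return softmax(theta), history


target = [0.1, 0.2, 0.3, 0.4]  # hypothetical target distribution
mu_final, history = pfd(target)
```

Here `history` records the value of the functional before each step, and `mu_final` is the distribution after the last step. In this toy setting the additive constant in the influence function does not affect the update, because the softmax gradient only depends on the influence function through its deviation from its mean under the current distribution.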

algorithm, descent step, influence function

Country:
- North America > United States > California > Santa Clara County
  - Stanford (0.04)
  - Palo Alto (0.04)

Genre:
- Research Report (0.50)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Optimization (1.00)
  - Machine Learning
    - Reinforcement Learning (1.00)
    - Neural Networks > Deep Learning (0.68)
