Scalable Gradients for Stochastic Differential Equations

Li, Xuechen, Wong, Ting-Kam Leonard, Chen, Ricky T. Q., Duvenaud, David

Jan-8-2020–arXiv.org Machine Learning

The adjoint sensitivity method scalably computes gradients of solutions to ordinary differential equations. We generalize this method to stochastic differential equations, allowing time-efficient and constant-memory computation of gradients with high-order adaptive solvers. Specifically, we derive a stochastic differential equation whose solution is the gradient, a memory-efficient algorithm for caching noise, and conditions under which numerical solutions converge. In addition, we combine our method with gradient-based stochastic variational inference for latent stochastic differential equations. We use our method to fit stochastic dynamics defined by neural networks, achieving competitive performance on a 50-dimensional motion capture dataset.

differential equation, diffusion function, stochastic differential equation, (10 more...)

arXiv.org Machine Learning

Jan-8-2020

arXiv.org PDF

Add feedback

Country:
- North America > Canada
  - Ontario > Toronto (0.28)
- Europe
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
  - Italy > Sicily
    - Palermo (0.04)
  - Belgium > Flanders
    - Flemish Brabant > Leuven (0.04)

Genre:
- Research Report (0.64)

Technology:
- Information Technology
  - Mathematics of Computing (1.00)
  - Artificial Intelligence
    - Representation & Reasoning (1.00)
    - Machine Learning
      - Neural Networks > Deep Learning (1.00)
      - Learning Graphical Models > Directed Networks
        Bayesian Learning (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found