Tian Qi Chen
Neural Ordinary Differential Equations
Tian Qi Chen, Yulia Rubanova, Jesse Bettencourt, David K. Duvenaud
We introduce a new family of deep neural network models. Instead of specifying a discrete sequence of hidden layers, we parameterize the derivative of the hidden state using a neural network. The output of the network is computed using a black-box differential equation solver. These continuous-depth models have constant memory cost, adapt their evaluation strategy to each input, and can explicitly trade numerical precision for speed. We demonstrate these properties in continuous-depth residual networks and continuous-time latent variable models. We also construct continuous normalizing flows, a generative model that can be trained by maximum likelihood, without partitioning or ordering the data dimensions. For training, we show how to scalably backpropagate through any ODE solver, without access to its internal operations. This allows end-to-end training of ODEs within larger models.
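A minimal sketch of the idea, assuming PyTorch: the derivative of the hidden state is an ordinary nn.Module, and a fixed-step RK4 loop stands in for the black-box adaptive solver (the paper's accompanying torchdiffeq library provides adaptive solvers and the memory-efficient adjoint backward pass). The names ODEFunc and rk4_integrate are illustrative, not from the paper.

    import torch
    import torch.nn as nn

    class ODEFunc(nn.Module):
        """Parameterizes the derivative dh/dt = f(h, t; theta) with a small MLP."""
        def __init__(self, dim):
            super().__init__()
            self.net = nn.Sequential(nn.Linear(dim, 64), nn.Tanh(), nn.Linear(64, dim))

        def forward(self, t, h):
            return self.net(h)

    def rk4_integrate(func, h0, t0=0.0, t1=1.0, steps=20):
        """Fixed-step RK4 integration standing in for a black-box adaptive solver."""
        h, t = h0, t0
        dt = (t1 - t0) / steps
        for _ in range(steps):
            k1 = func(t, h)
            k2 = func(t + dt / 2, h + dt * k1 / 2)
            k3 = func(t + dt / 2, h + dt * k2 / 2)
            k4 = func(t + dt, h + dt * k3)
            h = h + dt * (k1 + 2 * k2 + 2 * k3 + k4) / 6
            t = t + dt
        return h

    func = ODEFunc(dim=2)
    h0 = torch.randn(8, 2)          # batch of initial hidden states
    h1 = rk4_integrate(func, h0)    # "output layer" = state at time t=1
    loss = h1.pow(2).mean()
    loss.backward()                 # gradients flow through the solver steps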
Isolating Sources of Disentanglement in Variational Autoencoders
Tian Qi Chen, Xuechen Li, Roger B. Grosse, David K. Duvenaud
We decompose the evidence lower bound to show the existence of a term measuring the total correlation between latent variables. We use this to motivate the β-TCVAE (Total Correlation Variational Autoencoder) algorithm, a refinement and plug-in replacement of the β-VAE for learning disentangled representations, requiring no additional hyperparameters during training. We further propose a principled classifier-free measure of disentanglement called the mutual information gap (MIG). We perform extensive quantitative and qualitative experiments, in both restricted and non-restricted settings, and show a strong relation between total correlation and disentanglement, when the model is trained using our framework.
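A short sketch of the MIG score, assuming the mutual information matrix I(z_j; v_k) between latent dimensions and ground-truth factors, and the factor entropies H(v_k), have already been estimated (the estimation itself is the expensive part and is not shown here). The function name mutual_information_gap is illustrative.

    import numpy as np

    def mutual_information_gap(mi, factor_entropies):
        """
        mi: array of shape (num_latents, num_factors) with estimated I(z_j; v_k).
        factor_entropies: array of shape (num_factors,) with entropies H(v_k).
        MIG averages, over factors, the entropy-normalized gap between the highest
        and second-highest mutual information any single latent attains with that factor.
        """
        sorted_mi = np.sort(mi, axis=0)           # ascending along latent dimensions
        gap = sorted_mi[-1, :] - sorted_mi[-2, :]
        return np.mean(gap / factor_entropies)

    # toy example: 3 latent dimensions, 2 ground-truth factors
    mi = np.array([[1.2, 0.1],
                   [0.2, 0.9],
                   [0.1, 0.2]])
    H = np.array([1.5, 1.1])
    print(mutual_information_gap(mi, H))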
Neural Networks with Cheap Differential Operators
Tian Qi Chen, David K. Duvenaud
Gradients of neural networks can be computed efficiently for any architecture, but some applications require differential operators with higher time complexity. We describe a family of restricted neural network architectures that allow efficient computation of a family of differential operators involving dimension-wise derivatives, used in cases such as computing the divergence. Our proposed architecture has a Jacobian matrix composed of diagonal and hollow (non-diagonal) components. We can then modify the backward computation graph to extract dimension-wise derivatives efficiently with automatic differentiation. We demonstrate these cheap differential operators for solving root-finding subproblems in implicit ODE solvers, exact density evaluation for continuous normalizing flows, and evaluating the Fokker-Planck equation for training stochastic differential equation models.
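A hedged sketch of the underlying autograd trick, assuming PyTorch and hand-written stand-ins for the diagonal and hollow components: because the conditioner for dimension i does not depend on x_i, detaching it lets a single backward pass recover all dimension-wise derivatives df_i/dx_i at once, and hence the exact divergence. The helper names below are illustrative, not the paper's API.

    import torch

    def dimwise_derivative(diag_fn, hollow_fn, x):
        """
        Extract df_i/dx_i cheaply when f_i(x) = diag_fn(x_i, c_i) and the
        conditioner c_i = hollow_fn(x)_i does not depend on x_i. Detaching the
        hollow path lets one vector-Jacobian product (a single backward pass)
        recover the full diagonal of the Jacobian.
        """
        x = x.requires_grad_(True)
        c = hollow_fn(x).detach()        # hollow path: blocked from the gradient
        f = diag_fn(x, c)                # diagonal path: elementwise in x
        diag = torch.autograd.grad(f.sum(), x, create_graph=True)[0]
        divergence = diag.sum(dim=-1)    # e.g. exact divergence for a CNF
        return f, divergence

    # toy example with hand-written diagonal and hollow parts
    diag_fn = lambda x, c: torch.tanh(x * c)
    hollow_fn = lambda x: torch.roll(x, shifts=1, dims=-1)  # output i ignores x_i
    x = torch.randn(4, 3)
    f, div = dimwise_derivative(diag_fn, hollow_fn, x)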
Residual Flows for Invertible Generative Modeling
Tian Qi Chen, Jens Behrmann, David K. Duvenaud, Joern-Henrik Jacobsen
Flow-based generative models parameterize probability distributions through an invertible transformation and can be trained by maximum likelihood. Invertible residual networks provide a flexible family of transformations where only Lipschitz conditions rather than strict architectural constraints are needed for enforcing invertibility. However, prior work trained invertible residual networks for density estimation by relying on biased log-density estimates whose bias increased with the network's expressiveness. We give a tractable unbiased estimate of the log density using a "Russian roulette" estimator, and reduce the memory required during training by using an alternative infinite series for the gradient. Furthermore, we improve invertible residual blocks by proposing the use of activation functions that avoid derivative saturation and generalizing the Lipschitz condition to induced mixed norms. The resulting approach, called Residual Flows, achieves state-of-the-art performance on density estimation amongst flow-based models, and outperforms networks that use coupling blocks at joint generative and discriminative modeling.
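A rough sketch of the unbiased log-determinant estimator, assuming PyTorch: the power series log det(I + J) = sum_k (-1)^(k+1) tr(J^k)/k is estimated with a Hutchinson probe vector and truncated at a geometrically sampled length, with each kept term reweighted by its survival probability. Spectral normalization of the residual block (needed for the series to converge) is omitted, and the function name is illustrative.

    import torch

    def log_det_russian_roulette(residual_fn, x, p=0.5):
        """
        Unbiased estimate of log det(I + J_g(x)) for a residual block x -> x + g(x),
        combining the log-det power series with a Hutchinson trace estimator and a
        geometrically distributed ("Russian roulette") truncation. residual_fn
        computes g and is assumed to have Lipschitz constant < 1.
        """
        x = x.requires_grad_(True)
        g = residual_fn(x)
        v = torch.randn_like(x)                      # Hutchinson probe vector
        n_terms = int(torch.distributions.Geometric(probs=p).sample().item()) + 1
        w, logdet = v, torch.zeros(x.shape[0])
        for k in range(1, n_terms + 1):
            w = torch.autograd.grad(g, x, grad_outputs=w, retain_graph=True)[0]
            survival = (1.0 - p) ** (k - 1)          # P(N >= k)
            trace_k = (w * v).flatten(1).sum(dim=1)  # ~ tr(J^k) per sample
            logdet = logdet + (-1.0) ** (k + 1) * trace_k / (k * survival)
        return g, logdet

    # toy residual block with Lipschitz constant 0.5
    block = lambda x: 0.5 * torch.tanh(x)
    x = torch.randn(8, 4)
    g, logdet = log_det_russian_roulette(block, x)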
Latent Ordinary Differential Equations for Irregularly-Sampled Time Series
Yulia Rubanova, Tian Qi Chen, David K. Duvenaud
Time series with non-uniform intervals occur in many applications, and are difficult to model using standard recurrent neural networks (RNNs). We generalize RNNs to have continuous-time hidden dynamics defined by ordinary differential equations (ODEs), a model we call ODE-RNNs. Furthermore, we use ODE-RNNs to replace the recognition network of the recently-proposed Latent ODE model. Both ODE-RNNs and Latent ODEs can naturally handle arbitrary time gaps between observations, and can explicitly model the probability of observation times using Poisson processes. We show experimentally that these ODE-based models outperform their RNN-based counterparts on irregularly-sampled data.
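A minimal sketch of one ODE-RNN step, assuming PyTorch: between observations the hidden state follows a learned ODE (integrated here with a few Euler steps rather than an adaptive solver), and each observation triggers a standard GRU update. The class name ODERNNCell and the toy dimensions are illustrative, not from the paper's code.

    import torch
    import torch.nn as nn

    class ODERNNCell(nn.Module):
        """Hidden state evolves continuously over each time gap, then is updated
        discretely by a GRUCell when the next observation arrives."""
        def __init__(self, input_dim, hidden_dim):
            super().__init__()
            self.dynamics = nn.Sequential(nn.Linear(hidden_dim, 64), nn.Tanh(),
                                          nn.Linear(64, hidden_dim))
            self.gru = nn.GRUCell(input_dim, hidden_dim)

        def forward(self, h, x, dt, euler_steps=5):
            step = dt / euler_steps
            for _ in range(euler_steps):
                h = h + step.unsqueeze(-1) * self.dynamics(h)   # evolve over the gap
            return self.gru(x, h)                               # absorb the observation

    cell = ODERNNCell(input_dim=3, hidden_dim=16)
    h = torch.zeros(8, 16)
    times = torch.tensor([0.0, 0.4, 1.1, 1.3])                  # irregular timestamps
    xs = torch.randn(len(times), 8, 3)
    prev_t = torch.tensor(0.0)
    for t, x in zip(times, xs):
        h = cell(h, x, (t - prev_t).expand(8))
        prev_t = t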