AITopics | McAuliffe, Jon

Collaborating Authors

McAuliffe, Jon

Rao-Blackwellized Stochastic Gradients for Discrete Distributions

Liu, Runjing, Regier, Jeffrey, Tripuraneni, Nilesh, Jordan, Michael I., McAuliffe, Jon

arXiv.org Machine Learning, Oct-10-2018

We wish to compute the gradient of an expectation over a finite or countably infinite sample space having $K \leq \infty$ categories. When $K$ is indeed infinite, or finite but very large, the relevant summation is intractable. Accordingly, various stochastic gradient estimators have been proposed. In this paper, we describe a technique that can be applied to reduce the variance of any such estimator, without changing its bias; in particular, unbiasedness is retained. We show that our technique is an instance of Rao-Blackwellization, and we demonstrate the improvement it yields in empirical studies on both synthetic and real-world data.
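As a rough illustration of the idea in this abstract, the sketch below applies one form of Rao-Blackwellization to a score-function (REINFORCE-style) gradient estimator for a softmax-parameterized categorical distribution: the k most probable categories are summed exactly, and a single draw from the remaining categories covers the rest. The choice of base estimator, the softmax parameterization, and all function names are assumptions made for illustration; the abstract does not specify them, and this is not necessarily the authors' exact construction.

```python
import math
import random


def softmax(logits):
    """Convert logits to categorical probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def score_function_term(z, f, q):
    """f(z) * gradient of log q(z) w.r.t. the logits of a softmax."""
    return [f[z] * ((1.0 if j == z else 0.0) - q[j]) for j in range(len(q))]


def exact_gradient(f, q):
    """Exact gradient of E_q[f(z)] w.r.t. the logits (tractable for small K)."""
    mean_f = sum(qz * fz for qz, fz in zip(q, f))
    return [q[j] * (f[j] - mean_f) for j in range(len(q))]


def rao_blackwellized_gradient(f, q, k, rng):
    """Sum the k most probable categories exactly; sample one from the rest.

    With k = 0 this reduces to the plain score-function estimator, and
    with k = len(q) it reduces to the exact gradient.
    """
    K = len(q)
    top = sorted(range(K), key=lambda z: q[z], reverse=True)[:k]
    top_set = set(top)
    rest = [z for z in range(K) if z not in top_set]

    grad = [0.0] * K
    # Analytic part: categories in the top-k set, weighted by their probability.
    for z in top:
        g = score_function_term(z, f, q)
        for j in range(K):
            grad[j] += q[z] * g[j]

    # Stochastic part: one draw from q restricted to the remaining categories,
    # weighted by the total probability mass of those categories.
    if rest:
        rest_mass = sum(q[z] for z in rest)
        z = rng.choices(rest, weights=[q[z] for z in rest])[0]
        g = score_function_term(z, f, q)
        for j in range(K):
            grad[j] += rest_mass * g[j]
    return grad
```

Because the sampled term is scaled by the probability mass outside the top-k set, its contribution to the variance shrinks as that mass shrinks, while the expectation of the estimator is unchanged.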

artificial intelligence, estimator, machine learning, (16 more...)

arXiv:1810.04777

Country: North America > United States > California (0.14)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.63)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.63)

Fast Black-box Variational Inference through Stochastic Trust-Region Optimization

Regier, Jeffrey, Jordan, Michael I., McAuliffe, Jon

Neural Information Processing Systems, Dec-31-2017

We introduce TrustVI, a fast second-order algorithm for black-box variational inference based on trust-region optimization and the reparameterization trick. At each iteration, TrustVI proposes and assesses a step based on minibatches of draws from the variational distribution. The algorithm provably converges to a stationary point. We implemented TrustVI in the Stan framework and compared it to two alternatives: Automatic Differentiation Variational Inference (ADVI) and Hessian-free Stochastic Gradient Variational Inference (HFSGVI). The former is based on stochastic first-order optimization. The latter uses second-order information, but lacks convergence guarantees. TrustVI typically converged at least one order of magnitude faster than ADVI, demonstrating the value of stochastic second-order information. TrustVI often found substantially better variational distributions than HFSGVI, demonstrating that our convergence theory can matter in practice.
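The "propose and assess a step" pattern mentioned in this abstract is the core of any trust-region method. The sketch below shows that pattern in its simplest form: a deterministic, one-dimensional trust-region loop with exact derivatives. It is not TrustVI: there are no minibatches, no variational objective, and no reparameterization trick, and the thresholds (0.1, 0.25, 0.75), the toy objective, and the function names are assumptions chosen for illustration.

```python
def trust_region_minimize(f, grad, hess, x, radius=1.0, tol=1e-6, max_iter=200):
    """Generic 1-D trust-region loop: propose a step, then assess it."""
    for _ in range(max_iter):
        g, h = grad(x), hess(x)
        if abs(g) < tol:
            break  # approximately stationary

        # Propose: minimize the quadratic model g*s + 0.5*h*s^2 over |s| <= radius.
        if h > 0:
            step = max(-radius, min(radius, -g / h))
        else:
            step = -radius if g > 0 else radius

        # Assess: compare the actual decrease with the decrease the model predicted.
        predicted = -(g * step + 0.5 * h * step * step)
        actual = f(x) - f(x + step)
        ratio = actual / predicted

        if ratio > 0.1:
            x += step          # the model was good enough: accept the step
        if ratio > 0.75:
            radius *= 2.0      # the model was very good: allow longer steps
        elif ratio < 0.25:
            radius *= 0.5      # the model was poor: shrink the trust region
    return x


# Toy non-convex objective (hypothetical, for illustration only).
def objective(x):
    return x**4 - 3 * x**2 + x


def objective_grad(x):
    return 4 * x**3 - 6 * x + 1


def objective_hess(x):
    return 12 * x**2 - 6


x_star = trust_region_minimize(objective, objective_grad, objective_hess, x=2.0)
```

In a stochastic setting such as the one the abstract describes, both the model and the assessment would be built from noisy estimates, which is where a convergence analysis becomes non-trivial.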

air transportation, iteration, optimization problem, (16 more...)

Industry: Transportation > Air (0.60)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.71)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.51)

Fast Black-box Variational Inference through Stochastic Trust-Region Optimization

Regier, Jeffrey, Jordan, Michael I., McAuliffe, Jon

arXiv.org Machine Learning, Nov-4-2017

air transportation, iteration, optimization problem, (16 more...)

arXiv:1706.02375

Genre: Research Report (0.83)

Industry: Transportation > Air (0.60)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.70)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.51)

Learning an Astronomical Catalog of the Visible Universe through Scalable Bayesian Inference

Regier, Jeffrey, Pamnany, Kiran, Giordano, Ryan, Thomas, Rollin, Schlegel, David, McAuliffe, Jon, Prabhat

arXiv.org Machine Learning, Nov-10-2016

Celeste is a procedure for inferring astronomical catalogs that attains state-of-the-art scientific results. To date, Celeste has been scaled to at most hundreds of megabytes of astronomical images: Bayesian posterior inference is notoriously demanding computationally. In this paper, we report on a scalable, parallel version of Celeste, suitable for learning catalogs from modern large-scale astronomical datasets. Our algorithmic innovations include a fast numerical optimization routine for Bayesian posterior inference and a statistically efficient scheme for decomposing astronomical optimization problems into subproblems. Our scalable implementation is written entirely in Julia, a new high-level dynamic programming language designed for scientific and numerical computing. We use Julia's high-level constructs for shared and distributed memory parallelism, and demonstrate effective load balancing and efficient scaling on up to 8192 Xeon cores on the NERSC Cori supercomputer.

bayesian inference, light source, optimization problem, (17 more...)

arXiv:1611.03404

Country: North America > United States > California (0.14)

Genre: Research Report (0.82)

Industry: Energy (0.88)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.86)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.65)

A Gaussian Process Model of Quasar Spectral Energy Distributions

Miller, Andrew, Wu, Albert, Regier, Jeff, McAuliffe, Jon, Lang, Dustin, Prabhat, Mr., Schlegel, David, Adams, Ryan P.

Neural Information Processing Systems, Dec-31-2015

We propose a method for combining two sources of astronomical data, spectroscopy and photometry, that carry information about sources of light (e.g., stars, galaxies, and quasars) at extremely different spectral resolutions. Our model treats the spectral energy distribution (SED) of the radiation from a source as a latent variable that jointly explains both photometric and spectroscopic observations. We place a flexible, nonparametric prior over the SED of a light source that admits a physically interpretable decomposition, and allows us to tractably perform inference. We use our model to predict the distribution of the redshift of a quasar from five-band (low spectral resolution) photometric data, the so-called "photo-z" problem. Our method shows that tools from machine learning and Bayesian statistics allow us to leverage multiple resolutions of information to make accurate predictions with well-characterized uncertainties.

artificial intelligence, machine learning, quasar, (18 more...)

Country: North America > United States > California (0.14)

Industry: Energy (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Celeste: Variational inference for a generative model of astronomical images

Regier, Jeffrey, Miller, Andrew, McAuliffe, Jon, Adams, Ryan, Hoffman, Matt, Lang, Dustin, Schlegel, David, Prabhat

arXiv.org Machine Learning, Jun-3-2015

We present a new, fully generative model of optical telescope image sets, along with a variational procedure for inference. Each pixel intensity is treated as a Poisson random variable, with a rate parameter dependent on latent properties of stars and galaxies. Key latent properties are themselves random, with scientific prior distributions constructed from large ancillary data sets. We check our approach on synthetic images. We also run it on images from a major sky survey, where it exceeds the performance of the current state-of-the-art method for locating celestial bodies and measuring their colors.
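To make the per-pixel Poisson assumption in this abstract concrete, here is a minimal sketch of a Poisson log-likelihood for a small image containing one point source. The rate at each pixel is a constant background plus the source flux spread by a point-spread function. The circular Gaussian point-spread function, the single-star scene, and every name and parameter value are assumptions for illustration; the model in the paper is considerably richer (galaxies, colors, priors over latent properties).

```python
import math


def gaussian_psf(dx, dy, sigma):
    """Circular Gaussian point-spread function (an assumed, simplified PSF)."""
    return math.exp(-(dx * dx + dy * dy) / (2 * sigma * sigma)) / (
        2 * math.pi * sigma * sigma
    )


def pixel_rates(width, height, star_x, star_y, flux, background, sigma=1.0):
    """Expected photon count per pixel: background plus PSF-spread star flux."""
    return [
        [
            background + flux * gaussian_psf(col - star_x, row - star_y, sigma)
            for col in range(width)
        ]
        for row in range(height)
    ]


def poisson_log_likelihood(counts, rates):
    """Sum over pixels of log Poisson(count | rate)."""
    total = 0.0
    for count_row, rate_row in zip(counts, rates):
        for n, lam in zip(count_row, rate_row):
            total += n * math.log(lam) - lam - math.lgamma(n + 1)
    return total
```

Given observed counts, comparing `poisson_log_likelihood` across candidate positions and fluxes shows how the latent properties of a source determine how well the image is explained; a variational procedure would optimize an approximate posterior over those properties rather than evaluate point candidates.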

artificial intelligence, galaxy, machine learning, (17 more...)

arXiv:1506.01351

Country:

North America > United States > California (0.14)
North America > United States > Texas (0.14)

Genre: Research Report (0.70)

Industry: Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.94)
Information Technology > Artificial Intelligence > Natural Language > Generation (0.61)
