Multiscale Dictionary Learning for Estimating Conditional Distributions
Petralia, Francesca, Vogelstein, Joshua, Dunson, David B.
Massive datasets are becoming an ubiquitous byproduct of modern scientific and industrial applications. These data present statistical and computational challenges because many previously developed analysis approaches do not scaleup sufficiently. Challenges arise because of the ultra high-dimensionality and relatively low sample size. Parsimonious models for such big data assume that the density in the ambient space concentrates around a lower-dimensional (possibly nonlinear) subspace. A plethora of methods are emerging to estimate such lower-dimensional subspaces [25, 2]. 1 We are interested in using such lower-dimensional embeddings to obtain estimates of the conditional distribution of some target variable(s). This conditional density estimation setting arises in a number of important application areas, including neuroscience, genetics, and video processing. For example, one might desire automated estimation of a predictive density for a neurologic phenotype of interest, such as intelligence, on the basis of available data for a patient including neuroimaging.
Dec-4-2013
- Country:
- Asia > Middle East
- Jordan (0.04)
- North America > United States
- New York > New York County
- New York City (0.04)
- North Carolina > Durham County
- Durham (0.04)
- New York > New York County
- Asia > Middle East
- Genre:
- Research Report (0.50)
- Industry:
- Health & Medicine > Therapeutic Area > Neurology (1.00)