Multiscale Dictionary Learning for Estimating Conditional Distributions

Petralia, Francesca, Vogelstein, Joshua, Dunson, David B.

Dec-4-2013–arXiv.org Machine Learning

Massive datasets are becoming an ubiquitous byproduct of modern scientific and industrial applications. These data present statistical and computational challenges because many previously developed analysis approaches do not scaleup sufficiently. Challenges arise because of the ultra high-dimensionality and relatively low sample size. Parsimonious models for such big data assume that the density in the ambient space concentrates around a lower-dimensional (possibly nonlinear) subspace. A plethora of methods are emerging to estimate such lower-dimensional subspaces [25, 2]. 1 We are interested in using such lower-dimensional embeddings to obtain estimates of the conditional distribution of some target variable(s). This conditional density estimation setting arises in a number of important application areas, including neuroscience, genetics, and video processing. For example, one might desire automated estimation of a predictive density for a neurologic phenotype of interest, such as intelligence, on the basis of available data for a patient including neuroimaging.

artificial intelligence, data mining, machine learning, (21 more...)

arXiv.org Machine Learning

Dec-4-2013

arXiv.org PDF

Add feedback

Country:
- North America > United States > North Carolina (0.14)

Genre:
- Research Report (0.50)

Industry:
- Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology:
- Information Technology
  - Data Science > Data Mining (1.00)
  - Artificial Intelligence
    - Representation & Reasoning > Uncertainty
      - Bayesian Inference (0.94)
    - Machine Learning > Learning Graphical Models
      - Directed Networks > Bayesian Learning (0.94)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found