Interpretable and Scalable Graphical Models for Complex Spatio-temporal Processes
–arXiv.org Artificial Intelligence
This thesis focuses on data that has complex spatio-temporal structure and on probabilistic graphical models that learn the structure in an interpretable and scalable manner. We target two research areas of interest: Gaussian graphical models for tensor-variate data and summarization of complex time-varying texts using topic models. This work advances the state-of-the-art in several directions. First, it introduces a new class of tensor-variate Gaussian graphical models via the Sylvester tensor equation. Second, it develops an optimization technique based on a fast-converging proximal alternating linearized minimization method, which scales tensor-variate Gaussian graphical model estimations to modern big-data settings. Third, it connects Kronecker-structured (inverse) covariance models with spatio-temporal partial differential equations (PDEs) and introduces a new framework for ensemble Kalman filtering that is capable of tracking chaotic physical systems. Fourth, it proposes a modular and interpretable framework for unsupervised and weakly-supervised probabilistic topic modeling of time-varying data that combines generative statistical models with computational geometric methods. Throughout, practical applications of the methodology are considered using real datasets. This includes brain-connectivity analysis using EEG data, space weather forecasting using solar imaging data, longitudinal analysis of public opinions using Twitter data, and mining of mental health related issues using TalkLife data. We show in each case that the graphical modeling framework introduced here leads to improved interpretability, accuracy, and scalability.
arXiv.org Artificial Intelligence
Jan-15-2023
- Country:
- South America > Paraguay
- North America
- Canada > British Columbia (0.04)
- United States
- Texas (0.04)
- Illinois (0.04)
- Michigan (0.04)
- Colorado > Denver County
- Denver (0.04)
- New Mexico > Los Alamos County
- Los Alamos (0.04)
- Virginia > Arlington County
- Arlington (0.04)
- Wisconsin > Dane County
- Madison (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- Massachusetts
- Suffolk County > Boston (0.04)
- Middlesex County > Cambridge (0.04)
- California
- San Francisco County > San Francisco (0.13)
- Ventura County (0.04)
- Solano County (0.04)
- Marin County (0.04)
- Los Angeles County > Los Angeles (0.04)
- New York > New York County
- New York City (0.04)
- Europe > Spain
- Canary Islands (0.04)
- Asia
- Middle East > Jordan (0.04)
- South Korea (0.04)
- China (0.04)
- Africa
- Mali (0.04)
- Senegal > Kolda Region
- Kolda (0.04)
- Genre:
- Overview (0.92)
- Research Report
- New Finding (1.00)
- Experimental Study (0.67)
- Industry:
- Information Technology > Services (1.00)
- Health & Medicine > Therapeutic Area
- Psychiatry/Psychology (1.00)
- Immunology (1.00)
- Infections and Infectious Diseases (0.93)
- Neurology (0.87)
- Government > Regional Government
- Technology:
- Information Technology
- Data Science > Data Mining (1.00)
- Communications > Social Media (1.00)
- Artificial Intelligence
- Systems & Languages (1.00)
- Natural Language (1.00)
- Representation & Reasoning
- Optimization (1.00)
- Mathematical & Statistical Methods (1.00)
- Uncertainty > Bayesian Inference (0.67)
- Machine Learning
- Performance Analysis > Accuracy (0.92)
- Neural Networks > Deep Learning (0.67)
- Statistical Learning > Regression (0.67)
- Learning Graphical Models > Directed Networks
- Bayesian Learning (0.67)
- Information Technology