AITopics | Page, David

A Screening Rule for l1-Regularized Ising Model Estimation

Kuang, Zhaobin, Geng, Sinong, Page, David

Neural Information Processing SystemsDec-31-2017

We discover a screening rule for l1-regularized Ising model estimation. The simple closed-form screening rule is a necessary and sufficient condition for exactly recovering the blockwise structure of a solution under any given regularization parameters. With enough sparsity, the screening rule can be combined with various optimization procedures to deliver solutions efficiently in practice. The screening rule is especially suitable for large-scale exploratory data analysis, where the number of variables in the dataset can be thousands while we are only interested in the relationship among a handful of variables within moderate-size clusters for interpretability. Experimental results on various datasets demonstrate the efficiency and insights gained from the introduction of the screening rule.

oncology, optimization problem, screening rule, (19 more...)

Neural Information Processing Systems

Country: North America > United States > Wisconsin (0.14)

Genre: Research Report (0.49)

Industry: Health & Medicine > Therapeutic Area > Oncology (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Data Science (0.86)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.68)
(3 more...)

Add feedback

An Efficient Pseudo-likelihood Method for Sparse Binary Pairwise Markov Network Estimation

Geng, Sinong, Kuang, Zhaobin, Page, David

arXiv.org Machine LearningApr-6-2017

The pseudo-likelihood method is one of the most popular algorithms for learning sparse binary pairwise Markov networks. In this paper, we formulate the $L_1$ regularized pseudo-likelihood problem as a sparse multiple logistic regression problem. In this way, many insights and optimization procedures for sparse logistic regression can be applied to the learning of discrete Markov networks. Specifically, we use the coordinate descent algorithm for generalized linear models with convex penalties, combined with strong screening rules, to solve the pseudo-likelihood problem with $L_1$ regularization. Therefore a substantial speedup without losing any accuracy can be achieved. Furthermore, this method is more stable than the node-wise logistic regression approach on unbalanced high-dimensional data when penalized by small regularization parameters. Thorough numerical experiments on simulated data and real world data demonstrate the advantages of the proposed method.

health & medicine, lr problem, optimization problem, (19 more...)

arXiv.org Machine Learning

1702.0832

Country: North America > United States (0.29)

Genre: Research Report > New Finding (0.90)

Industry: Health & Medicine (0.46)

Add feedback

Bayesian Estimation of Latently-grouped Parameters in Undirected Graphical Models

Liu, Jie, Page, David

Neural Information Processing SystemsDec-31-2013

In large-scale applications of undirected graphical models, such as social networks and biological networks, similar patterns occur frequently and give rise to similar parameters.In this situation, it is beneficial to group the parameters for more efficient learning. We show that even when the grouping is unknown, we can infer theseparameter groups during learning via a Bayesian approach. We impose a Dirichlet process prior on the parameters. Posterior inference usually involves calculating intractableterms, and we propose two approximation algorithms, namely a Metropolis-Hastings algorithm with auxiliary variables and a Gibbs sampling algorithm with"stripped" Beta approximation (Gibbs SBA). Simulations show that both algorithms outperform conventional maximum likelihood estimation (MLE). Gibbs SBA's performance is close to Gibbs sampling with exact likelihood calculation. Modelslearned with Gibbs SBA also generalize better than the models learned by MLE on real-world Senate voting data.

artificial intelligence, bayesian estimator, bayesian inference, (18 more...)

Neural Information Processing Systems

Country: North America > United States > Wisconsin > Dane County > Madison (0.14)

Industry: Information Technology (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Add feedback

A Preliminary Investigation into Predictive Models for Adverse Drug Events

Davis, Jesse (Katholieke Universiteit Leuven) | Costa, Vitor Santos (Universidade do Porto) | Peissig, Peggy (Marshfield Clinic) | Caldwell, Michael (Marshfield Clinic) | Page, David (University of Wisconsin - Madison)

AAAI ConferencesJul-9-2013

Adverse drug events are a leading cause of danger and cost in health care. We could reduce both the danger and the cost if we had accurate models to predict, at prescription time for each drug, which patients are most at risk for known adverse reactions to that drug, such as myocardial infarction (MI, or "heart attack") if given a Cox2 inhibitor, angioedema if given an ACE inhibitor, or bleeding if given an anticoagulant such as Warfarin. We address this task for the specific case of Cox2 inhibitors, a type of non-steroidal anti-inflammatory drug (NSAID) or pain reliever that is easier on the gastrointestinal system than most NSAIDS. Because of the MI adverse drug reaction, some but not all very effective Cox2 inhibitors were removed from the market. Specifically, we use machine learning to predict which patients on a Cox2 inhibitor would suffer an MI. An important issue for machine learning is that we do not know which of these patients might have suffered an MI even without the drug. To begin to make some headway on this important problem, we compare our predictive model for MI for patients on Cox2 inhibitors against a more general model for predicting MI among a broader population not on Cox2 inhibitors.

preliminary investigation

AAAI Conferences

Workshops at the Twenty-Seventh AAAI Conference on Artificial Intelligence

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.87)
Health & Medicine > Therapeutic Area > Immunology (0.53)

Technology:

Information Technology > Modeling & Simulation (0.60)
Information Technology > Data Science > Data Mining (0.60)
Information Technology > Artificial Intelligence > Machine Learning (0.44)

Add feedback

Machine Learning for Personalized Medicine: Predicting Primary Myocardial Infarction from Electronic Health Records

Weiss, Jeremy C. (University of Wisconsin-Madison) | Natarajan, Sriraam (Wake Forest University) | Peissig, Peggy L. (Marshfield Clinic Research Foundation) | McCarty, Catherine A. (Essentia Institute of Rural Health) | Page, David (University of Wisconsin-Madison)

AI MagazineDec-31-2012

Electronic health records (EHRs) are an emerging relational domain with large potential to improve clinical outcomes. We apply two statistical relational learning (SRL) algorithms to the task of predicting primary myocardial infarction. We show that one SRL algorithm, relational functional gradient boosting, outperforms propositional learners particularly in the medically-relevant high recall region. We observe that both SRL algorithms predict outcomes better than their propositional analogs and suggest how our methods can augment current epidemiological practices.

health safety security environment and social responsibility, Health & Medicine, primary myocardial infarction, (9 more...)

AI Magazine

Industry: Health & Medicine > Health Care Technology > Medical Record (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.79)

Add feedback

Machine Learning for Personalized Medicine: Predicting Primary Myocardial Infarction from Electronic Health Records

Weiss, Jeremy C. (University of Wisconsin-Madison) | Natarajan, Sriraam (Wake Forest University) | Peissig, Peggy L. (Marshfield Clinic Research Foundation) | McCarty, Catherine A. (Essentia Institute of Rural Health) | Page, David (University of Wisconsin-Madison)

AI MagazineDec-31-2012

Electronic health records (EHRs) are an emerging relational domain with large potential to improve clinical outcomes. We apply two statistical relational learning (SRL) algorithms to the task of predicting primary myocardial infarction. We show that one SRL algorithm, relational functional gradient boosting, outperforms propositional learners particularly in the medically-relevant high recall region. We observe that both SRL algorithms predict outcomes better than their propositional analogs and suggest how our methods can augment current epidemiological practices.

algorithm, cardiology, vascular disease, (22 more...)

AI Magazine

Country:

North America > United States > Wisconsin (0.15)
North America > United States > Minnesota (0.14)
Europe > United Kingdom > Scotland (0.14)

Genre: Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Health Care Technology > Medical Record (1.00)
Health & Medicine > Epidemiology (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.75)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.68)

Add feedback

Multiplicative Forests for Continuous-Time Processes

Weiss, Jeremy, Natarajan, Sriraam, Page, David

Neural Information Processing SystemsDec-31-2012

Learning temporal dependencies between variables over continuous time is an important and challenging task. Continuous-time Bayesian networks effectively model such processes but are limited by the number of conditional intensity matrices, which grows exponentially in the number of parents per variable. We develop a partition-based representation using regression trees and forests whose parameter spaces grow linearly in the number of node splits. Using a multiplicative assumption we show how to update the forest likelihood in closed form, producing efficient model updates. Our results show multiplicative forests can be learned from few temporal trajectories with large gains in performance and scalability.

cardiology, ctbn, vascular disease, (21 more...)

Neural Information Processing Systems

Country: North America > United States > Wisconsin > Dane County > Madison (0.14)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.69)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.50)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.50)

Add feedback

CLP(BN): Constraint Logic Programming for Probabilistic Knowledge

Costa, Vitor Santos, Page, David, Qazi, Maleeha, Cussens, James

arXiv.org Artificial IntelligenceOct-19-2012

We present CLP(BN), a novel approach that aims at expressing Bayesian networks through the constraint logic programming framework. Arguably, an important limitation of traditional Bayesian networks is that they are propositional, and thus cannot represent relations between multiple similar objects in multiple contexts. Several researchers have thus proposed first-order languages to describe such networks. Namely, one very successful example of this approach are the Probabilistic Relational Models (PRMs), that combine Bayesian networks with relational database technology. The key difficulty that we had to address when designing CLP(cal{BN}) is that logic based representations use ground terms to denote objects. With probabilitic data, we need to be able to uniquely represent an object whose value we are not sure about. We use {sl Skolem functions} as unique new symbols that uniquely represent objects with unknown value. The semantics of CLP(cal{BN}) programs then naturally follow from the general framework of constraint logic programming, as applied to a specific domain where we have probabilistic data. This paper introduces and defines CLP(cal{BN}), and it describes an implementation and initial experiments. The paper also shows how CLP(cal{BN}) relates to Probabilistic Relational Models (PRMs), Ngo and Haddawys Probabilistic Logic Programs, AND Kersting AND De Raedts Bayesian Logic Programs.

bayesian inference, clp, logic programming, (16 more...)

arXiv.org Artificial Intelligence

1212.2519

Country:

North America > United States > Wisconsin (0.14)
Asia > Japan > Honshū (0.14)

Add feedback

Identifying Adverse Drug Events by Relational Learning

Page, David (University of Wisconsin-Madison) | Costa, Vitor Santos (CRACS-INESC TEC and FCUP) | Natarajan, Sriraam (Wake Forest University) | Barnard, Aubrey (University of Wisconsin-Madison) | Peissig, Peggy (Marshfield Clinic Research Foundation) | Caldwell, Michael (Marshfield Clinic)

AAAI ConferencesJul-21-2012

The pharmaceutical industry, consumer protection groups, users of medications and government oversight agencies are all strongly interested in identifying adverse reactions to drugs. While a clinical trial of a drug may use only a thousand patients, once a drug is released on the market it may be taken by millions of patients. As a result, in many cases adverse drug events (ADEs) are observed in the broader population that were not identiﬁed during clinical trials. Therefore, there is a need for continued, postmarketing surveillance of drugs to identify previously-unanticipated ADEs. This paper casts this problem as a reverse machine learning task, related to relational subgroup discovery and provides an initial evaluation of this approach based on experiments with an actual EMR/EHR and known adverse drug events.

diagnosis, us government, vascular disease, (16 more...)

AAAI Conferences

Twenty-Sixth AAAI Conference on Artificial Intelligence

Country: North America > United States > Wisconsin (0.14)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.87)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.95)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Unachievable Region in Precision-Recall Space and Its Effect on Empirical Evaluation

Boyd, Kendrick, Costa, Vitor Santos, Davis, Jesse, Page, David

arXiv.org Artificial IntelligenceJul-18-2012

Precision-recall (PR) curves and the areas under them are widely used to summarize machine learning results, especially for data sets exhibiting class skew. They are often used analogously to ROC curves and the area under ROC curves. It is known that PR curves vary as class skew changes. What was not recognized before this paper is that there is a region of PR space that is completely unachievable, and the size of this region depends only on the skew. This paper precisely characterizes the size of that region and discusses its implications for empirical evaluation methodology in machine learning.

health & medicine, oncology, unachievable region, (20 more...)

arXiv.org Artificial Intelligence

1206.4667

Country:

Europe > United Kingdom > England (0.28)
North America > United States > Wisconsin > Dane County > Madison (0.14)

Industry: Health & Medicine (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Add feedback

Filters

Collaborating Authors

Page, David

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

A Screening Rule for l1-Regularized Ising Model Estimation

An Efficient Pseudo-likelihood Method for Sparse Binary Pairwise Markov Network Estimation

Bayesian Estimation of Latently-grouped Parameters in Undirected Graphical Models

A Preliminary Investigation into Predictive Models for Adverse Drug Events

Machine Learning for Personalized Medicine: Predicting Primary Myocardial Infarction from Electronic Health Records

Machine Learning for Personalized Medicine: Predicting Primary Myocardial Infarction from Electronic Health Records

Multiplicative Forests for Continuous-Time Processes

CLP(BN): Constraint Logic Programming for Probabilistic Knowledge

Identifying Adverse Drug Events by Relational Learning

Unachievable Region in Precision-Recall Space and Its Effect on Empirical Evaluation