AITopics | Bayesian Learning

Bayesian Learning

A Bayesian network, Bayes network, belief network, Bayes(ian) model or probabilistic directed acyclic graphical model is a probabilistic graphical model (a type of statistical model) that represents a set of variables and their conditional dependencies via a directed acyclic graph (DAG). (Wikipedia)
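As a minimal illustration of the definition above, the sketch below encodes a small DAG and computes joint and posterior probabilities from its conditional probability tables. The variables (Rain, Sprinkler, WetGrass), the graph structure, and all numbers are invented for illustration and do not come from any item on this page.

```python
from itertools import product

# Hypothetical DAG: Rain -> Sprinkler, Rain -> WetGrass, Sprinkler -> WetGrass.
parents = {"Rain": (), "Sprinkler": ("Rain",), "WetGrass": ("Sprinkler", "Rain")}

# Conditional probability tables: P(var = True | parent values). Made-up numbers.
cpt = {
    "Rain": {(): 0.2},
    "Sprinkler": {(True,): 0.01, (False,): 0.4},
    "WetGrass": {(True, True): 0.99, (True, False): 0.9,
                 (False, True): 0.8, (False, False): 0.0},
}

def joint(assignment):
    """P(assignment) as the product of each variable's conditional given its parents."""
    prob = 1.0
    for var, pars in parents.items():
        p_true = cpt[var][tuple(assignment[p] for p in pars)]
        prob *= p_true if assignment[var] else 1.0 - p_true
    return prob

def posterior_rain_given_wet():
    """P(Rain = True | WetGrass = True) by brute-force enumeration."""
    num = den = 0.0
    for rain, sprinkler in product([True, False], repeat=2):
        p = joint({"Rain": rain, "Sprinkler": sprinkler, "WetGrass": True})
        den += p
        if rain:
            num += p
    return num / den
```

The factorization in `joint` is what the directed acyclic graph buys: each variable needs a table over its parents only, not over every other variable. Enumeration is only practical for tiny networks like this one.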

Wireless Traffic Prediction with Scalable Gaussian Process: Framework, Algorithms, and Verification

Xu, Yue, Yin, Feng, Xu, Wenjun, Lin, Jiaru, Cui, Shuguang

arXiv.org Machine Learning, Feb-13-2019

The cloud radio access network (C-RAN) is a promising paradigm to meet the stringent requirements of fifth generation (5G) wireless systems. Wireless traffic prediction is a key enabler for C-RANs to improve both spectrum efficiency and energy efficiency through load-aware network management. This paper proposes a scalable Gaussian process (GP) framework as a promising solution for large-scale wireless traffic prediction in a cost-efficient manner. First, to the best of our knowledge, this paper is the first to combine GP regression with the alternating direction method of multipliers (ADMM) for parallel hyper-parameter optimization in the training phase; this scalable training framework balances local estimation in the baseband units (BBUs) against information consensus among BBUs in a principled way for large-scale execution. Second, in the prediction phase, we fuse local predictions obtained from the BBUs via a cross-validation based optimal strategy, which proves reliable and robust for general regression tasks. Moreover, this fusion strategy is built upon a well-acknowledged probabilistic model and so retains the valuable closed-form GP inference properties. Third, we propose a C-RAN based scalable wireless prediction architecture, in which prediction accuracy and time consumption can be balanced by tuning the number of BBUs according to real-time system demands. Experimental results show that our proposed scalable GP model considerably outperforms state-of-the-art approaches in wireless traffic prediction performance.

I. INTRODUCTION

The fifth generation (5G) system is expected to provide approximately 1000 times higher wireless capacity and reduce up to 90 percent of energy consumption compared with the current 4G system [1].
A C-RAN is composed of two parts: the distributed remote radio heads (RRHs), which have basic radio functionality and provide coverage over a large area, and the centralized baseband unit (BBU) pool, whose parallel BBUs support joint processing and cooperative network management. The BBUs can perform dynamic resource allocation in accordance with real-time network demands, based on the virtualized resources in cloud computing. One major feature that lets C-RANs deliver highly energy-efficient services is their fast adaptability to nonuniform traffic variations [1]-[4], e.g., the tidal effects. Consequently, wireless traffic prediction techniques stand out as the key enabler for such load-aware management and proactive control in C-RANs, e.g., the load-aware RRH on/off operation [4].
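The prediction phase described in the abstract (closed-form GP inference in each BBU, followed by fusion of the local predictions) can be sketched roughly as follows. This is a simplified stand-in, not the paper's method: hyper-parameters are fixed rather than trained with ADMM, and inverse-variance weighting is swapped in for the cross-validation based fusion, whose details are not given here. The function names, kernel choice, and toy data are all assumptions.

```python
import math

def rbf(a, b, length_scale=1.0):
    """Squared-exponential kernel between two scalars."""
    return math.exp(-((a - b) ** 2) / (2.0 * length_scale ** 2))

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x

def gp_predict(X, y, x_star, length_scale=1.0, noise=0.1):
    """Closed-form GP posterior mean and variance at x_star (zero prior mean)."""
    n = len(X)
    K = [[rbf(X[i], X[j], length_scale) + (noise ** 2 if i == j else 0.0)
          for j in range(n)] for i in range(n)]
    k_star = [rbf(x, x_star, length_scale) for x in X]
    alpha = solve(K, y)        # (K + noise^2 I)^{-1} y
    v = solve(K, k_star)       # (K + noise^2 I)^{-1} k_*
    mean = sum(k * a for k, a in zip(k_star, alpha))
    var = rbf(x_star, x_star, length_scale) - sum(k * w for k, w in zip(k_star, v))
    return mean, max(var, 1e-12)   # guard against tiny negative round-off

def fused_predict(X, y, x_star, n_experts=2, **kw):
    """Split data across local experts, then fuse with inverse-variance weights
    (an assumed heuristic, not the paper's cross-validation based fusion)."""
    preds = [gp_predict(X[i::n_experts], y[i::n_experts], x_star, **kw)
             for i in range(n_experts)]
    weights = [1.0 / var for _, var in preds]
    total = sum(weights)
    return sum(w * m for w, (m, _) in zip(weights, preds)) / total

X = [0.5 * i for i in range(20)]      # toy inputs (stand-in for time stamps)
y = [math.sin(x) for x in X]          # toy targets (stand-in for traffic load)
estimate = fused_predict(X, y, 2.25, n_experts=2)
```

Each local expert solves a linear system on its own share of the data, which is where the cost saving comes from: exact GP inference scales cubically in the number of training points, so smaller subsets are much cheaper. Raising `n_experts` trades accuracy for speed, in the spirit of the tunable number of BBUs mentioned in the abstract.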

bbus, gp model, prediction, (13 more...)

arXiv.org Machine Learning

1902.04763

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > Trinidad and Tobago > Trinidad > Arima > Arima (0.04)
North America > United States > Alaska > Anchorage Municipality > Anchorage (0.04)
(15 more...)

Genre:

Research Report > Promising Solution (0.68)
Research Report > New Finding (0.48)

Industry: Telecommunications (1.00)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
(4 more...)

Computer-Based Medical Consultations: MYCIN

AI Classics, Feb-12-2019, 23:10:31 GMT

This book has been adapted in large part from the author's doctoral thesis [Shortliffe, 1974b]. Portions of the work appeared previously in Computers and Biomedical Research [Shortliffe, 1973, 1975b], Mathematical Biosciences [Shortliffe, 1975a], and the Proceedings of the Thirteenth San Diego Biomedical Symposium [Shortliffe, 1974a]. To Stanford's Medical Scientist Training Program, which is supported by the National Institutes of Health.

columbia university, diagnostic medicine, relx group plc, (74 more...)

AI Classics

Country:

Europe (1.00)
North America > United States > California > San Francisco County > San Francisco (0.28)
North America > United States > California > Santa Clara County (0.27)

Genre:

Research Report (1.20)
Overview (1.00)
Summary/Review (1.00)
Instructional Material > Course Syllabus & Notes (1.00)

Industry:

Government > Regional Government > North America Government (1.20)
Government > Regional Government > North America Government > United States Government (1.20)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.02)
(10 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.47)
Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (1.46)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.01)
(4 more...)

Readings in Medical Artificial Intelligence

AI Classics, Feb-12-2019, 22:35:30 GMT

JANICE S. AIKINS Dr. Aikins received her Ph.D. in computer science from Stanford University in 1980. She is currently a research computer scientist at IBM's Palo Alto Scientific Center. She specializes in designing systems with an emphasis on the explicit representation of control knowledge in expert systems.

ROBERT L. BLUM Dr. Blum received his M.D. from the University of California Medical School at San Francisco in 1973. From 1973 to 1976 he did an internship and residency in the Department of Internal Medicine at the Kaiser Foundation Hospital in Oakland, California, where he was chief resident in 1976.

diagnostic medicine, university of pittsburgh, university of wisconsin, (98 more...)

AI Classics

Country:

North America > United States > California > Santa Clara County (0.34)
North America > United States > California > San Francisco County > San Francisco (0.34)
North America > United States > California > Alameda County (0.33)

Genre:

Research Report > Experimental Study (1.00)
Summary/Review (1.00)
Research Report > New Finding (1.00)
(4 more...)

Industry:

Health & Medicine > Therapeutic Area > Internal Medicine (1.22)
Health & Medicine > Health Care Providers & Services (1.21)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.01)
(23 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.26)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.21)
Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (1.03)
(15 more...)

RULE-BASED EXPERT SYSTEMS

AI Classics, Feb-12-2019, 22:33:16 GMT

Edward H. Shortliffe Chapter 6 Details of the Revised Therapy Algorithm

diagnostic medicine, programming languages/compilers, university of pittsburgh, (140 more...)

AI Classics

Country:

Europe (1.00)
Asia (1.00)
North America > Canada (0.67)
North America > United States > California > Santa Clara County > Stanford (0.27)

Genre:

Instructional Material > Course Syllabus & Notes (1.00)
Research Report > Experimental Study (1.00)
Summary/Review (1.00)
(6 more...)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.02)
Health & Medicine > Therapeutic Area > Immunology (1.01)
Health & Medicine > Pharmaceuticals & Biotechnology (1.01)
(29 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.82)
Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (1.79)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Differential Description Length for Hyperparameter Selection in Machine Learning

Host-Madsen, Anders, Abolfazli, Mojtaba, Zhang, June

arXiv.org Machine Learning, Feb-12-2019

This paper introduces a new method for model selection and, more generally, hyperparameter selection in machine learning. The paper first proves a relationship between generalization error and a difference of description lengths of the training data; we call this difference differential description length (DDL). This allows prediction of generalization error from the training data alone by performing encoding of the training data. This can then be used for model selection by choosing the model that has the smallest predicted generalization error. We show how this encoding can be done for linear regression and neural networks. We provide experiments showing that this leads to smaller generalization error than cross-validation and traditional MDL and Bayes methods.

codelength, description length, generalization error, (14 more...)

arXiv.org Machine Learning

1902.04699

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Colorado (0.04)
North America > United States > New Jersey > Mercer County > Princeton (0.04)
(2 more...)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.47)

Thompson Sampling with Information Relaxation Penalties

Min, Seungki, Maglaras, Costis, Moallemi, Ciamac C.

arXiv.org Machine Learning, Feb-12-2019

We consider a finite time horizon multi-armed bandit (MAB) problem in a Bayesian framework, for which we develop a general set of control policies that leverage ideas from information relaxations of stochastic dynamic optimization problems. In crude terms, an information relaxation allows the decision maker (DM) to have access to the future (unknown) rewards and incorporate them in her optimization problem to pick an action at time $t$, but penalizes the decision maker for using this information. In our setting, the future rewards allow the DM to better estimate the unknown mean reward parameters of the multiple arms, and optimize her sequence of actions. By picking different information penalties, the DM can construct a family of policies of increasing complexity that, for example, includes Thompson Sampling and the true optimal (but intractable) policy as special cases. We systematically develop this framework of information relaxation sampling, propose an intuitive family of control policies for our motivating finite time horizon Bayesian MAB problem, and prove associated structural results and performance bounds. Numerical experiments suggest that this new class of policies performs well, in particular in settings where the finite time horizon introduces significant tension in the problem. Finally, inspired by the finite time horizon Gittins index, we propose an index policy that builds on our framework and, in particular, outperforms the state-of-the-art algorithms in our numerical experiments.
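For orientation, the abstract names Thompson Sampling as a special case of the proposed family. A minimal sketch of standard Thompson Sampling for a Bernoulli bandit with Beta priors might look like the following. It does not implement the paper's information relaxation penalties, and the function name, priors, and arm means are assumptions made for illustration.

```python
import random

def thompson_sampling(true_means, horizon, seed=0):
    """Run Beta-Bernoulli Thompson Sampling for `horizon` rounds.

    Returns the total reward and the number of pulls per arm.
    """
    rng = random.Random(seed)
    k = len(true_means)
    successes = [1] * k   # Beta(1, 1) uniform prior per arm (assumed)
    failures = [1] * k
    pulls = [0] * k
    total_reward = 0
    for _ in range(horizon):
        # Sample a plausible mean for each arm from its posterior,
        # then play the arm whose sample is largest.
        samples = [rng.betavariate(successes[a], failures[a]) for a in range(k)]
        arm = max(range(k), key=lambda a: samples[a])
        reward = 1 if rng.random() < true_means[arm] else 0
        successes[arm] += reward
        failures[arm] += 1 - reward
        pulls[arm] += 1
        total_reward += reward
    return total_reward, pulls
```

Note that this baseline ignores how many rounds remain: it samples the same way in the first round and the last. The finite-horizon tension mentioned in the abstract is exactly what such a horizon-agnostic policy cannot account for.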

inner problem, irs, penalty function, (11 more...)

arXiv.org Machine Learning

1902.04251

Genre: Research Report (0.70)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.88)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)

Maximum Likelihood Estimation for Learning Populations of Parameters

Vinayak, Ramya Korlakai, Kong, Weihao, Valiant, Gregory, Kakade, Sham M.

arXiv.org Machine Learning, Feb-12-2019

Consider a setting with $N$ independent individuals, each with an unknown parameter $p_i \in [0, 1]$ drawn from some unknown distribution $P^\star$. After observing the outcomes of $t$ independent Bernoulli trials per individual, i.e., $X_i \sim \text{Binomial}(t, p_i)$, our objective is to accurately estimate $P^\star$. This problem arises in numerous domains, including the social sciences, psychology, health care, and biology, where the size of the population under study is usually large while the number of observations per individual is often limited. Our main result shows that, in the regime where $t \ll N$, the maximum likelihood estimator (MLE) is both statistically minimax optimal and efficiently computable. Precisely, for sufficiently large $N$, the MLE achieves the information theoretic optimal error bound of $\mathcal{O}(\frac{1}{t})$ for $t < c\log{N}$, with respect to the earth mover's distance (between the estimated and true distributions). More generally, in an exponentially large interval of $t$ beyond $c \log{N}$, the MLE achieves the minimax error bound of $\mathcal{O}(\frac{1}{\sqrt{t\log N}})$. In contrast, regardless of how large $N$ is, the naive "plug-in" estimator for this problem only achieves the sub-optimal error of $\Theta(\frac{1}{\sqrt{t}})$.
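To make the two estimators in the abstract concrete, here is a rough sketch under simplifying assumptions. The plug-in estimator is the empirical distribution of $X_i / t$. The MLE is approximated by restricting $P^\star$ to a fixed grid on $[0, 1]$ and fitting the mixture weights with EM. The grid restriction, the use of EM, and all function names are choices made for this illustration, not the paper's procedure.

```python
import random

def simulate(n_individuals, t, draw_p, seed=0):
    """Draw p_i from a user-supplied sampler, then X_i ~ Binomial(t, p_i)."""
    rng = random.Random(seed)
    counts = []
    for _ in range(n_individuals):
        p = draw_p(rng)
        counts.append(sum(rng.random() < p for _ in range(t)))
    return counts

def plug_in(counts, t):
    """Naive estimator: the empirical distribution of X_i / t (sorted atoms)."""
    return sorted(x / t for x in counts)

def grid_mle(counts, t, grid_size=51, iters=200):
    """EM for mixture weights over a fixed grid of candidate p values."""
    grid = [j / (grid_size - 1) for j in range(grid_size)]
    # Histogram of observations: h[x] = number of individuals with X_i = x.
    h = [0] * (t + 1)
    for x in counts:
        h[x] += 1
    # Likelihood of count x at grid point p. The binomial coefficient is
    # omitted because it cancels in the EM responsibilities.
    lik = [[(p ** x) * ((1 - p) ** (t - x)) for p in grid] for x in range(t + 1)]
    w = [1.0 / grid_size] * grid_size
    n = len(counts)
    for _ in range(iters):
        new_w = [0.0] * grid_size
        for x in range(t + 1):
            if h[x] == 0:
                continue
            denom = sum(wj * l for wj, l in zip(w, lik[x]))
            for j in range(grid_size):
                new_w[j] += h[x] * w[j] * lik[x][j] / denom
        w = [v / n for v in new_w]
    return grid, w

# Toy population (assumed): half the individuals have p = 0.25, half p = 0.75.
counts = simulate(300, 5, lambda rng: rng.choice([0.25, 0.75]))
grid, weights = grid_mle(counts, 5)
```

With only `t = 5` trials per individual, the plug-in estimate can place mass only on multiples of 1/5, however many individuals are observed. The mixture fit is not tied to those atoms, which gives some intuition for why pooling information across individuals can help.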

bernstein polynomial, coefficient, polynomial, (17 more...)

arXiv.org Machine Learning

1902.04553

Country:

North America > Canada > Ontario > Toronto (0.28)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
Europe > Ukraine > Kharkiv Oblast > Kharkiv (0.04)

Genre: Research Report > New Finding (0.34)

Industry: Health & Medicine (0.87)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

10 Machine Learning Algorithms You need to Know – Towards Data Science

#artificialintelligence, Feb-11-2019, 10:10:21 GMT

We are at the start of a revolutionary era driven by developments in data analytics, large computing power, and cloud computing. Machine learning will have a major role in it, and the brains behind machine learning are its algorithms. This article covers the 10 most popular machine learning algorithms currently in use. These algorithms fall into 3 main categories. The following algorithms are covered in this article.

algorithm, artificial intelligence, machine learning, (16 more...)

#artificialintelligence

Genre: Research Report (0.99)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.37)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.34)

A Machine Learning based Robust Prediction Model for Real-life Mobile Phone Data

Sarker, Iqbal H.

arXiv.org Machine Learning, Feb-11-2019

Real-life mobile phone data may contain noisy instances, which is a fundamental issue when building a prediction model and has many potential negative consequences. The complexity of the inferred model may increase, overfitting may arise, and the overall prediction accuracy of the model may thereby decrease. In this paper, we address these issues and present a robust prediction model for real-life mobile phone data of individual users, in order to improve prediction accuracy. In our robust model, we first identify and eliminate the noisy instances from the training dataset by determining a dynamic noise threshold using a naive Bayes classifier and a Laplace estimator; the threshold may differ from user to user according to their unique behavioral patterns. We then apply the most popular rule-based machine learning classification technique, the decision tree, to the noise-free dataset to build the prediction model. Experimental results on real-life mobile phone datasets (e.g., phone call logs) of individual mobile phone users show the effectiveness of our robust model in terms of precision, recall, and F-measure.
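The noise-filtering step described above can be sketched as follows: a categorical naive Bayes classifier with Laplace (add-one) smoothing scores each training instance, and instances whose own label receives a low posterior probability are dropped. The fixed `threshold` is a placeholder for the paper's dynamic, per-user threshold, whose rule is not given here. The feature names and toy data are invented, and the subsequent decision tree step is left out.

```python
from collections import Counter, defaultdict

def train_naive_bayes(rows, labels):
    """Collect the counts needed for a categorical naive Bayes classifier."""
    class_counts = Counter(labels)
    # feature_counts[(feature_index, label)][value] = number of occurrences
    feature_counts = defaultdict(Counter)
    values = defaultdict(set)   # distinct values seen for each feature
    for row, label in zip(rows, labels):
        for i, v in enumerate(row):
            feature_counts[(i, label)][v] += 1
            values[i].add(v)
    return class_counts, feature_counts, values

def posterior(row, model):
    """P(label | row) with Laplace (add-one) smoothing of each conditional."""
    class_counts, feature_counts, values = model
    total = sum(class_counts.values())
    scores = {}
    for label, n_label in class_counts.items():
        score = n_label / total
        for i, v in enumerate(row):
            score *= (feature_counts[(i, label)][v] + 1) / (n_label + len(values[i]))
        scores[label] = score
    norm = sum(scores.values())
    return {label: s / norm for label, s in scores.items()}

def drop_noisy(rows, labels, threshold=0.3):
    """Keep instances whose own label has posterior >= threshold (assumed rule)."""
    model = train_naive_bayes(rows, labels)
    return [(r, y) for r, y in zip(rows, labels)
            if posterior(r, model)[y] >= threshold]

# Toy call log (hypothetical): features are (time of day, location).
rows = [("morning", "office")] * 5 + [("evening", "home")] * 5 + [("morning", "office")]
labels = ["reject"] * 5 + ["accept"] * 5 + ["accept"]
kept = drop_noisy(rows, labels)
```

In this toy log the last instance contradicts the five other morning/office calls, so its label gets a low posterior and it is filtered out. Laplace smoothing keeps a single unseen feature value from forcing a posterior to exactly zero.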

mobile phone data, prediction model, probability, (13 more...)

arXiv.org Machine Learning

1902.07588

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
Oceania > Australia > Victoria > Melbourne (0.14)
North America > United States > New York (0.05)
(21 more...)

Genre: Research Report (0.64)

Industry:

Telecommunications (0.93)
Information Technology (0.93)
Materials > Metals & Mining (0.67)

Technology:

Information Technology > Communications > Mobile (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis (1.00)
(2 more...)

Cyclical Stochastic Gradient MCMC for Bayesian Deep Learning

Zhang, Ruqi, Li, Chunyuan, Zhang, Jianyi, Chen, Changyou, Wilson, Andrew Gordon

arXiv.org Machine Learning, Feb-11-2019

The posteriors over neural network weights are high dimensional and multimodal. Each mode typically characterizes a meaningfully different representation of the data. We develop Cyclical Stochastic Gradient MCMC (SG-MCMC) to automatically explore such distributions. In particular, we propose a cyclical stepsize schedule, where larger steps discover new modes, and smaller steps characterize each mode. We prove that our proposed learning rate schedule provides faster convergence to samples from a stationary distribution than SG-MCMC with standard decaying schedules. Moreover, we provide extensive experimental results to demonstrate the effectiveness of cyclical SG-MCMC in learning complex multimodal distributions, especially for fully Bayesian inference with modern deep neural networks.
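The cyclical stepsize idea (large steps at the start of each cycle to find new modes, small steps at the end to characterize the current one) can be sketched with a cosine schedule that restarts every cycle. The abstract does not state the schedule's formula, so the cosine form, the function name, and the parameter values below are assumptions made for illustration.

```python
import math

def cyclical_stepsize(k, total_iters, n_cycles, initial_stepsize):
    """Cosine stepsize at iteration k (1-indexed), restarting at each cycle."""
    cycle_len = math.ceil(total_iters / n_cycles)
    position = ((k - 1) % cycle_len) / cycle_len   # fraction of cycle done, in [0, 1)
    return 0.5 * initial_stepsize * (math.cos(math.pi * position) + 1.0)

# Example: 100 iterations split into 4 cycles of 25 steps each.
schedule = [cyclical_stepsize(k, total_iters=100, n_cycles=4, initial_stepsize=0.1)
            for k in range(1, 101)]
```

Within each cycle the stepsize decays from `initial_stepsize` toward zero, then jumps back up at the next cycle boundary. A sampler using such a schedule would typically treat the early part of a cycle as exploration and collect samples only near the end, when the steps are small.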

algorithm, cyclical stochastic gradient mcmc, sg-mcmc, (12 more...)

arXiv.org Machine Learning

1902.03932

Country:

North America > United States > New York (0.04)
Europe > France > Occitanie > Haute-Garonne > Toulouse (0.04)
Asia > Afghanistan > Parwan Province > Charikar (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.73)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.68)
