AITopics | Undirected Networks

Collaborating Authors

Undirected Networks

News Overviews Instructional Materials AI-Alerts Classics

POMCPOW: An online algorithm for POMDPs with continuous state, action, and observation spaces

arXiv.org Artificial IntelligenceDec-26-2017

Online solvers for partially observable Markov decision processes have been applied to problems with large discrete state spaces, but continuous state, action, and observation spaces remain a challenge. This paper begins by investigating double progressive widening (DPW) as a solution to this challenge. However, we prove that this modification alone is not sufficient because the belief representations in the search tree collapse to a single particle causing the algorithm to converge to a policy that is suboptimal regardless of the computation time. The main contribution of the paper is to propose a new algorithm, POMCPOW, that incorporates DPW and weighted particle filtering to overcome this deficiency and attack continuous problems. Simulation results show that these modifications allow the algorithm to be successful where previous approaches fail.

artificial intelligence, machine learning, observation space, (18 more...)

arXiv.org Artificial Intelligence

1709.06196

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Add feedback

Diagnosing early-stage cervical cancer using artificial intelligence

#artificialintelligenceDec-24-2017, 16:50:39 GMT

Using an artificial intelligence-based algorithm that uses scattered light data from tissues, researchers from IISER Kolkata and IIT Kanpur have been able to differentiate normal and precancerous tissue, and even identify the different stages of progression of the disease within a few minutes and with great accuracy. In vivo studies are now being carried out. The morphology of healthy and precancerous cervical tissue sites are quite different, and light that gets scattered from these tissues varies accordingly. Yet, it is difficult to discern with naked eyes the subtle differences in the scattered light characteristics of normal and precancerous tissue. Now, an artificial intelligence-based algorithm developed by a team of researchers from Indian Institute of Science Education and Research (IISER) Kolkata and Indian Institute of Technology (IIT) Kanpur makes this possible.

algorithm, artificial intelligence, machine learning, (14 more...)

#artificialintelligence

Country: Asia > India > West Bengal > Kolkata (0.52)

Genre: Research Report (0.77)

Industry: Health & Medicine > Therapeutic Area > Oncology > Cervical Cancer (0.42)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.32)

Add feedback

A Zero-Math Introduction to Markov Chain Monte Carlo Methods

@machinelearnbotDec-24-2017, 07:20:20 GMT

So, what are Markov chain Monte Carlo (MCMC) methods? In this article, I will explain that short answer, without any math. A parameter of interest is just some number that summarizes a phenomenon we're interested in. In general we use statistics to estimate parameters. For example, if we want to learn about the height of human adults, our parameter of interest might be average height in in inches.

artificial intelligence, machine learning, posterior distribution, (12 more...)

@machinelearnbot

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.64)

Add feedback

On Statistical Optimality of Variational Bayes

Pati, Debdeep, Bhattacharya, Anirban, Yang, Yun

arXiv.org Machine LearningDec-24-2017

Variational inference [25, 7, 40] is now a well-established tool to approximate intractable posterior distributions in hierarchical multi-layered Bayesian models. The traditional Markov chain Monte Carlo (MCMC; [17]) approach of approximating distributions with intractable normalizing constants draws (correlated) samples according to a discrete-time Markov chain whose stationary distribution is the target distribution. Despite their success and popularity, MCMC methods can be slow to converge and lack scalability in big data problems and/or problems involving very many latent variables, which has fueled search for alternatives. In contrast to the sampling approach of MCMC, variational inference approaches the problem from an optimization viewpoint. First, a class of analytically tractable distributions, referred to as the variational family, is identified for the problem at hand. For example, in mean-field approximation, the set of parameters and latent variables is divided into blocks and the variational distribution is assumed to be independent across blocks.

artificial intelligence, bayesian inference, machine learning, (19 more...)

arXiv.org Machine Learning

1712.08983

Country: North America > United States (0.46)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.86)

Add feedback

Unsupervised Learning Course Web Page

@machinelearnbotDec-23-2017, 08:55:14 GMT

Aims: This course provides students with an in-depth introduction to statistical modelling and unsupervised learning techniques. It presents probabilistic approaches to modelling and their relation to coding theory and Bayesian statistics. A variety of latent variable models will be covered including mixture models (used for clustering), dimensionality reduction methods, time series models such as hidden Markov models which are used in speech recognition and bioinformatics, independent components analysis, hierarchical models, and nonlinear models. The course will present the foundations of probabilistic graphical models (e.g. We will cover Markov chain Monte Carlo sampling methods and variational approximations for inference. Time permitting, students will also learn about other topics in machine learning.

artificial intelligence, machine learning, student

@machinelearnbot

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Industry: Education (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Add feedback

Non-convex Optimization for Machine Learning

Jain, Prateek, Kar, Purushottam

arXiv.org Machine LearningDec-21-2017

A vast majority of machine learning algorithms train their models and perform inference by solving optimization problems. In order to capture the learning and prediction problems accurately, structural constraints such as sparsity or low rank are frequently imposed or else the objective itself is designed to be a non-convex function. This is especially true of algorithms that operate in high-dimensional spaces or that train non-linear models such as tensor models and deep networks. The freedom to express the learning problem as a non-convex optimization problem gives immense modeling power to the algorithm designer, but often such problems are NP-hard to solve. A popular workaround to this has been to relax non-convex problems to convex ones and use traditional methods to solve the (convex) relaxed optimization problems. However this approach may be lossy and nevertheless presents significant challenges for large scale optimization. On the other hand, direct approaches to non-convex optimization have met with resounding success in several domains and remain the methods of choice for the practitioner, as they frequently outperform relaxation-based techniques - popular heuristics include projected gradient descent and alternating minimization. However, these are often poorly understood in terms of their convergence and other properties. This monograph presents a selection of recent advances that bridge a long-standing gap in our understanding of these heuristics. The monograph will lead the reader through several widely used non-convex optimization techniques, as well as applications thereof. The goal of this monograph is to both, introduce the rich literature in this area, as well as equip the reader with the tools and techniques needed to analyze these simple procedures for non-convex problems.

artificial intelligence, machine learning, non-convex optimization, (18 more...)

arXiv.org Machine Learning

doi: 10.1561/2200000058

1712.07897

Country:

Asia > India (0.27)
North America > United States (0.27)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (0.92)
Information Technology (0.92)
Banking & Finance (0.92)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.45)

Add feedback

On Monte Carlo Tree Search and Reinforcement Learning

Vodopivec, Tom, Samothrakis, Spyridon, Ster, Branko

Journal of Artificial Intelligence ResearchDec-20-2017

Fuelled by successes in Computer Go, Monte Carlo tree search (MCTS) has achieved widespread adoption within the games community. Its links to traditional reinforcement learning (RL) methods have been outlined in the past; however, the use of RL techniques within tree search has not been thoroughly studied yet. In this paper we re-examine in depth this close relation between the two fields; our goal is to improve the cross-awareness between the two communities. We show that a straightforward adaptation of RL semantics within tree search can lead to a wealth of new algorithms, for which the traditional MCTS is only one of the variants. We confirm that planning methods inspired by RL in conjunction with online search demonstrate encouraging results on several classic board games and in arcade video game competitions, where our algorithm recently ranked first. Our study promotes a unified view of learning, planning, and search.

algorithm, backup, sarsa-uct, (16 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.5507

AI Access Foundation

11099

Journal of Artificial Intelligence Research

Country:

Oceania > Australia > New South Wales > Sydney (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)
Europe > Slovenia > Central Slovenia > Municipality of Ljubljana > Ljubljana (0.04)
(12 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Leisure & Entertainment > Games > Computer Games (0.87)
Leisure & Entertainment > Games > Go (0.66)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.45)

Add feedback

Riemann-Theta Boltzmann Machine

Krefl, Daniel, Carrazza, Stefano, Haghighat, Babak, Kahlen, Jens

arXiv.org Machine LearningDec-20-2017

A general Boltzmann machine with continuous visible and discrete integer valued hidden states is introduced. Under mild assumptions about the connection matrices, the probability density function of the visible units can be solved for analytically, yielding a novel parametric density function involving a ratio of Riemann-Theta functions. The conditional expectation of a hidden state for given visible states can also be calculated analytically, yielding a derivative of the logarithmic Riemann-Theta function. The conditional expectation can be used as activation function in a feedforward neural network, thereby increasing the modelling capacity of the network. Both the Boltzmann machine and the derived feedforward neural network can be successfully trained via standard gradient- and non-gradient-based optimization techniques.

artificial intelligence, boltzmann machine, machine learning, (18 more...)

arXiv.org Machine Learning

1712.07581

Country:

North America > United States (0.28)
Europe > United Kingdom > England (0.28)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.83)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

VAMPnets: Deep learning of molecular kinetics

Mardt, Andreas, Pasquali, Luca, Wu, Hao, Noé, Frank

arXiv.org Machine LearningDec-20-2017

There is an increasing demand for computing the relevant structures, equilibria and long-timescale kinetics of biomolecular processes, such as protein-drug binding, from high-throughput molecular dynamics simulations. Current methods employ transformation of simulated coordinates into structural features, dimension reduction, clustering the dimension-reduced data, and estimation of a Markov state model or related model of the interconversion rates between molecular structures. This handcrafted approach demands a substantial amount of modeling expertise, as poor decisions at any step will lead to large modeling errors. Here we employ the variational approach for Markov processes (VAMP) to develop a deep learning framework for molecular kinetics using neural networks, dubbed VAMPnets. A VAMPnet encodes the entire mapping from molecular coordinates to Markov states, thus combining the whole data processing pipeline in a single end-to-end framework. Our method performs equally or better than state-of-the art Markov modeling methods and provides easily interpretable few-state kinetic models.

artificial intelligence, machine learning, output node, (18 more...)

arXiv.org Machine Learning

doi: 10.1038/s41467-017-02388-1

1710.06012

Genre: Research Report (0.83)

Industry: Information Technology (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Add feedback

athenahealth: Data Scientists

@machinelearnbotDec-19-2017, 23:51:04 GMT

Join us to use cutting edge machine learning to unbreak healthcare in the US. In the US, physicians face huge informational challenges – from dealing with mountains of formulaic email to wrestling with arcane insurance rules to finding at-risk patients in their large client pools. Athenahealth's Data Science group is using advanced machine learning and AI to develop a new generation of smart tools that can help physicians by reducing their paperwork, finding at-risk patients, providing key information at the right time, and overall allowing physicians to focus on what's important: spending time with patients. We're seeking experienced data scientists who love machine learning and complex data and who care about making a positive impact on the world by fielding real ML-driven systems. Positions are available at multiple levels of seniority.

artificial intelligence, athenahealth, machine learning, (4 more...)

@machinelearnbot

Country: North America > United States (0.66)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.63)

Add feedback