 University of the Witwatersrand


Belief Reward Shaping in Reinforcement Learning

AAAI Conferences

A key challenge in many reinforcement learning problems is delayed rewards, which can significantly slow down learning. Although reward shaping has previously been introduced to accelerate learning by bootstrapping an agent with additional information, this can lead to problems with convergence. We present a novel Bayesian reward shaping framework that augments the reward distribution with prior beliefs that decay with experience. Formally, we prove that under suitable conditions a Markov decision process augmented with our framework is consistent with the optimal policy of the original MDP when using the Q-learning algorithm. More generally, our method integrates seamlessly with any reinforcement learning algorithm that learns a value or action-value function through experience. Experiments on a gridworld and the more complex domain of backgammon show that tasks are learned significantly faster when intuitive priors are specified over the reward distribution.
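
A minimal sketch of the central idea in Python, assuming a tabular Q-learning setting. The environment interface (reset/actions/step), the prior callable, and the count-based decay scheme are illustrative assumptions, not the paper's exact Bayesian posterior:

```python
import random
from collections import defaultdict

def shaped_reward(r, prior_mean, count, prior_strength=10.0):
    # Blend the observed reward with the prior belief about this
    # state-action pair; the prior's weight decays with the visit
    # count, so the true rewards dominate asymptotically.
    # (Illustrative decay scheme, not the paper's exact formulation.)
    w = prior_strength / (prior_strength + count)
    return w * prior_mean + (1.0 - w) * r

def q_learning_with_belief_shaping(env, prior, episodes=500,
                                   alpha=0.1, gamma=0.99, eps=0.1):
    Q = defaultdict(float)   # (state, action) -> estimated value
    N = defaultdict(int)     # (state, action) -> visit count
    for _ in range(episodes):
        s, done = env.reset(), False
        while not done:
            acts = env.actions(s)
            if random.random() < eps:
                a = random.choice(acts)
            else:
                a = max(acts, key=lambda b: Q[(s, b)])
            s2, r, done = env.step(s, a)
            N[(s, a)] += 1
            r_hat = shaped_reward(r, prior(s, a), N[(s, a)])
            target = r_hat if done else r_hat + gamma * max(
                Q[(s2, b)] for b in env.actions(s2))
            Q[(s, a)] += alpha * (target - Q[(s, a)])
    return Q
```

Because the shaping term vanishes as visit counts grow, the update reduces to ordinary Q-learning in the limit, which is the intuition behind the consistency result stated above.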


An Analysis of Monte Carlo Tree Search

AAAI Conferences

Monte Carlo Tree Search (MCTS) is a family of directed search algorithms that has gained widespread attention in recent years. Despite the vast amount of research into MCTS, the effect of modifications on the algorithm, as well as the manner in which it performs in various domains, is not yet fully understood. In particular, the effect of using knowledge-heavy rollouts in MCTS remains poorly understood, with surprising results demonstrating that better-informed rollouts often result in worse-performing agents. We present experimental evidence suggesting that, under certain smoothness conditions, uniformly random simulation policies preserve the ordering over action preferences. This explains the success of MCTS despite its common use of such rollouts to evaluate states. We further analyse non-uniformly random rollout policies and describe conditions under which they offer improved performance.
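
For concreteness, here is a minimal UCT-style MCTS sketch in Python with the uniformly random rollout policy the abstract discusses. It assumes a single-agent, reward-maximising setting and a hypothetical game interface (legal_moves, next_state, is_terminal, reward); two-player sign flips and other refinements are omitted:

```python
import math
import random

class Node:
    def __init__(self, state, parent=None):
        self.state = state
        self.parent = parent
        self.children = {}   # move -> Node
        self.visits = 0
        self.value = 0.0

def uct_select(node, c=1.414):
    # UCB1 over the children: exploit high mean value, explore
    # rarely visited moves.
    return max(node.children.values(),
               key=lambda ch: ch.value / ch.visits
               + c * math.sqrt(math.log(node.visits) / ch.visits))

def rollout(game, state):
    # Uniformly random simulation policy -- the case the paper
    # argues preserves the ordering over action preferences.
    while not game.is_terminal(state):
        state = game.next_state(state, random.choice(game.legal_moves(state)))
    return game.reward(state)

def mcts(game, root_state, iterations=1000):
    root = Node(root_state)
    for _ in range(iterations):
        node = root
        # 1. Selection: descend while the node is fully expanded.
        while node.children and len(node.children) == len(game.legal_moves(node.state)):
            node = uct_select(node)
        # 2. Expansion: add one untried child, if non-terminal.
        if not game.is_terminal(node.state):
            move = random.choice([m for m in game.legal_moves(node.state)
                                  if m not in node.children])
            child = Node(game.next_state(node.state, move), parent=node)
            node.children[move] = child
            node = child
        # 3. Simulation: uniform random rollout from the new node.
        value = rollout(game, node.state)
        # 4. Backpropagation: update statistics along the path.
        while node is not None:
            node.visits += 1
            node.value += value
            node = node.parent
    # Recommend the most-visited root move.
    return max(root.children, key=lambda m: root.children[m].visits)
```

Swapping the rollout function for a heuristic policy is the "knowledge-heavy" variant whose sometimes-worse performance motivates the paper's analysis.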


Reinforcement Learning with Parameterized Actions

AAAI Conferences

We introduce a model-free algorithm for learning in Markov decision processes with parameterized actions—discrete actions with continuous parameters. At each step the agent must select both which action to use and which parameters to use with that action. We introduce the Q-PAMDP algorithm for learning in these domains, show that it converges to a local optimum, and compare it to direct policy search in the goal-scoring and Platform domains.
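
A minimal sketch of the action-selection structure in such a parameterized-action MDP, assuming linear function approximation and Gaussian parameter policies (all names and the linear forms are illustrative, not the paper's exact architecture):

```python
import numpy as np

class ParameterizedActionAgent:
    """Pick a discrete action from Q-values, then a continuous
    parameter vector for that action from a per-action policy."""

    def __init__(self, n_actions, state_dim, param_dims, std=0.5):
        # Linear Q-function: one weight vector per discrete action.
        self.q_weights = np.zeros((n_actions, state_dim))
        # Linear-Gaussian parameter policies: mean_a = W_a @ state.
        self.policy_weights = [np.zeros((d, state_dim)) for d in param_dims]
        self.std = std

    def act(self, state, eps=0.1):
        n = self.q_weights.shape[0]
        # Discrete choice: epsilon-greedy over Q(s, a).
        if np.random.rand() < eps:
            a = np.random.randint(n)
        else:
            a = int(np.argmax(self.q_weights @ state))
        # Continuous choice: sample this action's parameter vector.
        x = np.random.normal(self.policy_weights[a] @ state, self.std)
        return a, x
```

In this setting, Q-PAMDP alternates between improving the Q-function with the parameter policies held fixed and improving the parameter policies (by policy search) with the Q-function held fixed, which is what yields the local-optimum convergence result.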


Identifying and Tracking Switching, Non-Stationary Opponents: A Bayesian Approach

AAAI Conferences

In many situations, agents are required to use a set of strategies (behaviors) and switch among them during the course of an interaction. This work focuses on the problem of recognizing the strategy used by an agent within a small number of interactions. We propose a Bayesian framework to address this problem. Bayesian policy reuse (BPR) has been empirically shown to be efficient at correctly detecting the best policy to use from a library in sequential decision tasks. In this paper we extend BPR to adversarial settings, in particular to opponents that switch from one stationary strategy to another. Our extension enables learning new models online when the learning agent detects that its current policies are not performing optimally. Experiments in repeated games show that our approach efficiently detects opponent strategies and reacts quickly to behavior switches, thereby yielding better average rewards than state-of-the-art approaches.
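
A minimal sketch of the BPR-style belief machinery in Python, under the assumption that performance models (reward likelihoods per opponent/policy pair) have been learned beforehand; the uniform-mixing step for handling switches is one common choice, not necessarily the paper's exact mechanism:

```python
import numpy as np

def update_belief(belief, policy, reward, perf_models):
    """One Bayesian belief update over the M known opponent models.

    belief      -- prior over opponent strategies, shape (M,)
    perf_models -- perf_models[m][policy](reward) returns the
                   likelihood p(reward | opponent m, policy)
    """
    like = np.array([perf_models[m][policy](reward)
                     for m in range(len(belief))])
    post = belief * like
    return post / post.sum()

def hedge_for_switches(belief, rho=0.05):
    # Mix in a little uniform mass so the belief never fully commits
    # and can recover quickly when the opponent switches strategy.
    return (1.0 - rho) * belief + rho / len(belief)

def select_policy(belief, utility):
    # Choose the policy with the highest expected utility under the
    # belief; utility[m, k] = expected reward of policy k vs opponent m.
    return int(np.argmax(belief @ utility))
```

When the observed rewards are persistently unlikely under every known model, the agent can trigger online learning of a new opponent model, as described in the abstract.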