Edmonton
AAAI Conferences Calendar
This page includes forthcoming AAAI sponsored conferences, conferences presented by AAAI Affiliates, and conferences held in cooperation with AAAI. AI Magazine also maintains a calendar listing that includes nonaffiliated conferences at www.aaai.org/Magazine/calendar.php. The APA Technology, Mind, and Research Society Conference. Mind & Society will be held April 5-7, 2018 in Melbourne, Florida, USA. Other Applications of Applied Intelligent be held June 24-29, 2018 in Delft, The on Web and Social Media.
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
Machado, Marlos C., Bellemare, Marc G., Talvitie, Erik, Veness, Joel, Hausknecht, Matthew, Bowling, Michael
The Arcade Learning Environment (ALE) is an evaluation platform that poses the challenge of building AI agents with general competency across dozens of Atari 2600 games. It supports a variety of different problem settings and it has been receiving increasing attention from the scientific community, leading to some high-profile success stories such as the much publicized Deep Q-Networks (DQN). In this article we take a big picture look at how the ALE is being used by the research community. We show how diverse the evaluation methodologies in the ALE have become with time, and highlight some key concerns when evaluating agents in the ALE. We use this discussion to present some methodological best practices and provide new benchmark results using these best practices. To further the progress in the field, we introduce a new version of the ALE that supports multiple game modes and provides a form of stochasticity we call sticky actions. We conclude this big picture look by revisiting challenges posed when the ALE was introduced, summarizing the state-of-the-art in various problems and highlighting problems that remain open.
Today in AI: Toronto-based DeepLearni.ng joins Vector Institute, NextAI expands program to Montreal
Several Canadian startups have made announcements and launched new features to make strides in AI. DeepLearni.ng announced that it is now a partner of Toronto's Vector Institute, which is working to be a hub for artificial intelligence research. At the launch of tin March 2017, the government announced that 30 companies were dedicating $80 million to the Institute. DeepLearni.ng is now a Bronze sponsor of the Vector Institute, joining companies like Helpful.com, the Chan Zuckerberg initiative, and integrate.ai "In our work with enterprises around the world, our team of machine learning practitioners strives to transform the latest AI research into real-world business applications," said Stephen Piron, the company's co-founder and co-CEO.
Reactive Reinforcement Learning in Asynchronous Environments
Travnik, Jaden B., Mathewson, Kory W., Sutton, Richard S., Pilarski, Patrick M.
The relationship between a reinforcement learning (RL) agent and an asynchronous environment is often ignored. Frequently used models of the interaction between an agent and its environment, such as Markov Decision Processes (MDP) or Semi-Markov Decision Processes (SMDP), do not capture the fact that, in an asynchronous environment, the state of the environment may change during computation performed by the agent. In an asynchronous environment, minimizing reaction time---the time it takes for an agent to react to an observation---also minimizes the time in which the state of the environment may change following observation. In many environments, the reaction time of an agent directly impacts task performance by permitting the environment to transition into either an undesirable terminal state or a state where performing the chosen action is inappropriate. We propose a class of reactive reinforcement learning algorithms that address this problem of asynchronous environments by immediately acting after observing new state information. We compare a reactive SARSA learning algorithm with the conventional SARSA learning algorithm on two asynchronous robotic tasks (emergency stopping and impact prevention), and show that the reactive RL algorithm reduces the reaction time of the agent by approximately the duration of the algorithm's learning update. This new class of reactive algorithms may facilitate safer control and faster decision making without any change to standard learning guarantees.
Another Fortune 500 Company to Conduct Pilot Evaluation of OneSoft--s Machine Learning Platform
Edmonton, Alberta, Feb. 07, 2018 (GLOBE NEWSWIRE) -- OneSoft Solutions Inc. (the --Company-- or --OneSoft--) (TSX-V:OSS, OTC:OSSIF)--is pleased to announce that its wholly owned subsidiary, OneBridge Solutions, Inc. (--OneBridge--), has entered into a Pilot Program agreement with another U.S.-based, Fortune 500 natural gas, oil and petrochemical company (the --Client--). The Client, whose operations include natural gas gathering, treating, processing, transportation and storage, primarily in the United States, will evaluate OneBridge--s Cognitive Integrity ManagementTM (--CIM--) SaaS solution.
Preliminary Results on Exploration-Driven Satisfiability Solving
Chowdhury, Md Solimul (The University of Alberta) | Müller, Martin (The University of Alberta) | You, Jia-Huai (The University of Alberta)
In this abstract, we present our study of exploring the SAT search space via random-sampling, with the goal of improving Conflict Directed Clause Learning (CDCL) SAT solvers. Our proposed CDCL SAT solving algorithm expSAT uses a novel branching heuristic expVSIDS. It combines the standard VSIDS scores with heuristic scores derived from exploration. Experiments with application benchmarks from recent SAT competitions demonstrate the potential of the expSAT approach for improving CDCL SAT solvers.
On Monte Carlo Tree Search and Reinforcement Learning
Vodopivec, Tom, Samothrakis, Spyridon, Ster, Branko
Fuelled by successes in Computer Go, Monte Carlo tree search (MCTS) has achieved widespread adoption within the games community. Its links to traditional reinforcement learning (RL) methods have been outlined in the past; however, the use of RL techniques within tree search has not been thoroughly studied yet. In this paper we re-examine in depth this close relation between the two fields; our goal is to improve the cross-awareness between the two communities. We show that a straightforward adaptation of RL semantics within tree search can lead to a wealth of new algorithms, for which the traditional MCTS is only one of the variants. We confirm that planning methods inspired by RL in conjunction with online search demonstrate encouraging results on several classic board games and in arcade video game competitions, where our algorithm recently ranked first. Our study promotes a unified view of learning, planning, and search.
Heads-Up Limit Hold'em Poker Is Solved
Mirowski cites Turing as author of the paragraph containing this remark. The paragraph appeared in [46], in a chapter with Turing listed as one of three contributors. Which parts of the chapter are the work of which contributor, particularly the introductory material containing this quote, is not made explicit.
Kernel Contraction and Base Dependence
Oveisi, Mehrdad, Delgrande, James P., Pelletier, Francis Jeffry, Popowich, Fred
The AGM paradigm of belief change studies the dynamics of belief states in light of new information. Finding, or even approximating, those beliefs that are dependent on or relevant to a change is valuable because, for example, it can narrow the set of beliefs considered during belief change operations. A strong intuition in this area is captured by Gärdenforss preservation criterion (GPC), which suggests that formulas independent of a belief change should remain intact. GPC thus allows one to build dependence relations that are linked with belief change. Such dependence relations can in turn be used as a theoretical benchmark against which to evaluate other approximate dependence or relevance relations. Fariñas and Herzig axiomatize a dependence relation with respect to a belief set, and, based on GPC, they characterize the correspondence between AGM contraction functions and dependence relations. In this paper, we introduce base dependence as a relation between formulas with respect to a belief base, and prove a more general characterization that shows the correspondence between kernel contraction and base dependence. At this level of generalization, different types of base dependence emerge, which we show to be a result of possible redundancy in the belief base. We further show that one of these relations that emerge, strong base dependence, is parallel to saturated kernel contraction. We then prove that our latter characterization is a reversible generalization of Fariñas and Herzigs characterization. That is, in the special case when the underlying belief base is deductively closed (i.e., it is a belief set), strong base dependence reduces to dependence, and so do their respective characterizations. Finally, an intriguing feature of Fariñas and Herzigs formalism is that it meets other criteria for dependence, namely, Keyness conjunction criterion for dependence (CCD) and Gärdenforss conjunction criterion for independence (CCI). We prove that our base dependence formalism also meets these criteria. Even more interestingly, we offer a more specific criterion that implies both CCD and CCI, and show our base dependence formalism also meets this new criterion.