South America
Learning Hybrid Models with Guarded Transitions
Santana, Pedro (Massachusetts Institute of Technology) | Lane, Spencer (Massachusetts Institute of Technology) | Timmons, Eric (Massachusetts Institute of Technology) | Williams, Brian (Massachusetts Institute of Technology) | Forster, Carlos (Instituto Tecnológico de Aeronáutica)
Innovative methods have been developed for diagnosis, activity monitoring, and state estimation that achieve high accuracy through the use of stochastic models involving hybrid discrete and continuous behaviors. A key bottleneck is the automated acquisition of these hybrid models, and recent methods have focused predominantly on Jump Markov processes and piecewise autoregressive models. In this paper, we present a novel algorithm capable of performing unsupervised learning of guarded Probabilistic Hybrid Automata (PHA) models, which extends prior work by allowing stochastic discrete mode transitions in a hybrid system to have a functional dependence on its continuous state. Our experiments indicate that guarded PHA models can yield significant performance improvements when used by hybrid state estimators, particularly when diagnosing the true discrete mode of the system, without any noticeable impact on their real-time performance.
Belief Revision with General Epistemic States
Meng, Hua (Southwest Jiaotong University) | Kou, Hui (Sichuan University) | Li, Sanjiang (University of Technology, Sydney)
In order to properly regulate iterated belief revision, Darwiche and Pearl (1997) model belief revision as revising epistemic states by propositions. An epistemic state in their sense consists of a belief set and a set of conditional beliefs. Although the denotation of an epistemic state can be indirectly captured by a total preorder on the set of worlds, it is unclear how to directly capture the structure in terms of the beliefs and conditional beliefs it contains. In this paper, we first provide an axiomatic characterisation for epistemic states by using nine rules about beliefs and conditional beliefs, and then argue that the last two rules are too strong and should be eliminated for characterising the belief state of an agent. We call a structure which satisfies the first seven rules a general epistemic state (GEP). To provide a semantical characterisation of GEPs, we introduce a mathematical structure called belief algebra, which is in essence a certain binary relation defined on the power set of worlds.We then establish a 1-1 correspondence between GEPs and belief algebras, and show that total preorders on worlds are special cases of belief algebras. Furthermore, using the notion of belief algebras, we extend the classical iterated belief revision rules of Darwiche and Pearl to our setting of general epistemic states.
Collaboration in Social Problem-Solving: When Diversity Trumps Network Efficiency
Noble, Diego (Federal University of Rio Grande do Sul) | Prates, Marcelo (Federal University of Rio Grande do Sul) | Bossle, Daniel (Federal University of Rio Grande do Sul) | Lamb, Luís (Federal University of Rio Grande do Sul)
Recent studies have suggested that current agent-based models are not sufficiently sophisticated to reproduce results achieved by human collaborative learning and reasoning. Such studies suggest that humans are diverse and dynamic when solving problems socially. However, despite their relevance to problem-solving, these two behavioral features have not yet been fully investigated. In this paper we analyse a recent social problem-solving model and attempt to address its shortcomings. Specifically, we investigate the effects of separating exploitation from exploration in agent behaviors and explore the concept of diversity in such models. We found out that diverse populations outperform homogeneous ones in both efficient and inefficient networks. Finally, we show that agent diversity is more relevant than the strategic behavioral dynamics. This work contributes towards understanding the role of diverse and dynamic behaviors in social problem-solving as well as the advancement of state-of-art social problem-solving models.
Reusing Previously Found A* Paths for Fast Goal-Directed Navigation in Dynamic Terrain
Hernandez, Carlos (Universidad Católica de la Santísima Concepción) | Asin, Roberto (Universidad Católica de la Santísima Concepción) | Baier, Jorge A (Pontificia Universidad Catolica de Chile)
Generalized Adaptive A* (GAA*) is an incremental algorithm that replans using A* when solving goal-directed navigation problems in dynamic terrain. Immediately after each A* search, it runs an efficient procedure that updates the heuristic values of states that were just expanded by A*, making them more informed. Those updates allow GAA* to speed up subsequent A* searches. Being based on A*, it is simple to describe and communicate; however, it is outperformed by other incremental algorithms like the state-of-the-art D*Lite algorithm at goal-directed navigation. In this paper we show how GAA* can be modified to exploit more information from a previous search in addition to the updated heuristic function. Specifically, we show how GAA* can be modified to utilize the paths found by a previous A* search. Our algorithm — Multipath Generalized Adaptive A* (MPGAA*) — has the same theoretical properties of GAA* and differs from it by only a few lines of pseudocode. Arguably, MPGAA* is simpler to understand than D*Lite. We evaluate MPGAA* over various realistic dynamic terrain settings, and observed that it generally outperforms the state-of-the-art algorithm D*Lite in scenarios resembling outdoor and indoor navigation.
Consistent Knowledge Discovery from Evolving Ontologies
Lecue, Freddy (IBM Dublin Research Center) | Pan, Jeff Z (The University of Aberdeen, UK. )
Deductive reasoning and inductive learning are the most common approaches for deriving knowledge. In real world applications when data is dynamic and incomplete, especially those exposed by sensors, reasoning is limited by dynamics of data while learning is biased by data incompleteness. Therefore discovering consistent knowledge from incomplete and dynamic data is a challenging open problem. In our approach the semantics of data is captured through ontologies to empower learning (mining) with (Description Logics) reasoning. Consistent knowledge discovery is achieved by applying generic, significative, representative association semantic rules. The experiments have shown scalable, accurate and consistent knowledge discovery with data from Dublin.
Teaching AI Ethics Using Science Fiction
Burton, Emanuelle (Center College) | Goldsmith, Judy (University of Kentucky) | Mattei, Nicholas (NICTA and University of New South Wales)
The cultural and political implications of modern AI research are not some far off concern, they are things that affect the world in the here and now. From advanced control systems with advanced visualizations and image processing techniques that drive the machines of the modern military to the slow creep of a mechanized workforce, ethical questions surround us. Part of dealing with these ethical questions is not just speculating on what could be but teaching our students how to engage with these ethical questions. We explore the use of science fiction as an appropriate tool to enable AI researchers to help engage students and the public on the current state and potential impacts of AI.
I Spy: An Interactive Game-Based Approach to Multimodal Robot Learning
Parde, Natalie Paige (University of North Texas) | Papakostas, Michalis (University of Texas Arlington and NCSR Demokritos) | Tsiakas, Konstantinos (University of Texas Arlington and NCSR Demokritos) | Dagioglou, Maria (NCSR Demokritos) | Karkaletsis, Vangelis (NCSR Demokritos) | Nielsen, Rodney D (University of North Texas)
Teaching robots about objects in their environment requires a multimodal correlation of images and linguistic descriptions to build complete feature and object models. These models can be created manually by collecting images and related keywords and presenting the pairings to robots, but doing so is tedious and unnatural. This work abstracts the problem of training robots to learn about the world around them by introducing I Spy , an interactive dialogue- and vision-based game in which players place objects in front of a humanoid robot and challenge it to guess which object they have in mind. The robot gradually learns about the objects and the features which describe them through repeated games, by updating its knowledge with newly captured training images. This paper details I Spy's learning and gaming processes, describes the approaches taken to extract information from multiple modalities both before and during gameplay, and finally discusses the results of a study designed to evaluate the game's model accuracy over time, its overall performance, and its appeal to human players.
An Accelerated Approach to Decentralized Reinforcement Learning of the Ball-Dribbling Behavior
Leottau, David Leonardo (Universidad de Chile) | Ruiz-del-Solar, Javier (Universidad de Chile)
In the context of soccer robotics, ball dribbling is a complex behavior where a robot player attempts to maneuver the ball in a very controlled way, while moving towards a desired target. To learn when and how to modify the robot’s velocity vector is a complex problem, hardly solvable in an effective way with methods based on identification of the system dynamics and/or kinematics and mathematical models. We propose a decentralized reinforcement learning strategy, where each component of the omnidirectional biped walk (𝑣𝑥,𝑣𝑦,𝑣𝜃) is learned in parallel with single-agents working in a multiagent task. Moreover, we propose an approach to accelerate the decentralized learning based on knowledge transfer from simple linear controllers. Obtained results are successful; with less human effort, and less required designer knowledge, the decentralized reinforcement learning scheme shows better performances than the current dribbling engine used by UChile Robotics Team in the SPL robot soccer competitions. The proposed decentralized rein- forcement learning scheme achieves asymptotic performance after 1500 episodes and can be accelerated up to 70% by using our approach to share actions.
Comparative Analysis of Abstract Policies to Transfer Learning in Robotics Navigation
Freire, Valdinei (Universidade de São Paulo) | Costa, Anna Helena Reali (Universidade de São Paulo)
Reinforcement learning enables a robot to learn behavior through trial-and-error. However, knowledge is usually built from scratch and learning may take a long time. Many approaches have been proposed to transfer the knowledge learned in one task and reuse it in another new similar task to speed up learning in the target task.A very effective knowledge to be transferred is an abstract policy, which generalizes the learned policies in source tasks to extend the domain of tasks that can reuse them.There are inductive and deductive methods to generate abstract policies.However, there is a lack of deeper analysis to assess not only the effectiveness of each type of policy, but also the way in which each policy is used to accelerate the learning in a new task.In this paper we propose two simple inductive methods and we use a deductive method to generate stochastic abstract policies from source tasks. We also propose two strategies to use the abstract policy during learning in a new task: the hard and the soft strategy. We make a comparative analysis between the three types of policies and the two strategies of use in a robotic navigation domain.We show that these techniques are effective in improving the agent learning performance, especially during the early stages of the learning process, when the agent is completely unaware of the new task.
Formulating LUTI Calibration as an Optimisation Problem: Estimation of Tranus Shadow Price and Substitution Parameters
Capelle, Thomas (Inria and Université Grenoble Alpes) | Sturm, Peter (Inria and Université Grenoble Alpes) | Vidard, Arthur (Inria and Université Grenoble Alpes) | Morton, Brian (University of North Carolina at Chapel Hill)
Cities and their employment catchment areas are focus points of economic activity, transportation, and social interactions. The need for land use and transport inte- grated modelling (LUTI modelling) as a decision aid tool in urban planning, has become apparent. Instanti- ating such models on cities, requires a substantial data collection, model structuring and parameter estimation effort; for conciseness, the latter is referred to here as calibration. This work is a partial effort towards the integrated calibration of LUTI models. It considers one of the most widely used LUTI models and softwares, Tranus. The usual calibration approach for Tranus is briefly reviewed. It is then reformulated as an optimisa- tion problem, in order to make it amenable to the sys- tematic incorporation of constraints on parameters and additional data and to form a clear basis for future fully integrated calibration. The problem at hand concerns a dynamic system; an approach is shown how to “elimi- nate” parts of the dynamics in order to ease the param- eter optimisation. We also discuss how to validate cali- bration results and propose to use synthetic data gener- ated from real world problems in order to assess conver- gence properties and accuracy of calibration methods.