Young, Jay
Learning Micro-Management Skills in RTS Games by Imitating Experts
Young, Jay (The University of Birmingham) | Hawes, Nick (The University of Birmingham)
We investigate the problem of learning the control of small groups of units in combat situations in Real Time Strategy (RTS) games. AI systems may acquire such skills by observing and learning from expert players, or other AI systems performing those tasks. However, access to training data may be limited, and representations based on metric information -- position, velocity, orientation etc. -- may be brittle, difficult for learning mechanisms to work with, and generalise poorly to new situations. In this work we apply \textit{qualitative spatial relations} to compress such continuous, metric state-spaces into symbolic states, and show that this makes the learning problem easier, and allows for more general models of behaviour. Models learnt from this representation are used to control situated agents, and imitate the observed behaviour of both synthetic (pre-programmed) agents, as well as the behaviour of human-controlled agents on a number of canonical micro-management tasks. We show how a Monte-Carlo method can be used to decompress qualitative data back in to quantitative data for practical use in our control system. We present our work applied to the popular RTS game Starcraft.
Predicting Situated Behaviour Using Sequences of Abstract Spatial Relations
Young, Jay (The University of Birmingham) | Hawes, Nick (The University of Birmingham)
The ability to understand behaviour is a crucial skill for Artificial Intelligence systems that are expected to interact with external agents such as humans or other AI systems. Such systems might be expected to operate in co-operative or team-based scenarios, such as domestic robots capable of helping out with household jobs, or disaster relief robots expected to collaborate and lend assistance to others. Conversely, they may also be required to hinder the activities of malicious agents in adversarial scenarios. In this paper we address the problem of modelling agent behaviour in domains expressed in continuous, quantitative space by applying qualitative, relational spatial abstraction techniques. We employ three common techniques for Qualitative Spatial Reasoning — the Region Connection Calculus, the Qualitative Trajectory Calculus and the Star calculus. We then supply an algorithm based on analysis of Mutual Information that allows us to find the set of abstract, spatial relationships that provide high degrees of information about an agent's future behaviour. We employ the RoboCup soccer simulator as a base for movement-based tasks of our own design and compare the predictions of our system against those of systems utilising solely metric representations. Results show that use of a spatial abstraction-based representation, along with feature selection mechanisms, allows us to outperform metric representations on the same tasks.
Evolutionary Learning of Goal Priorities in a Real-Time Strategy Game
Young, Jay (The University of Birmingham, United Kingdom) | Hawes, Nick (The University of Birmingham, United Kingdom)
However, due to the small numbers of goals present in existing systems, goal management Autonomous AI systems should be aware of their own goals is a relatively simple affair. Hanheide et al. (2010) describe and be capable of independently formulating behaviour to a system similar in architecture to our own that manages address them. We would ideally like to provide an agent with just two goals, whereas the one discussed in this paper must a collection of competences that allow it to act in novel situations manage upwards of forty. As the number of goals increases, that may not be predictable at design-time. In particular, the potential for goal conflict grows. This leads to a requirement we are interested in the operation of AI systems in for more sophisticated management processes, such as complex, oversubscribed domains where there may exist a dynamic goal re-prioritisation, allowing agents to alter their variety of ways to address high-level goals by composing behaviour to meet changing operational requirements. In the behaviours to achieve a set of sub-goals taken from a larger oversubscribed problem domains we are interested in, encoding set. Our research focusses how such sub-goals might be chosen all possible operating strategies at design time may (i.e.