Agents
A Tactical Command Approach to Human Control of Vehicle Swarms
Beal, Jacob (BBN Technologies)
Human control of vehicle swarms faces a dilemma: an operator must be able to exercise precise control over how a mission is executed, but controlling individual vehicles is not scalable. The Proto spatial computing lan- guage offers an intermediate representation, where the motion of a swarm is specified as a vector field, which is then approximated by the movement of individual members (Bachrach, Beal, and McLurkin 2010). I propose that this can be exploited to build a “tactical command” model of swarm control, whereby human “officers” dynamically decompose a swarm into units and task those units to carry out geometric and topological maneuvers under the constraints imposed by the platform. This abstraction may also allow situation awareness interfaces for individual agents to be extended to apply to swarm units.
Robotic Swarms as Solids, Liquids and Gasses
Apker, Thomas B. (NRC/NRL Postdoctoral Fellow) | Potter, Mitchell A. (United States Naval Research Laboratory)
There have been significant advances in developing each phase of the mission. Secondly, based on our everyday algorithms that allow researchers to examine these experience with physical objects in our environment, behaviors in simulation (Luke et al. 2005), generally assuming the three major physical states of matter, solid, liquid and noise-free estimates of the agents' own, neighbors' and gas, represent a natural and intuitive means of describing the targets' positions. However, the actual information flow into types of motions a swarm of mobile robots can perform as biological agents' in terms of the sensing, processing and they cluster, transit or wander (Gage 1992).
A Biomimetic Approach Based on Immune Systems for Classification of Unstructured Data
Hamou, Mohamed, Amine, Abdelmalek, Lokbani, Ahmed Chaouki
In this paper we present the results of unstructured data clustering in this case a textual data from Reuters 21578 corpus with a new biomimetic approach using immune system. Before experimenting our immune system, we digitalized textual data by the n-grams approach. The novelty lies on hybridization of n-grams and immune systems for clustering. The experimental results show that the recommended ideas are promising and prove that this method can solve the text clustering problem.
Asynchronous Decentralized Algorithm for Space-Time Cooperative Pathfinding
Čáp, Michal, Novák, Peter, Vokřínek, Jiří, Pěchouček, Michal
Cooperative pathfinding is a multi-agent path planning problem where a group of vehicles searches for a corresponding set of non-conflicting space-time trajectories. Many of the practical methods for centralized solving of cooperative pathfinding problems are based on the prioritized planning strategy. However, in some domains (e.g., multi-robot teams of unmanned aerial vehicles, autonomous underwater vehicles, or unmanned ground vehicles) a decentralized approach may be more desirable than a centralized one due to communication limitations imposed by the domain and/or privacy concerns. In this paper we present an asynchronous decentralized variant of prioritized planning ADPP and its interruptible version IADPP. The algorithm exploits the inherent parallelism of distributed systems and allows for a speed up of the computation process. Unlike the synchronized planning approaches, the algorithm allows an agent to react to updates about other agents' paths immediately and invoke its local spatio-temporal path planner to find the best trajectory, as response to the other agents' choices. We provide a proof of correctness of the algorithms and experimentally evaluate them on synthetic domains.
$QD$-Learning: A Collaborative Distributed Strategy for Multi-Agent Reinforcement Learning Through Consensus + Innovations
Kar, Soummya, Moura, Jose' M. F., Poor, H. Vincent
The paper considers a class of multi-agent Markov decision processes (MDPs), in which the network agents respond differently (as manifested by the instantaneous one-stage random costs) to a global controlled state and the control actions of a remote controller. The paper investigates a distributed reinforcement learning setup with no prior information on the global state transition and local agent cost statistics. Specifically, with the agents' objective consisting of minimizing a network-averaged infinite horizon discounted cost, the paper proposes a distributed version of $Q$-learning, $\mathcal{QD}$-learning, in which the network agents collaborate by means of local processing and mutual information exchange over a sparse (possibly stochastic) communication network to achieve the network goal. Under the assumption that each agent is only aware of its local online cost data and the inter-agent communication network is \emph{weakly} connected, the proposed distributed scheme is almost surely (a.s.) shown to yield asymptotically the desired value function and the optimal stationary control policy at each network agent. The analytical techniques developed in the paper to address the mixed time-scale stochastic dynamics of the \emph{consensus + innovations} form, which arise as a result of the proposed interactive distributed scheme, are of independent interest.
Coalition Structure Generation over Graphs
Voice, T., Polukarov, M., Jennings, N. R.
We give the analysis of the computational complexity of coalition structure generation over graphs. Given an undirected graph G = (N,E) and a valuation function v : P(N) → R over the subsets of nodes, the problem is to find a partition of N into connected subsets, that maximises the sum of the components values. This problem is generally NPcomplete; in particular, it is hard for a defined class of valuation functions which are independent of disconnected membersthat is, two nodes have no effect on each others marginal con- tribution to their vertex separator. Nonetheless, for all such functions we provide bounds on the complexity of coalition structure generation over general and minorfree graphs. Our proof is constructive and yields algorithms for solving corresponding instances of the problem. Furthermore, we derive linear time bounds for graphs of bounded treewidth. However, as we show, the problem remains NPcomplete for planar graphs, and hence, for any K_k minorfree graphs where k ≥ 5. Moreover, a 3-SAT problem with m clauses can be represented by a coalition structure generation problem over a planar graph with O(m^2) nodes. Importantly, our hardness result holds for a particular subclass of valuation functions, termed edge sum, where the value of each subset of nodes is simply determined by the sum of given weights of the edges in the induced subgraph.
Decentralized Sensor Fusion With Distributed Particle Filters
Rosencrantz, Matthew, Gordon, Geoffrey, Thrun, Sebastian
This paper presents a scalable Bayesian technique for decentralized state estimation from multiple platforms in dynamic environments. As has long been recognized, centralized architectures impose severe scaling limitations for distributed systems due to the enormous communication overheads. We propose a strictly decentralized approach in which only nearby platforms exchange information. They do so through an interactive communication protocol aimed at maximizing information flow. Our approach is evaluated in the context of a distributed surveillance scenario that arises in a robotic system for playing the game of laser tag. Our results, both from simulation and using physical robots, illustrate an unprecedented scaling capability to large teams of vehicles.
Agent-Based Modeling and Simulation
Klügl, Franziska (Orebro University) | Bazzan, Ana L. C. (Universidade Federal do Rio Grande do Sul)
This article gives an introduction to agent-based modeling and simulation (ABMS). After a general discussion about modeling and simulation, we address the basic concept of ABMS, focusing on its generative and bottom-up nature, its advantages as well as its pitfalls. The subsequent part of the article deals with application-oriented aspects, including selected tools and well-known applications. In order to illustrate the benefits of using ABMS, we focus on several aspects of a well-known area related to simulation of complex systems, namely traffic.