Government
Apoptotic Stigmergic Agents for Real-Time Swarming Simulation
Parunak, H. Van Dyke (Jacobs Technology Group) | Brooks, S. Hugh (enkidu7) | Brueckner, Sven A. (Jacobs Technology Group) | Gupta, Ravi (enkidu7)
One common use for swarming agents is in social simulation. This paper reports on such a model developed to track protest activities at the May 2012 NATO summit in Chicago. The use of apoptotic stigmergic agents allows the model to run on-line, consuming two kinds of external data and reporting its results in real time.
A Tactical Command Approach to Human Control of Vehicle Swarms
Beal, Jacob (BBN Technologies)
Human control of vehicle swarms faces a dilemma: an operator must be able to exercise precise control over how a mission is executed, but controlling individual vehicles is not scalable. The Proto spatial computing lan- guage offers an intermediate representation, where the motion of a swarm is specified as a vector field, which is then approximated by the movement of individual members (Bachrach, Beal, and McLurkin 2010). I propose that this can be exploited to build a “tactical command” model of swarm control, whereby human “officers” dynamically decompose a swarm into units and task those units to carry out geometric and topological maneuvers under the constraints imposed by the platform. This abstraction may also allow situation awareness interfaces for individual agents to be extended to apply to swarm units.
Hansel and Gretel for All Ages: A Template for Recurring Humor Dialog
Kadri, Faisal L. (Independent Researcher)
The fable of Hansel and Gretel describes the plight of two children over two types of threat; harm to their immediate survival and pain from hunger. The two contexts of self-preservation and feeding are evident from the flow of the story dialog, therefore an automatic re-playing of dialog can be realized by picking sentences from two lists; one containing sentences in the context of self-preservation, the other in the context of feeding. Theory and Internet humor appreciation surveys suggest that humorous sentences in the context of self-preservation have relatively constant preference with respect to age, while in the context of hunger and protection of feeding turf to decline with age, reflecting the reduced need for food with aging. Sentences in the context of sociosexual relationships increased in preference until adulthood then declined with maturity. Also, sentences in parenting context, such as when caring for offspring, society and the environment were found to increase in preference with age and maturity. Therefore in order to construct a recursive Hansel and Gretel dialog for audience of all ages, two lists of sentences are added to feeding: In sociosexual and parenting contexts. The self-preservation list is paired with one of the remaining three, representing three stages of age; youth, adulthood and maturity. The single thread story of Hansel and Gretel serves as a template for recursive dialog, making it possible to create alternative threads and unbound possibilities for plots, thereby duplicating the story structure without repeating the narrative.
Transforming Graph Data for Statistical Relational Learning
Rossi, R. A., McDowell, L. K., Aha, D. W., Neville, J.
Relational data representations have become an increasingly important topic due to the recent proliferation of network datasets (e.g., social, biological, information networks) and a corresponding increase in the application of Statistical Relational Learning (SRL) algorithms to these domains. In this article, we examine and categorize techniques for transforming graph-based relational data to improve SRL algorithms. In particular, appropriate transformations of the nodes, links, and/or features of the data can dramatically affect the capabilities and results of SRL algorithms. We introduce an intuitive taxonomy for data representation transformations in relational domains that incorporates link transformation and node transformation as symmetric representation tasks. More specifically, the transformation tasks for both nodes and links include (i) predicting their existence, (ii) predicting their label or type, (iii) estimating their weight or importance, and (iv) systematically constructing their relevant features. We motivate our taxonomy through detailed examples and use it to survey competing approaches for each of these tasks. We also discuss general conditions for transforming links, nodes, and features. Finally, we highlight challenges that remain to be addressed.
A Tutorial on Dual Decomposition and Lagrangian Relaxation for Inference in Natural Language Processing
Dual decomposition, and more generally Lagrangian relaxation, is a classical method for combinatorial optimization; it has recently been applied to several inference problems in natural language processing (NLP). This tutorial gives an overview of the technique. We describe example algorithms, describe formal guarantees for the method, and describe practical issues in implementing the algorithms. While our examples are predominantly drawn from the NLP literature, the material should be of general relevance to inference problems in machine learning. A central theme of this tutorial is that Lagrangian relaxation is naturally applied in conjunction with a broad class of combinatorial algorithms, allowing inference in models that go significantly beyond previous work on Lagrangian relaxation for inference in graphical models.
Exploiting Locality in Searching the Web
Published experiments on spidering the Web suggest that, given training data in the form of a (relatively small) subgraph of the Web containing a subset of a selected class of target pages, it is possible to conduct a directed search and find additional target pages significantly faster (with fewer page retrievals) than by performing a blind or uninformed random or systematic search, e.g., breadth-first search. If true, this claim motivates a number of practical applications. Unfortunately, these experiments were carried out in specialized domains or under conditions that are difficult to replicate. We present and apply an experimental framework designed to reexamine and resolve the basic claims of the earlier work, so that the supporting experiments can be replicated and built upon. We provide high-performance tools for building experimental spiders, make use of the ground truth and static nature of the WT10g TREC Web corpus, and rely on simple well understand machine learning techniques to conduct our experiments. In this paper, we describe the basic framework, motivate the experimental design, and report on our findings supporting and qualifying the conclusions of the earlier research.
Optimal Limited Contingency Planning
Meuleau, Nicolas, Smith, David
For a given problem, the optimal Markov policy can be considerred as a conditional or contingent plan containing a (potentially large) number of branches. Unfortunately, there are applications where it is desirable to strictly limit the number of decision points and branches in a plan. For example, it may be that plans must later undergo more detailed simulation to verify correctness and safety, or that they must be simple enough to be understood and analyzed by humans. As a result, it may be necessary to limit consideration to plans with only a small number of branches. This raises the question of how one goes about finding optimal plans containing only a limited number of branches. In this paper, we present an any-time algorithm for optimal k-contingency planning (OKP). It is the first optimal algorithm for limited contingency planning that is not an explicit enumeration of possible contingent plans. By modelling the problem as a Partially Observable Markov Decision Process, it implements the Bellman optimality principle and prunes the solution space. We present experimental results of applying this algorithm to some simple test cases.
Active Learning with Distributional Estimates
Roeder, Jens, Nadler, Boaz, Kunzmann, Kevin, Hamprecht, Fred A.
Active Learning (AL) is increasingly important in a broad range of applications. Two main AL principles to obtain accurate classification with few labeled data are refinement of the current decision boundary and exploration of poorly sampled regions. In this paper we derive a novel AL scheme that balances these two principles in a natural way. In contrast to many AL strategies, which are based on an estimated class conditional probability ^p(y|x), a key component of our approach is to view this quantity as a random variable, hence explicitly considering the uncertainty in its estimated value. Our main contribution is a novel mathematical framework for uncertainty-based AL, and a corresponding AL scheme, where the uncertainty in ^p(y|x) is modeled by a second-order distribution. On the practical side, we show how to approximate such second-order distributions for kernel density classification. Finally, we find that over a large number of UCI, USPS and Caltech4 datasets, our AL scheme achieves significantly better learning curves than popular AL methods such as uncertainty sampling and error reduction sampling, when all use the same kernel density classifier.
Exploiting compositionality to explore a large space of model structures
Grosse, Roger, Salakhutdinov, Ruslan R, Freeman, William T., Tenenbaum, Joshua B.
The recent proliferation of richly structured probabilistic models raises the question of how to automatically determine an appropriate model for a dataset. We investigate this question for a space of matrix decomposition models which can express a variety of widely used models from unsupervised learning. To enable model selection, we organize these models into a context-free grammar which generates a wide variety of structures through the compositional application of a few simple rules. We use our grammar to generically and efficiently infer latent components and estimate predictive likelihood for nearly 2500 structures using a small toolbox of reusable algorithms. Using a greedy search over our grammar, we automatically choose the decomposition structure from raw data by evaluating only a small fraction of all models. The proposed method typically finds the correct structure for synthetic data and backs off gracefully to simpler models under heavy noise. It learns sensible structures for datasets as diverse as image patches, motion capture, 20 Questions, and U.S. Senate votes, all using exactly the same code.