AITopics

Parkes, David C., Yanovsky, Dimah, Singh, Satinder P.

Approximately Efficient Online Mechanism Design

Neural Information Processing Systems, Dec-31-2005

Online mechanism design (OMD) addresses the problem of sequential decision making in a stochastic environment with multiple self-interested agents. The goal in OMD is to make value-maximizing decisions despite this self-interest. In previous work we presented a Markov decision process (MDP)-based approach to OMD in large-scale problem domains. In practice the underlying MDP needed to solve OMD is too large and hence the mechanism must consider approximations. This raises the possibility that agents may be able to exploit the approximation for selfish gain. We adopt sparse-sampling-based MDP algorithms to implement ɛ-efficient policies, and retain truth-revelation as an approximate Bayesian-Nash equilibrium. Our approach is empirically illustrated in the context of the dynamic allocation of WiFi connectivity to users in a coffeehouse.

agent, artificial intelligence, machine learning, (17 more...)

Country: North America > United States (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.72)
Information Technology > Artificial Intelligence > Representation & Reasoning > Model-Based Reasoning (0.62)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.49)

Neural Information Processing Systems, Dec-31-2005

Convergence and No-Regret in Multiagent Learning

Bowling, Michael

Learning in a multiagent system is a challenging problem due to two key factors. First, if other agents are simultaneously learning then the environment is no longer stationary, thus undermining convergence guarantees. Second, learning is often susceptible to deception, where the other agents may be able to exploit a learner's particular dynamics. In the worst case, this could result in poorer performance than if the agent was not learning at all. These challenges are identifiable in the two most common evaluation criteria for multiagent learning algorithms: convergence and regret. Algorithms focusing on convergence or regret in isolation are numerous. In this paper, we seek to address both criteria in a single algorithm by introducing GIGA-WoLF, a learning algorithm for normal-form games. We prove the algorithm guarantees at most zero average regret, while demonstrating the algorithm converges in many situations of self-play. We prove convergence in a limited setting and give empirical results in a wider variety of situations. These results also suggest a third new learning criterion combining convergence and regret, which we call negative non-convergence regret (NNR).

algorithm, artificial intelligence, machine learning, (15 more...)

Country: North America > Canada > Alberta (0.29)

Industry: Leisure & Entertainment > Games (0.95)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Beetz, M., Grosskreutz, H.

Probabilistic Hybrid Action Models for Predicting Concurrent Percept-driven Robot Behavior

Journal of Artificial Intelligence Research, Dec-15-2005

Most autonomous robots are equipped with restricted, unreliable, and inaccurate sensors and effectors and operate in complex and dynamic environments. A successful approach to deal with the resulting uncertainty is the use of controllers that prescribe the robots' behavior in terms of concurrent reactive plans (CRPs) -- plans that specify how the robots are to react to sensory input in order to accomplish their jobs reliably (e.g., McDermott, 1992a; Beetz, 1999). Reactive plans are successfully used to produce situation specific behavior, to detect problems and recover from them automatically, and to recognize and exploit opportunities (Beetz et al., 2001). These kinds of behaviors are particularly important for autonomous robots that have only uncertain information about the world, act in dynamically changing environments, and are to accomplish complex tasks efficiently. Besides reliability and flexibility, foresight is another important capability of competent autonomous robots (McDermott, 1992a).

execution scenario, navigation plan, robot, (13 more...)

doi: 10.1613/jair.1565

AI Access Foundation

10432

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > Washington > King County > Seattle (0.04)
North America > United States > California > San Francisco County > San Francisco (0.04)
(14 more...)

Genre: Research Report (0.67)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
(4 more...)

The Coevolution of AI and AAAI

Mackworth, Alan K.

AI Magazine, Dec-15-2005

AI and AAAI are coevolving. As AI matures, its focus is shifting from inward-looking to outward-looking. Some of the new concerns of the field are social awareness, networking, cross-disciplinarity, globalization, and open access. AAAI must reflect and support those concerns. AI is now a mature discipline.

aaai, ai and aaai, artificial intelligence, (10 more...)

Country: North America > Canada (0.31)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.31)

Ramos, Vitorino, Fernandes, Carlos, Rosa, Agostinho C.

On Self-Regulated Swarms, Societal Memory, Speed and Dynamics

arXiv.org Artificial Intelligence, Dec-1-2005

Wasps, bees, ants and termites all make effective use of their environment and resources by displaying collective "swarm" intelligence. Termite colonies, for instance, build nests with a complexity far beyond the comprehension of the individual termite, while ant colonies dynamically allocate labor to various vital tasks such as foraging or defense without any central decision-making ability. Recent research suggests that microbial life can be even richer: highly social, intricately networked, and teeming with interactions, as found in bacteria. What is striking in these observations is that both ant colonies and bacteria have similar natural mechanisms, based on Stigmergy and Self-Organization, from which coherent and sophisticated patterns of global foraging behavior emerge. Keeping in mind the above characteristics, we propose a Self-Regulated Swarm (SRS) algorithm which hybridizes the advantageous characteristics of Swarm Intelligence, such as the emergence of a societal environmental memory or cognitive map via collective pheromone laying in the landscape (properly balancing the exploration/exploitation nature of our dynamic search strategy), with a simple Evolutionary mechanism that, through a direct reproduction procedure linked to local environmental features, is able to self-regulate the above exploratory swarm population, speeding it up globally.

ant, evolutionary algorithm, machine learning, (16 more...)

cs/0512002

Country:

Europe (1.00)
North America > United States > Massachusetts (0.28)
North America > United States > California (0.28)

Genre:

Research Report > New Finding (0.54)
Research Report > Experimental Study (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)

Dutta, P. S., Jennings, N. R., Moreau, L.

Cooperative Information Sharing to Improve Distributed Learning in Multi-Agent Systems

Journal of Artificial Intelligence Research, Oct-1-2005

Effective coordination of agents' actions in partially-observable domains is a major challenge of multi-agent systems research. To address this, many researchers have developed techniques that allow the agents to make decisions based on estimates of the states and actions of other agents that are typically learnt using some form of machine learning algorithm. Nevertheless, many of these approaches fail to provide an actual means by which the necessary information is made available so that the estimates can be learnt. To this end, we argue that cooperative communication of state information between agents is one such mechanism. However, in a dynamically changing environment, the accuracy and timeliness of this communicated information determine the fidelity of the learned estimates and the usefulness of the actions taken based on these. Given this, we propose a novel information-sharing protocol, post-task-completion sharing, for the distribution of state information. We then show, through a formal analysis, the improvement in the quality of estimates produced using our strategy over the widely used protocol of sharing information between nearest neighbours. Moreover, communication heuristics designed around our information-sharing principle are subjected to empirical evaluation along with other benchmark strategies (including Littman's Q-routing and Stone's TPOT-RL) in a simulated call-routing application. These studies, conducted across a range of environmental settings, show that, compared to the different benchmarks used, our strategy generates an improvement of up to 60% in the call connection rate; of more than 1000% in the ability to connect long-distance calls; and incurs as low as 0.25 of the message overhead.

cooperative information, deviation, load 0, (7 more...)

doi: 10.1613/jair.1735

AI Access Foundation

10425

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

Synthetic Adversaries for Urban Combat Training

Wray, Robert E., Laird, John E., Nuxoll, Andrew, Stokes, Devvan, Kerfoot, Alex

AI Magazine, Sep-15-2005

Six high-level requirements drive the implementation of intelligent synthetic adversaries for training: (1) competence, (2) taskability, (3) observational fidelity, (4) behavior variability, [...] most difficult tasks soldiers perform. Frequent training is an essential element in reducing casualties. [...] environments is costly and restricted to physical mockups of buildings and small towns. [...] Environments (VIRTE) program is developing immersive virtual trainers for military operations on urbanized terrain (MOUT). In this trainer, four-person fire teams of U.S. Marines are situated in a virtual urban environment and [...] Virtual opponents are required to populate the environment and challenge the trainees. This article describes the general requirements for virtual MOUT opponents and our development of synthetic adversaries to meet [...] Competence: The adversaries must perform the tactics and missions humans perform in [...] For this application, the adversaries' goal is to defend a small multistoried [...] The agents must move through the environment, identify tactically relevant features (such as escape routes), and communicate and coordinate with other agents. [...] new missions for different training scenarios, and they must change their objectives [...] Behavior is not scripted or specific to a particular mission, terrain, or operational setting, providing flexibility for operational use.

agent, artificial intelligence, game technology, (19 more...)

Country: North America > United States (1.00)

Industry:

Leisure & Entertainment > Games > Computer Games (1.00)
Government > Military (1.00)
Education > Educational Technology > Educational Software > Computer Based Training (0.34)

Technology:

Information Technology > Game Technology (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Games > Computer Games (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)

Making Better Recommendations with Online Profiling Agents

Oh, Danny, Tan, Chew Lim

AI Magazine, Sep-15-2005

In recent years, we have witnessed the success of autonomous agents applying machine-learning techniques across a wide range of applications. However, agents applying the same machine-learning techniques in online applications have not been so successful. Even agent-based hybrid recommender systems that combine information filtering techniques with collaborative filtering techniques have been applied with considerable success only to simple consumer goods such as movies, books, clothing, and food. Yet complex, adaptive autonomous agent systems that can handle complex goods such as real estate, vacation plans, insurance, mutual funds, and mortgages have emerged. To a large extent, the reinforcement learning methods developed to aid agents in learning have been more successfully deployed in offline applications. The inherent limitations in these methods have rendered them somewhat ineffective in online applications. In this article, we postulate that a small amount of prior knowledge and human-provided input can dramatically speed up online learning. We demonstrate that our agent HumanE -- with its prior knowledge or "experiences" about the real estate domain -- can effectively assist users in identifying requirements, especially unstated ones, quickly and unobtrusively.

artificial intelligence, humane, machine learning, (18 more...)

Country: North America > United States (1.00)

Industry:

Banking & Finance > Real Estate (1.00)
Education > Educational Setting > Online (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)