AITopics | Agents

Collaborating Authors

Agents

News Overviews Instructional Materials AI-Alerts Classics

Combinatorial Bandits for Incentivizing Agents with Dynamic Preferences

Fiez, Tanner, Sekar, Shreyas, Zheng, Liyuan, Ratliff, Lillian J.

arXiv.org Machine LearningJul-6-2018

The design of personalized incentives or recommendations to improve user engagement is gaining prominence as digital platform providers continually emerge. We propose a multi-armed bandit framework for matching incentives to users, whose preferences are unknown a priori and evolving dynamically in time, in a resource constrained environment. We design an algorithm that combines ideas from three distinct domains: (i) a greedy matching paradigm, (ii) the upper confidence bound algorithm (UCB) for bandits, and (iii) mixing times from the theory of Markov chains. For this algorithm, we provide theoretical bounds on the regret and demonstrate its performance via both synthetic and realistic (matching supply and demand in a bike-sharing platform) examples.

artificial intelligence, data mining, machine learning, (20 more...)

arXiv.org Machine Learning

1807.02297

Country:

Europe > Italy > Tuscany > Florence (0.04)
Asia > Japan > Honshū > Tōhoku (0.04)

Genre: Research Report (0.63)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.49)

Add feedback

Multi-robot Path Planning in Well-formed Infrastructures: Prioritized Planning vs. Prioritized Wait Adjustment (Preliminary Results)

Andreychuk, Anton, Yakovlev, Konstantin

arXiv.org Artificial IntelligenceJul-5-2018

We study the problem of planning collision-free paths for a group of homogeneous robots. We propose a novel approach for turning the paths that were planned egocentrically by the robots, e.g. without taking other robots' moves into account, into collision-free trajectories and evaluate it empirically. Suggested algorithm is much faster (up to one order of magnitude) than state-of-the-art but this comes at the price of notable drop-down of the solution cost.

artificial intelligence, planning & scheduling, robot, (13 more...)

arXiv.org Artificial Intelligence

1807.01909

Country:

Europe > Russia (0.04)
Asia > Russia (0.04)

Genre: Research Report > Promising Solution (0.49)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.70)

Add feedback

DeepMind AI's new trick is playing 'Quake III Arena' like a human

#artificialintelligenceJul-4-2018, 17:41:01 GMT

The team focused on a capture the flag mode, one in which the map changes from match to match. Its AI agents had to learn general strategies to be able to adapt to each new map, something humans do easily. The agents also needed to both cooperate with team members as well as compete against the opposite team, and be able to adjust to different enemy play styles. "Our agents must learn from scratch how to see, act, cooperate, and compete in unseen environments, all from a single reinforcement signal per match: whether their team won or not," wrote the researchers in a blog post. They trained a population of AI-powered agents that learn by playing the game, much like we do.

large language model, machine learning, natural language, (7 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.45)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.40)

Add feedback

The Power of Verification for Greedy Mechanism Design

Fotakis, Dimitris, Krysta, Piotr, Ventre, Carmine

Journal of Artificial Intelligence ResearchJul-4-2018

Greedy algorithms are known to provide, in polynomial time, near optimal approximation guarantees for Combinatorial Auctions (CAs) with multidimensional bidders. It is known that truthful greedy-like mechanisms for CAs with multi-minded bidders do not achieve good approximation guarantees. In this work, we seek a deeper understanding of greedy mechanism design and investigate under which general assumptions, we can have efficient and truthful greedy mechanisms for CAs. Towards this goal, we use the framework of priority algorithms and weak and strong verification, where the bidders are not allowed to overbid on their winning set or on any subset of this set, respectively. We provide a complete characterization of the power of weak verification showing that it is sufficient and necessary for any greedy fixed priority algorithm to become truthful with the use of money or not, depending on the ordering of the bids. Moreover, we show that strong verification is sufficient and necessary to obtain a 2-approximate truthful mechanism with money, based on a known greedy algorithm, for the problem of submodular CAs in finite bidding domains. Our proof is based on an interesting structural analysis of the strongly connected components of the declaration graph.

algorithm, artificial intelligence, verification, (15 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.1.11215

AI Access Foundation

11215

Journal of Artificial Intelligence Research

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > United Kingdom > England > Merseyside > Liverpool (0.04)
Europe > United Kingdom > England > Essex (0.04)
Europe > Greece > Attica > Athens (0.04)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Model-Based Reasoning (0.62)

Add feedback

DeepMind AI's new trick is playing 'Quake III Arena' like a human

EngadgetJul-3-2018, 22:03:15 GMT

Research in AI continues to make video games better. The technology informs NPCs that can move and fight more convincingly, orcs with personalities and ever-more realistic visuals. Now researchers at DeepMind have taught an AI to play a customized version of Quake III Arena like a human. The team focused on a capture the flag mode, one in which the map changes from match to match. Its AI agents had to learn general strategies to be able to adapt to each new map, something humans do easily.

large language model, machine learning, natural language, (7 more...)

Engadget

Industry: Leisure & Entertainment > Games > Computer Games (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.63)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.63)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.43)

Add feedback

Human-level performance in first-person multiplayer games with population-based deep reinforcement learning

Jaderberg, Max, Czarnecki, Wojciech M., Dunning, Iain, Marris, Luke, Lever, Guy, Castaneda, Antonio Garcia, Beattie, Charles, Rabinowitz, Neil C., Morcos, Ari S., Ruderman, Avraham, Sonnerat, Nicolas, Green, Tim, Deason, Louise, Leibo, Joel Z., Silver, David, Hassabis, Demis, Kavukcuoglu, Koray, Graepel, Thore

arXiv.org Machine LearningJul-3-2018

Recent progress in artificial intelligence through reinforcement learning (RL) has shown great success on increasingly complex single-agent environments and two-player turn-based games. However, the real-world contains multiple agents, each learning and acting independently to cooperate and compete with other agents, and environments reflecting this degree of complexity remain an open challenge. In this work, we demonstrate for the first time that an agent can achieve human-level in a popular 3D multiplayer first-person video game, Quake III Arena Capture the Flag, using only pixels and game points as input. These results were achieved by a novel two-tier optimisation process in which a population of independent RL agents are trained concurrently from thousands of parallel matches with agents playing in teams together and against each other on randomly generated environments. Each agent in the population learns its own internal reward signal to complement the sparse delayed reward from winning, and selects actions using a novel temporally hierarchical representation that enables the agent to reason at multiple timescales. During game-play, these agents display human-like behaviours such as navigating, following, and defending based on a rich learned representation that is shown to encode high-level game knowledge. In an extensive tournament-style evaluation the trained agents exceeded the win-rate of strong human players both as teammates and opponents, and proved far stronger than existing state-of-the-art agents. These results demonstrate a significant jump in the capabilities of artificial agents, bringing us closer to the goal of human-level intelligence.

artificial intelligence, machine learning, reinforcement learning, (19 more...)

arXiv.org Machine Learning

1807.01281

Country: Europe > United Kingdom > England > Greater London > London (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Leisure & Entertainment > Games > Computer Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Can Artificial Intelligence End Your Video Buffering Problems? - Muvi

#artificialintelligenceJul-2-2018, 16:11:01 GMT

Currently, we stand on the brink of a fourth Industrial revolution. Artificial Intelligence or AI is the intelligence demonstrated by machines for performing tasks. It is a specialized section of computer science which focuses on creating intelligent machines that think, react and work like humans. Some of the activities computers with artificial intelligence are built for includes problem solving, learning, analysing, speech recognition, and much more. AI is a vast field in itself, and it encompasses a wide spectrum of technologies such as Machine Learning, Automated Intelligence System, Deep learning, Neural Network, Computational Argumentation, and Multi-agent Systems to solve problems that currently seem impossible.

artificial intelligence end, deep learning, machine learning, (2 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.33)

Add feedback

Path Finding for the Coalition of Co-operative Agents Acting in the Environment with Destructible Obstacles

Andreychuk, Anton, Yakovlev, Konstantin

arXiv.org Artificial IntelligenceJul-2-2018

The problem of planning a set of paths for the coalition of robots (agents) with different capabilities is considered in the paper. Some agents can modify the environment by destructing the obstacles thus allowing the other ones to shorten their paths to the goal. As a result the mutual solution of lower cost, e.g. time to completion, may be acquired. We suggest an original procedure to identify the obstacles for further removal that can be embedded into almost any heuristic search planner (we use Theta*) and evaluate it empirically. Results of the evaluation show that time-to-complete the mission can be decreased up to 9-12 % by utilizing the proposed technique.

artificial intelligence, obstacle, planning & scheduling, (17 more...)

arXiv.org Artificial Intelligence

1807.00771

Country:

Asia > Russia (0.05)
Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.05)

Genre: Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.97)

Add feedback

Reports of the AAAI 2017 Fall Symposium Series

Flenner, Arjuna (NAVAIR China Lake) | Fraune, Marlena R. (Indiana University) | Hiatt, Laura M. (Naval Research Laboratory (NRL)) | Kendall, Tony (Naval Postgraduate School) | Laird, John E. (University of Michigan) | Lebiere, Christian (Carnegie Mellon University) | Rosenbloom, Paul S. (Institute for Creative Technologies, University of Southern California) | Stein, Frank (IBM) | Topp, Elin A. (Lund University) | Unhelkar, Vaibhav V. (Massachusetts Institute of Technology) | Zhao, Ying (Naval Postgraduate School)

AI MagazineJul-1-2018

The AAAI 2017 Fall Symposium Series was held Thursday through Saturday, November 9–11, at the Westin Arlington Gateway in Arlington, Virginia, adjacent to Washington, DC. The titles of the six symposia were Artificial Intelligence for Human-Robot Interaction; Cognitive Assistance in Government and Public Sector Applications; Deep Models and Artificial Intelligence for Military Applications: Potentials, Theories, Practices, Tools and Risks; Human-Agent Groups: Studies, Algorithms and Challenges; Natural Communication for Human-Robot Collaboration; and A Standard Model of the Mind. The highlights of each symposium (except the Natural Communication for Human-Robot Collaboration symposium, whose organizers did not submit a report) are presented in this report.

artificial intelligence, machine learning, symposium, (15 more...)

AI Magazine

Country:

North America > United States > District of Columbia > Washington (0.24)
North America > United States > Virginia > Arlington County > Arlington (0.24)
North America > United States > Indiana (0.05)
(12 more...)

Industry:

Law (1.00)
Information Technology (1.00)
Health & Medicine (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.35)

Add feedback

Goal Reasoning: Foundations, Emerging Applications, and Prospects

Aha, David W. (NRL)

AI MagazineJul-1-2018

Goal reasoning (GR) has a bright future as a foundation for the research and development of intelligent agents. GR is the study of agents that can deliberate on and self-select their goals/objectives, which is a desirable capability for some applications of deliberative autonomy. While studied in diverse AI sub-communities for multiple applications, our group has focused on how GR can play a key role for controlling autonomous systems. Thus, its importance is rapidly growing and it merits increased attention, particularly from the perspective of research on AI safety. In this article, I introduce GR, briefly relate it to other AI topics, summarize some of our group’s work on GR foundations and emerging applications, and describe some current and future research directions.

artificial intelligence, machine learning, planning & scheduling, (16 more...)

AI Magazine

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.06)
North America > United States > District of Columbia > Washington (0.04)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
(14 more...)

Genre: Research Report (0.67)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Government > Military (1.00)
Leisure & Entertainment (0.67)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
(2 more...)

Add feedback