AITopics | Agents

Automated Intersection Management with MiniZinc

Rahman, Md. Mushfiqur, Zahin, Nahian Muhtasim, Mahmud, Kazi Raiyan, Ansar, Md. Azmaeen Bin

arXiv.org Artificial Intelligence, Nov-15-2020

Ill-managed intersections are a primary reason behind the growing traffic problem in urban areas, leading to suboptimal traffic flow and unnecessary deadlocks. In this paper, we propose an automated intersection management system that extracts data from a well-defined grid of sensors and optimizes traffic flow by controlling traffic signals. The data extraction mechanism is independent of the optimization algorithm, and this paper primarily emphasizes the latter. We have used the MiniZinc modeling language to define our system as a constraint satisfaction problem that can be solved using any off-the-shelf solver. The proposed system performs much better than the systems currently in use. Our system reduces the mean and the standard deviation of vehicle waiting time and avoids deadlocks.

average delay, intersection, vehicle, (14 more...)

arXiv.org Artificial Intelligence

2011.07509

Country:

Asia > Bangladesh > Dhaka Division > Dhaka District > Dhaka (0.05)
Africa > Ghana (0.04)

Genre: Research Report (0.50)

Industry:

Transportation > Infrastructure & Services (1.00)
Transportation > Ground > Road (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.88)

Distributed Bandits: Probabilistic Communication on $d$-regular Graphs

Madhushani, Udari, Leonard, Naomi Ehrich

arXiv.org Machine Learning, Nov-15-2020

We study the decentralized multi-agent multi-armed bandit problem for agents that communicate with probability over a network defined by a $d$-regular graph. Every edge in the graph has probabilistic weight $p$ to account for the ($1\!-\!p$) probability of a communication link failure. At each time step, each agent chooses an arm and receives a numerical reward associated with the chosen arm. After each choice, each agent observes the last obtained reward of each of its neighbors with probability $p$. We propose a new Upper Confidence Bound (UCB) based algorithm and analyze how agent-based strategies contribute to minimizing group regret in this probabilistic communication setting. We provide theoretical guarantees that our algorithm outperforms state-of-the-art algorithms. We illustrate our results and validate the theoretical claims using numerical simulations.
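The setting in this abstract can be made concrete with a small simulation. The following is a minimal sketch only, not the authors' algorithm: it runs a standard UCB1 index per agent on a ring (a 2-regular graph) and lets each agent fold in a neighbor's latest reward with probability $p$. The function names, the Bernoulli rewards, the exploration constant, and the choice to draw each direction of a link independently are all assumptions made for illustration.

```python
import math
import random


def ring_neighbors(n_agents):
    """Neighbors on a ring, the simplest 2-regular graph (assumed topology)."""
    return {i: [(i - 1) % n_agents, (i + 1) % n_agents] for i in range(n_agents)}


def run_ucb(means, n_agents=6, p=0.5, horizon=500, seed=0):
    """Simulate UCB1 agents that see each neighbor's last reward w.p. p.

    Returns pulls[i][a]: how often agent i itself pulled arm a.
    """
    rng = random.Random(seed)
    n_arms = len(means)
    neighbors = ring_neighbors(n_agents)
    # counts/sums include both own pulls and observed neighbor rewards.
    counts = [[0] * n_arms for _ in range(n_agents)]
    sums = [[0.0] * n_arms for _ in range(n_agents)]
    pulls = [[0] * n_arms for _ in range(n_agents)]

    for t in range(1, horizon + 1):
        choices = []
        for i in range(n_agents):
            untried = [a for a in range(n_arms) if counts[i][a] == 0]
            if untried:
                arm = untried[0]
            else:
                # Standard UCB1 index: empirical mean plus confidence bonus.
                arm = max(
                    range(n_arms),
                    key=lambda a: sums[i][a] / counts[i][a]
                    + math.sqrt(2.0 * math.log(t) / counts[i][a]),
                )
            # Bernoulli reward (assumed reward model).
            reward = 1.0 if rng.random() < means[arm] else 0.0
            choices.append((arm, reward))

        for i, (arm, reward) in enumerate(choices):
            counts[i][arm] += 1
            sums[i][arm] += reward
            pulls[i][arm] += 1
            for j in neighbors[i]:
                # Each observation succeeds with probability p; a failure
                # models a communication link failure with probability 1 - p.
                if rng.random() < p:
                    n_arm, n_reward = choices[j]
                    counts[i][n_arm] += 1
                    sums[i][n_arm] += n_reward
    return pulls
```

Calling `run_ucb([0.2, 0.8], n_agents=4, p=0.5, horizon=200)` returns one pull-count list per agent; comparing runs with `p=0.0` and `p=1.0` gives a rough feel for how much the shared observations help.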

agent, algorithm, group cumulative regret, (13 more...)

arXiv.org Machine Learning

2011.0772

Country:

North America > United States (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Africa > South Sudan > Equatoria > Central Equatoria > Juba (0.04)

Genre: Research Report > New Finding (0.48)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.50)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

A Distributed Privacy-Preserving Learning Dynamics in General Social Networks

Tao, Youming, Chen, Shuzhen, Li, Feng, Yu, Dongxiao, Yu, Jiguo, Sheng, Hao

arXiv.org Artificial Intelligence, Nov-14-2020

In this paper, we study a distributed privacy-preserving learning problem in general social networks. Specifically, we consider a very general problem setting where the agents in a given multi-hop social network are required to make sequential decisions to choose among a set of options characterized by unknown stochastic quality signals. Each agent is allowed to interact with its peers through multi-hop communications but with its privacy preserved. To serve the above goals, we propose a four-stage distributed social learning algorithm. In a nutshell, our algorithm proceeds iteratively, and in every round, each agent i) randomly perturbs its adoption to preserve privacy, ii) disseminates the perturbed adoption over the social network in a nearly uniform manner through random walks, iii) selects an option by referring to its peers' latest perturbed adoptions, and iv) decides whether or not to adopt the selected option according to its latest quality signal. Through theoretical analysis, we answer two fundamental algorithmic questions about the performance of our four-stage algorithm: on the one hand, we establish the convergence of our algorithm when there is a sufficient number of agents in the social network, each of which has only incomplete and perturbed knowledge as input; on the other hand, we reveal the quantitative trade-off between the privacy loss and the communication overhead required for convergence. We also perform extensive simulations to validate our theoretical analysis and to verify the efficacy of our algorithm.

agent, algorithm, probability, (15 more...)

arXiv.org Artificial Intelligence

2011.09845

Country:

Asia > Singapore (0.04)
Asia > China > Beijing > Beijing (0.04)
North America > United States (0.04)
(2 more...)

Genre: Research Report (0.64)

Industry:

Information Technology > Services (1.00)
Education (1.00)
Information Technology > Security & Privacy (0.93)

Technology:

Information Technology > Data Science > Data Mining > Big Data (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.34)

Opponent Learning Awareness and Modelling in Multi-Objective Normal Form Games

Rădulescu, Roxana, Verstraeten, Timothy, Zhang, Yijie, Mannion, Patrick, Roijers, Diederik M., Nowé, Ann

arXiv.org Artificial Intelligence, Nov-14-2020

Many real-world multi-agent interactions involve multiple distinct criteria, i.e. the payoffs are multi-objective in nature. However, the same multi-objective payoff vector may lead to different utilities for each participant. Therefore, it is essential for an agent to learn about the behaviour of other agents in the system. In this work, we present the first study of the effects of such opponent modelling on multi-objective multi-agent interactions with non-linear utilities. Specifically, we consider two-player multi-objective normal form games (MONFGs) with non-linear utility functions under the scalarised expected returns optimisation criterion. We contribute novel actor-critic and policy gradient formulations to allow reinforcement learning of mixed strategies in this setting, along with extensions that incorporate opponent policy reconstruction and learning with opponent learning awareness (i.e., learning while considering the impact of one's policy when anticipating the opponent's learning step). Empirical results in five different MONFGs demonstrate that opponent learning awareness and modelling can drastically alter the learning dynamics in this setting. When equilibria are present, opponent modelling can confer significant benefits on agents that implement it. When there are no Nash equilibria, opponent learning awareness and modelling allow agents to still converge to meaningful solutions that approximate equilibria.
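To make the scalarised expected returns (SER) criterion mentioned above concrete: under SER the utility function is applied to the expected payoff vector, whereas under the alternative expected scalarised returns (ESR) criterion the expectation is taken over the utilities of the individual outcomes. With a non-linear utility the two generally differ. The sketch below is illustrative only; the 2x2 payoff table and the product utility are hypothetical and are not taken from the paper.

```python
def expected_payoff(payoffs, row_strategy, col_strategy):
    """Expected multi-objective payoff vector under two mixed strategies.

    payoffs[i][j] is the payoff vector when row plays i and column plays j.
    """
    n_objectives = len(payoffs[0][0])
    expected = [0.0] * n_objectives
    for i, p_row in enumerate(row_strategy):
        for j, p_col in enumerate(col_strategy):
            for k in range(n_objectives):
                expected[k] += p_row * p_col * payoffs[i][j][k]
    return expected


def ser(utility, payoffs, row_strategy, col_strategy):
    """Scalarised expected returns: utility of the expected payoff vector."""
    return utility(expected_payoff(payoffs, row_strategy, col_strategy))


def esr(utility, payoffs, row_strategy, col_strategy):
    """Expected scalarised returns: expectation of per-outcome utilities."""
    return sum(
        p_row * p_col * utility(payoffs[i][j])
        for i, p_row in enumerate(row_strategy)
        for j, p_col in enumerate(col_strategy)
    )


# Hypothetical two-objective game and non-linear (product) utility.
PAYOFFS = [[(4, 0), (2, 2)], [(2, 2), (0, 4)]]


def product_utility(vector):
    return vector[0] * vector[1]


uniform = [0.5, 0.5]
print(ser(product_utility, PAYOFFS, uniform, uniform))  # 4.0
print(esr(product_utility, PAYOFFS, uniform, uniform))  # 2.0
```

Under uniform mixing the expected payoff vector is (2, 2), so SER gives 4.0, while ESR averages the outcome utilities 0, 4, 4, 0 and gives 2.0. This gap is why the choice of optimisation criterion matters once utilities are non-linear.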

agent, opponent, utility function, (16 more...)

arXiv.org Artificial Intelligence

2011.0729

Country:

Europe > Belgium (0.04)
Europe > Ireland (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(2 more...)

Genre:

Overview (1.00)
Research Report > New Finding (0.46)

Industry:

Education (0.68)
Leisure & Entertainment > Games (0.68)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

OpenAI proposes using reciprocity to encourage AI agents to work together

#artificialintelligence, Nov-13-2020, 17:50:35 GMT

Many real-world problems require complex coordination between multiple agents -- e.g., people or algorithms. A machine learning technique called multi-agent reinforcement learning (MARL) has shown success here, mainly in two-team games like Go, Dota 2, StarCraft, hide-and-seek, and capture the flag. But the human world is far messier than games. That's because humans face social dilemmas at multiple scales, from the interpersonal to the international, and they must decide not only how to cooperate but when to cooperate. To address this challenge, researchers at OpenAI propose training AI agents with what they call randomized uncertain social preferences (RUSP), an augmentation that expands the distribution of environments in which reinforcement learning agents train.

agent, encourage ai agent, reciprocity, (9 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.62)

DeepMind Lab2D

Beattie, Charles, Köppe, Thomas, Duéñez-Guzmán, Edgar A., Leibo, Joel Z.

arXiv.org Artificial Intelligence, Nov-13-2020

We present DeepMind Lab2D, a scalable environment simulator for artificial intelligence research that facilitates researcher-led experimentation with environment design. DeepMind Lab2D was built with the specific needs of multi-agent deep reinforcement learning researchers in mind, but it may also be useful beyond that particular subfield.

agent, deepmind lab2d, learning, (14 more...)

arXiv.org Artificial Intelligence

2011.07027

Country: Europe > United Kingdom (0.04)

Genre: Research Report (0.40)

Industry:

Leisure & Entertainment > Games > Computer Games (0.69)
Education (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)

Blockchain helps determine 'green' parking price in Munich

#artificialintelligence, Nov-12-2020, 14:10:11 GMT

Token reward airdrops aim to "nudge" car users into more sustainable behaviors. Artificial intelligence specialist Fetch.ai and blockchain solutions provider Datarella have announced the launch of a "Smart City" infrastructure trial in Munich, Germany, on Nov. 12. The trial will be centered around the Connex Buildings business center in the city and will use a multi-agent blockchain-based AI platform to optimize parking space management at the building. This is designed to encourage reduced car use, and hence lower CO2 emissions. Autonomous economic agents will negotiate the "price" of parking spaces between the operators and users.

munich, parking price, parking space, (6 more...)

#artificialintelligence

Country: Europe > Germany > Bavaria > Upper Bavaria > Munich (0.62)

Industry: Banking & Finance (0.34)

Technology:

Information Technology > e-Commerce > Financial Technology (0.96)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.59)

Learning Latent Representations to Influence Multi-Agent Interaction

Xie, Annie, Losey, Dylan P., Tolsma, Ryan, Finn, Chelsea, Sadigh, Dorsa

arXiv.org Artificial Intelligence, Nov-12-2020

Seamlessly interacting with humans or robots is hard because these agents are non-stationary. They update their policy in response to the ego agent's behavior, and the ego agent must anticipate these changes to co-adapt. Inspired by humans, we recognize that robots do not need to explicitly model every low-level action another agent will make; instead, we can capture the latent strategy of other agents through high-level representations. We propose a reinforcement learning-based framework for learning latent representations of an agent's policy, where the ego agent identifies the relationship between its behavior and the other agent's future strategy. The ego agent then leverages these latent dynamics to influence the other agent, purposely guiding them towards policies suitable for co-adaptation. Across several simulated domains and a real-world air hockey game, our approach outperforms the alternatives and learns to influence the other agent.

agent, ego agent, interaction, (14 more...)

arXiv.org Artificial Intelligence

2011.06619

Country:

North America > United States > Virginia (0.04)
North America > United States > Michigan > Washtenaw County > Ann Arbor (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Genre: Research Report (0.82)

Industry: Leisure & Entertainment > Sports > Hockey (0.70)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Performance of Bounded-Rational Agents With the Ability to Self-Modify

Tětek, Jakub, Sklenka, Marek, Gavenčiak, Tomáš

arXiv.org Artificial Intelligence, Nov-12-2020

Self-modification of agents embedded in complex environments is hard to avoid, whether it happens directly (e.g. modifying their own code) or indirectly (e.g. influencing the operator, exploiting bugs or the environment). While it has been argued that intelligent agents have an incentive to avoid modifying their utility function so that their future instances will work towards the same goals, it is not clear whether this also applies in non-dualistic scenarios, where the agent is embedded in the environment. The problem of self-modification safety is raised by Bostrom in Superintelligence (2014) in the context of safe AGI deployment. In contrast to Everitt et al. (2016), who formally show that providing an option to self-modify is harmless for perfectly rational agents, we show that for agents with bounded rationality, self-modification may cause exponential deterioration in performance and gradual misalignment of a previously aligned agent. We investigate how the size of this effect depends on the type and magnitude of imperfections in the agent's rationality ((1)-(4) below). We also discuss model assumptions and the wider problem and framing space. Specifically, we introduce several types of bounded-rational agent, which either (1) doesn't always choose the optimal action, (2) is not perfectly aligned with human values, (3) has an inaccurate model of the environment, or (4) uses the wrong temporal discounting factor. We show that while in cases (2)-(4) the misalignment caused by the agent's imperfection does not worsen over time, in case (1) the misalignment may grow exponentially.

agent, relative error, utility function, (14 more...)

arXiv.org Artificial Intelligence

2011.06275

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
North America > United States > New York > New York County > New York City (0.04)
Europe > Denmark > Capital Region > Copenhagen (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.93)

Voca: Leader in Voice AI Virtual Agents for Contact Centers

#artificialintelligence, Nov-11-2020, 19:10:27 GMT

Voca's yes attitude, ease in responsiveness, willingness to go above and beyond puts Voca at the top of their game, providing us with the innovative technology to generate more sales, leads, and remain cost-effective. Working with Voca places NRS as a leader in our field and makes us excited for the future.

contact center, voca, voice ai virtual agent

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.40)