AITopics | agent learn

agent learn

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

#IJCAI2025 distinguished paper: Combining MORL with restraining bolts to learn normative behaviour

AIHub, Sep-4-2025, 08:38:07 GMT

For many of us, artificial intelligence (AI) has become part of everyday life, and the rate at which we assign previously human roles to AI systems shows no signs of slowing down. AI systems are crucial ingredients of many technologies (e.g., self-driving cars, smart urban planning, digital assistants) across a growing number of domains. At the core of many of these technologies are autonomous agents: systems designed to act on behalf of humans and make decisions without direct supervision. In order to act effectively in the real world, these agents must be capable of carrying out a wide range of tasks despite possibly unpredictable environmental conditions, which often requires some form of machine learning (ML) for achieving adaptive behaviour.

agent, artificial intelligence, obligation, (15 more...)

AIHub

Country: Europe > Austria > Vienna (0.05)

Industry: Transportation > Ground > Road (0.36)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.69)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.57)

Learning to flock in open space by avoiding collisions and staying together

Brambati, Martino, Celani, Antonio, Gherardi, Marco, Ginelli, Francesco

arXiv.org Artificial Intelligence, Jun-23-2025

The synchronized flight of bird flocks, exemplified by starling murmurations, is perhaps the most striking example of collective behavior in natural systems, and has fascinated scholars for quite a long time [1]. Evolutionary biologists, for instance, have long debated the advantages of living in groups [2], which should offer increased protection from predation by diluting the individual risk and possibly confusing the attackers by the sheer size of the assembly. Flocking behavior involves a high degree of order in the individual directions of motion [3], and has been reproduced by minimal models of self-propelling particles (SPPs), such as Craig Reynolds' Boids [4] or the celebrated Vicsek model [5], which has long captivated the attention of statistical physicists and played a pivotal role in the birth of the active matter research field. The essential ingredient of these models is the tendency of individual particles to align their direction of motion with those of their local neighbours, which is enough to promote long-range order in systems with finite density (even in two spatial dimensions, due to the non-equilibrium nature of self-propelled particles), such as toy models with periodic boundary conditions. In open systems, constituted by a finite number of individuals in an open, infinite space, pure alignment interactions are however not enough to maintain group cohesion.
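The alignment rule described in this abstract can be illustrated with a minimal sketch of a Vicsek-style update in a periodic two-dimensional box. This is not the authors' code: the function names (`vicsek_step`, `polar_order`), the parameter defaults, and the noise convention (uniform in [-noise/2, noise/2]) are assumptions chosen for illustration.

```python
import math
import random


def vicsek_step(positions, headings, box=10.0, radius=1.0, speed=0.1,
                noise=0.0, rng=random):
    """Advance a 2D Vicsek-style flock by one time step (periodic box)."""
    new_headings = []
    for xi, yi in positions:
        sum_sin = sum_cos = 0.0
        for (xj, yj), theta_j in zip(positions, headings):
            # Minimum-image displacement, so neighbours wrap around the box.
            dx = (xj - xi + box / 2) % box - box / 2
            dy = (yj - yi + box / 2) % box - box / 2
            if dx * dx + dy * dy <= radius * radius:
                sum_sin += math.sin(theta_j)
                sum_cos += math.cos(theta_j)
        # Mean direction of all neighbours (the particle itself included),
        # perturbed by uniform angular noise in [-noise/2, noise/2].
        mean_angle = math.atan2(sum_sin, sum_cos)
        new_headings.append(mean_angle + noise * rng.uniform(-0.5, 0.5))
    # Every particle moves at constant speed along its new heading.
    new_positions = [
        ((x + speed * math.cos(t)) % box, (y + speed * math.sin(t)) % box)
        for (x, y), t in zip(positions, new_headings)
    ]
    return new_positions, new_headings


def polar_order(headings):
    """Magnitude of the mean heading vector: 1 = aligned, near 0 = disordered."""
    n = len(headings)
    mean_cos = sum(math.cos(t) for t in headings) / n
    mean_sin = sum(math.sin(t) for t in headings) / n
    return math.hypot(mean_cos, mean_sin)
```

Two particles within `radius` of each other adopt their common mean heading after one noiseless step, while particles farther apart keep their own headings. The sketch uses periodic boundaries, the setting in which the abstract says alignment alone suffices; it does not model the open-space case the paper addresses.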

artificial intelligence, machine learning, reinforcement learning, (19 more...)

arXiv.org Artificial Intelligence

2506.15587

Country: Europe > Italy (0.28)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.95)

Vision based driving agent for race car simulation environments

Bári, Gergely, Palkovics, László

arXiv.org Artificial Intelligence, Apr-15-2025

In recent years, autonomous driving has become a popular field of study. As control at the tire grip limit is essential during emergency situations, algorithms developed for racecars are useful for road cars too. This paper examines the use of Deep Reinforcement Learning (DRL) to solve the problem of "grip limit driving" in a simulated environment. The Proximal Policy Optimization (PPO) method is used to train an agent to control the steering wheel and pedals of the vehicle, using only visual inputs to achieve professional human lap times. The paper outlines the formulation of the task of time-optimal driving on a race track as a deep reinforcement learning problem, and explains the chosen observations, actions, and reward functions. The results demonstrate human-like learning and driving behavior that utilizes maximum tire grip potential.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

arXiv.org Artificial Intelligence

2504.10266

Genre: Research Report > New Finding (0.34)

Industry: Leisure & Entertainment > Sports > Motorsports (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Deep Learning Agents Trained For Avoidance Behave Like Hawks And Doves

Reddi, Aryaman, Vinnicombe, Glenn

arXiv.org Artificial Intelligence, Mar-14-2025

We present heuristically optimal strategies expressed by deep learning agents playing a simple avoidance game. We analyse the learning and behaviour of two agents within a symmetrical grid world that must cross paths to reach a target destination without crashing into each other or straying off of the grid world in the wrong direction. The agent policy is determined by one neural network that is employed in both agents. Our findings indicate that the fully trained network exhibits behaviour similar to that of the game Hawks and Doves, in that one agent employs an aggressive strategy to reach the target while the other learns how to avoid the aggressive agent.

agent, arxiv preprint arxiv, reinforcement learning, (12 more...)

arXiv.org Artificial Intelligence

2503.11452

Country:

Europe > Germany > Hesse > Darmstadt Region > Darmstadt (0.05)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.84)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.60)

Reinforcement-Learning based routing for packet-optical networks with hybrid telemetry

Navarro, A. L. García, Koneva, Nataliia, Sánchez-Macián, Alfonso, Hernández, José Alberto, de Dios, Óscar González, Rivas-Moscoso, J. M.

arXiv.org Artificial Intelligence, Jun-21-2024

This article provides a methodology and open-source implementation of Reinforcement Learning algorithms for finding optimal routes in a packet-optical network scenario. The algorithm uses measurements provided by the physical layer (pre-FEC bit error rate and propagation delay) and the link layer (link load) to configure a set of latency-based rewards and penalties based on such measurements. Then, the algorithm executes Q-learning based on this set of rewards for finding the optimal routing strategies. It is further shown that the algorithm dynamically adapts to changing network conditions by re-calculating optimal policies upon either link load changes or link degradation as measured by pre-FEC BER.
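The idea of running Q-learning over latency-based rewards can be sketched on a toy topology. This is not the article's open-source implementation: the four-node graph, the latency values, the function names (`train_q_routing`, `greedy_route`), and the hyperparameters are all hypothetical, and the reward is simply the negative link latency rather than a combination of pre-FEC BER, propagation delay, and link load. The sketch also assumes an acyclic graph in which every node can reach the destination.

```python
import random


def train_q_routing(latency, dest, episodes=500, alpha=0.5, gamma=1.0,
                    epsilon=0.2, seed=0):
    """Tabular Q-learning where the state is the current node and the
    action is the next hop; the reward is minus the link latency."""
    rng = random.Random(seed)
    q = {node: {nxt: 0.0 for nxt in nbrs} for node, nbrs in latency.items()}
    sources = [n for n in latency if n != dest and latency[n]]
    for _ in range(episodes):
        node = rng.choice(sources)
        while node != dest:
            hops = list(q[node])
            if rng.random() < epsilon:
                nxt = rng.choice(hops)            # explore a random next hop
            else:
                nxt = max(hops, key=q[node].get)  # exploit the best estimate
            reward = -latency[node][nxt]
            # Best value obtainable from the next node (0 at the destination).
            future = max(q[nxt].values()) if nxt != dest else 0.0
            q[node][nxt] += alpha * (reward + gamma * future - q[node][nxt])
            node = nxt
    return q


def greedy_route(q, src, dest):
    """Follow the highest-valued next hop from src until dest is reached."""
    route = [src]
    while route[-1] != dest:
        route.append(max(q[route[-1]], key=q[route[-1]].get))
    return route


# Hypothetical per-link latencies (arbitrary units).
latency = {
    "A": {"B": 1.0, "C": 4.0},
    "B": {"C": 1.0, "D": 5.0},
    "C": {"D": 1.0},
    "D": {},
}
q = train_q_routing(latency, dest="D")
print(greedy_route(q, "A", "D"))

# Simulate a degraded link and recompute the policy from scratch.
latency["B"]["C"] = 10.0
q = train_q_routing(latency, dest="D")
print(greedy_route(q, "A", "D"))
```

With these made-up latencies, the lowest-latency path from A to D is A, B, C, D (total 3.0) before the degradation and A, C, D (total 5.0) after it, so the second training run illustrates the kind of re-calculation the abstract describes.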

algorithm, penalty, reinforcement learning, (13 more...)

arXiv.org Artificial Intelligence

2406.12602

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
Europe > Spain > Galicia > Madrid (0.05)

Genre: Research Report (0.40)

Industry: Telecommunications > Networks (0.86)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Exploring the Benefits of Teams in Multiagent Learning

Radke, David, Larson, Kate, Brecht, Tim

arXiv.org Artificial Intelligence, Jul-31-2023

For problems requiring cooperation, many multiagent systems implement solutions among either individual agents or across an entire population towards a common goal. Multiagent teams are primarily studied when in conflict; however, organizational psychology (OP) highlights the benefits of teams among human populations for learning how to coordinate and cooperate. In this paper, we propose a new model of multiagent teams for reinforcement learning (RL) agents inspired by OP and early work on teams in artificial intelligence. We validate our model using complex social dilemmas that are popular in recent multiagent RL and find that agents divided into teams develop cooperative pro-social policies despite incentives to not cooperate. Furthermore, agents are better able to coordinate and learn emergent roles within their teams and achieve higher rewards compared to when the interests of all agents are aligned.

agent, artificial intelligence, team structure, (17 more...)

arXiv.org Artificial Intelligence

2205.02328

Country: North America > Canada > Ontario (0.04)

Genre: Research Report > New Finding (0.47)

Industry: Leisure & Entertainment > Games (0.46)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.46)

Towards a Better Understanding of Learning with Multiagent Teams

Radke, David, Larson, Kate, Brecht, Tim, Tilbury, Kyle

arXiv.org Artificial Intelligence, Jun-28-2023

While it has long been recognized that a team of individual learning agents can be greater than the sum of its parts, recent work has shown that larger teams are not necessarily more effective than smaller ones. In this paper, we study why and under which conditions certain team structures promote effective learning for a population of individual learning agents. We show that, depending on the environment, some team structures help agents learn to specialize into specific roles, resulting in more favorable global results. However, large teams create credit assignment challenges that reduce coordination, leading to large teams performing poorly compared to smaller ones. We support our conclusions with both theoretical analysis and empirical results.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

arXiv.org Artificial Intelligence

2306.16205

Country:

North America > United States > New York (0.04)
North America > Canada > Ontario (0.04)

Genre: Research Report > New Finding (0.88)

Industry:

Education (0.68)
Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.67)

A far-sighted approach to machine learning | G.R. Jenkin & Associates

#artificialintelligence, Jan-7-2023, 23:20:50 GMT

The players can cooperate to achieve an objective, and compete against other players with conflicting interests. Creating artificial intelligence agents that can learn to compete and cooperate as effectively as humans remains a thorny problem. A key challenge is enabling AI agents to anticipate future behaviors of other agents when they are all learning simultaneously. Because of the complexity of this problem, current approaches tend to be myopic; the agents can only guess the next few moves of their teammates or competitors, which leads to poor performance in the long run. Researchers from MIT, the MIT-IBM Watson AI Lab, and elsewhere have developed a new approach that gives AI agents a farsighted perspective.

agent, artificial intelligence, machine learning, (17 more...)

#artificialintelligence

Industry: Information Technology (0.39)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Trajkovski

AAAI Conferences, Feb-8-2022, 11:02:17 GMT

In this paper we explain how IETAL agents learn their environment, and how they build their intrinsic, internal representation of it, which they then use to build their expectations when on a quest to satisfy their active drives. As environments change (with or without other agents present in them), the agents learn new associations and "forget" irrelevant, "old" ones. We discuss the concept of emotional context of associations, and show a gallery of simulations of behaviors in small multiagent societies.

agent learn, trajkovski

AAAI Conferences

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.74)

AI: Deep Reinforcement Learning. A mimicry of Human evolution?

#artificialintelligence, Dec-20-2021, 13:25:19 GMT

DRL is an AI technique that aims to take appropriate actions to maximise reward in a given situation (game, simulation, or reality). Before explaining further, it is necessary to give some definitions:

- Agent: the "player" of the game, the entity taking actions. It follows a strategy (called a policy) to evolve in the environment, and its ultimate goal is to maximise its reward. The environment is said to be in a state s at a given time.
- Policy: the strategy that drives the Agent's actions; it is implemented by a neural network (NN). The policy can change as the Agent learns from its experiences.
- Reward: a metric that measures the performance of the Agent's actions within the environment.

Now let's take an example to illustrate the mechanisms of DRL: the famous card game of Poker Texas Hold'em (PTH). In PTH, the agents are the players and the environment is the set of rules of PTH (blinds, number of cards, minimum bet, playing order…).
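The roles of agent, state, policy, and reward defined above can be made concrete with a small interaction loop. The sketch below is illustrative only: it uses a hypothetical one-dimensional corridor instead of poker, and the policy is a plain Python function rather than a neural network, so no learning takes place. The names `corridor_step`, `run_episode`, `always_right`, and `always_left` are invented for this example.

```python
def corridor_step(state, action, goal=4):
    """Hypothetical environment: cells 0..goal, action is -1 (left) or +1 (right).

    Returns the next state, the reward, and whether the episode is over.
    """
    next_state = min(max(state + action, 0), goal)
    done = next_state == goal
    reward = 10 if done else -1  # each step costs 1, reaching the goal pays 10
    return next_state, reward, done


def run_episode(policy, step=corridor_step, start=0, max_steps=20):
    """Let the agent act with the given policy and return its total reward."""
    state, total = start, 0
    for _ in range(max_steps):
        action = policy(state)                 # the policy maps state -> action
        state, reward, done = step(state, action)
        total += reward                        # reward measures performance
        if done:
            break
    return total


def always_right(state):
    """A fixed policy that always moves towards the goal."""
    return 1


def always_left(state):
    """A fixed policy that never reaches the goal."""
    return -1


print(run_episode(always_right))  # 7: three steps at -1, then +10 at the goal
print(run_episode(always_left))   # -20: twenty steps at -1 each
```

In DRL the fixed functions above would be replaced by a neural network whose parameters are adjusted so that the total reward increases over many episodes.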

agent, deep reinforcement, human evolution, (5 more...)

#artificialintelligence

Country: North America > United States > Texas (0.27)

Industry: Leisure & Entertainment > Games > Poker (0.59)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.85)
