AITopics | Tadiparthi, Vaishnav

Collaborating Authors

Tadiparthi, Vaishnav

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Speaking the Language of Teamwork: LLM-Guided Credit Assignment in Multi-Agent Reinforcement Learning

Lin, Muhan, Shi, Shuyang, Guo, Yue, Tadiparthi, Vaishnav, Chalaki, Behdad, Pari, Ehsan Moradi, Stepputtis, Simon, Kim, Woojun, Campbell, Joseph, Sycara, Katia

arXiv.org Artificial IntelligenceFeb-5-2025

Credit assignment, the process of attributing credit or blame to individual agents for their contributions to a team's success or failure, remains a fundamental challenge in multi-agent reinforcement learning (MARL), particularly in environments with sparse rewards. Commonly-used approaches such as value decomposition often lead to suboptimal policies in these settings, and designing dense reward functions that align with human intuition can be complex and labor-intensive. In this work, we propose a novel framework where a large language model (LLM) generates dense, agent-specific rewards based on a natural language description of the task and the overall team goal. By learning a potential-based reward function over multiple queries, our method reduces the impact of ranking errors while allowing the LLM to evaluate each agent's contribution to the overall task. Through extensive experiments, we demonstrate that our approach achieves faster convergence and higher policy returns compared to state-of-the-art MARL baselines.

large language model, machine learning, reinforcement learning, (15 more...)

arXiv.org Artificial Intelligence

2502.03723

Country: North America > United States (0.28)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

In Search of a Lost Metric: Human Empowerment as a Pillar of Socially Conscious Navigation

Baddam, Vasanth Reddy, Chalaki, Behdad, Tadiparthi, Vaishnav, Mahjoub, Hossein Nourkhiz, Moradi-Pari, Ehsan, Eldardiry, Hoda, Boker, Almuatazbellah

arXiv.org Artificial IntelligenceJan-2-2025

In social robot navigation, traditional metrics like proxemics and behavior naturalness emphasize human comfort and adherence to social norms but often fail to capture an agent's autonomy and adaptability in dynamic environments. This paper introduces human empowerment, an information-theoretic concept that measures a human's ability to influence their future states and observe those changes, as a complementary metric for evaluating social compliance. This metric reveals how robot navigation policies can indirectly impact human empowerment. We present a framework that integrates human empowerment into the evaluation of social performance in navigation tasks. Through numerical simulations, we demonstrate that human empowerment as a metric not only aligns with intuitive social behavior, but also shows statistically significant differences across various robot navigation policies. These results provide a deeper understanding of how different policies affect social compliance, highlighting the potential of human empowerment as a complementary metric for future research in social navigation.

artificial intelligence, empowerment, navigation, (15 more...)

arXiv.org Artificial Intelligence

2501.01539

Country: North America > United States (0.16)

Genre: Research Report > New Finding (0.46)

Technology: Information Technology > Artificial Intelligence > Robots (1.00)

Add feedback

Language Grounded Multi-agent Reinforcement Learning with Human-interpretable Communication

Li, Huao, Mahjoub, Hossein Nourkhiz, Chalaki, Behdad, Tadiparthi, Vaishnav, Lee, Kwonjoon, Moradi-Pari, Ehsan, Lewis, Charles Michael, Sycara, Katia P

arXiv.org Artificial IntelligenceNov-25-2024

Multi-Agent Reinforcement Learning (MARL) methods have shown promise in enabling agents to learn a shared communication protocol from scratch and accomplish challenging team tasks. However, the learned language is usually not interpretable to humans or other agents not co-trained together, limiting its applicability in ad-hoc teamwork scenarios. In this work, we propose a novel computational pipeline that aligns the communication space between MARL agents with an embedding space of human natural language by grounding agent communications on synthetic data generated by embodied Large Language Models (LLMs) in interactive teamwork scenarios. Our results demonstrate that introducing language grounding not only maintains task performance but also accelerates the emergence of communication. Furthermore, the learned communication protocols exhibit zero-shot generalization capabilities in ad-hoc teamwork scenarios with unseen teammates and novel task states. This work presents a significant step toward enabling effective communication and collaboration between artificial agents and humans in real-world teamwork settings.

large language model, machine learning, reinforcement learning, (18 more...)

arXiv.org Artificial Intelligence

2409.17348

Country: North America > United States (0.28)

Genre: Research Report > New Finding (1.00)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
(2 more...)

Add feedback

Navigating Noisy Feedback: Enhancing Reinforcement Learning with Error-Prone Language Models

Lin, Muhan, Shi, Shuyang, Guo, Yue, Chalaki, Behdad, Tadiparthi, Vaishnav, Pari, Ehsan Moradi, Stepputtis, Simon, Campbell, Joseph, Sycara, Katia

arXiv.org Artificial IntelligenceOct-22-2024

The correct specification of reward models is a well-known challenge in reinforcement learning. Hand-crafted reward functions often lead to inefficient or suboptimal policies and may not be aligned with user values. Reinforcement learning from human feedback is a successful technique that can mitigate such issues, however, the collection of human feedback can be laborious. Recent works have solicited feedback from pre-trained large language models rather than humans to reduce or eliminate human effort, however, these approaches yield poor performance in the presence of hallucination and other errors. This paper studies the advantages and limitations of reinforcement learning from large language model feedback and proposes a simple yet effective method for soliciting and applying feedback as a potential-based shaping function. We theoretically show that inconsistent rankings, which approximate ranking errors, lead to uninformative rewards with our approach. Our method empirically improves convergence speed and policy returns over commonly used baselines even with significant ranking errors, and eliminates the need for complex post-processing of reward functions.

large language model, machine learning, reinforcement learning, (18 more...)

arXiv.org Artificial Intelligence

2410.17389

Country: North America > United States (0.68)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.91)

Add feedback

Multi-Robot Cooperative Navigation in Crowds: A Game-Theoretic Learning-Based Model Predictive Control Approach

Le, Viet-Anh, Tadiparthi, Vaishnav, Chalaki, Behdad, Mahjoub, Hossein Nourkhiz, D'sa, Jovin, Moradi-Pari, Ehsan, Malikopoulos, Andreas A.

arXiv.org Artificial IntelligenceOct-10-2023

In this paper, we develop a control framework for the coordination of multiple robots as they navigate through crowded environments. Our framework comprises of a local model predictive control (MPC) for each robot and a social long short-term memory model that forecasts pedestrians' trajectories. We formulate the local MPC formulation for each individual robot that includes both individual and shared objectives, in which the latter encourages the emergence of coordination among robots. Next, we consider the multi-robot navigation and human-robot interaction, respectively, as a potential game and a two-player game, then employ an iterative best response approach to solve the resulting optimization problems in a centralized and distributed fashion. Finally, we demonstrate the effectiveness of coordination among robots in simulated crowd navigation.

artificial intelligence, learning-based model predictive control approach, multi-robot cooperative navigation, (1 more...)

arXiv.org Artificial Intelligence

2310.06964

Genre: Research Report (0.40)

Industry: Energy > Oil & Gas > Upstream (0.60)

Technology: Information Technology > Artificial Intelligence > Robots (1.00)

Add feedback

Social Navigation in Crowded Environments with Model Predictive Control and Deep Learning-Based Human Trajectory Prediction

Le, Viet-Anh, Chalaki, Behdad, Tadiparthi, Vaishnav, Mahjoub, Hossein Nourkhiz, D'sa, Jovin, Moradi-Pari, Ehsan

arXiv.org Artificial IntelligenceSep-28-2023

Crowd navigation has received increasing attention from researchers over the last few decades, resulting in the emergence of numerous approaches aimed at addressing this problem to date. Our proposed approach couples agent motion prediction and planning to avoid the freezing robot problem while simultaneously capturing multi-agent social interactions by utilizing a state-of-the-art trajectory prediction model i.e., social long short-term memory model (Social-LSTM). Leveraging the output of Social-LSTM for the prediction of future trajectories of pedestrians at each time-step given the robot's possible actions, our framework computes the optimal control action using Model Predictive Control (MPC) for the robot to navigate among pedestrians. We demonstrate the effectiveness of our proposed approach in multiple scenarios of simulated crowd navigation and compare it against several state-of-the-art reinforcement learning-based methods.

artificial intelligence, deep learning-based human trajectory prediction, machine learning, (3 more...)

arXiv.org Artificial Intelligence

2309.16838

Genre: Research Report (0.40)

Industry: Energy > Oil & Gas > Upstream (0.60)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Minimally Disruptive Cooperative Lane-change Maneuvers

Chalaki, Behdad, Tadiparthi, Vaishnav, Mahjoub, Hossein Nourkhiz, D'sa, Jovin, Moradi-Pari, Ehsan, Armijos, Andres S. Chavez, Li, Anni, Cassandras, Christos G.

arXiv.org Artificial IntelligenceMar-10-2023

A lane-change maneuver on a congested highway could be severely disruptive or even infeasible without the cooperation of neighboring cars. However, cooperation with other vehicles does not guarantee that the performed maneuver will not have a negative impact on traffic flow unless it is explicitly considered in the cooperative controller design. In this letter, we present a socially compliant framework for cooperative lane-change maneuvers for an arbitrary number of CAVs on highways that aims to interrupt traffic flow as minimally as possible. Moreover, we explicitly impose feasibility constraints in the optimization formulation by using reachability set theory, leading to a unified design that removes the need for an iterative procedure used in prior work. We quantitatively evaluate the effectiveness of our framework and compare it against previously offered approaches in terms of maneuver time and incurred throughput disruption.

artificial intelligence, maneuver, vehicle, (17 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/LCSYS.2023.3279008

2303.05991

Country: North America > United States (0.28)

Genre: Research Report (0.50)

Industry: Transportation > Ground > Road (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Robots (0.95)

Add feedback

Cooperative Energy and Time-Optimal Lane Change Maneuvers with Minimal Highway Traffic Disruption

Armijos, Andres S. Chavez, Li, Anni, Cassandras, Christos G., Al-Nadawi, Yasir K., Araki, Hidekazu, Chalaki, Behdad, Moradi-Pari, Ehsan, Mahjoub, Hossein Nourkhiz, Tadiparthi, Vaishnav

arXiv.org Artificial IntelligenceNov-15-2022

We derive optimal control policies for a Connected Automated Vehicle (CAV) and cooperating neighboring CAVs to carry out a lane change maneuver consisting of a longitudinal phase where the CAV properly positions itself relative to the cooperating neighbors and a lateral phase where it safely changes lanes. In contrast to prior work on this problem, where the CAV "selfishly" only seeks to minimize its maneuver time, we seek to ensure that the fast-lane traffic flow is minimally disrupted (through a properly defined metric). Additionally, when performing lane-changing maneuvers, we optimally select the cooperating vehicles from a set of feasible neighboring vehicles and experimentally show that the highway throughput is improved compared to the baseline case of human-driven vehicles changing lanes with no cooperation. When feasible solutions do not exist for a given maximal allowable disruption, we include a time relaxation method trading off a longer maneuver time with reduced disruption. Our analysis is also extended to multiple sequential maneuvers. Simulation results show the effectiveness of our controllers in terms of safety guarantees and up to 16% and 90% average throughput and maneuver time improvement respectively when compared to maneuvers with no cooperation.

artificial intelligence, constraint, optimization problem, (19 more...)

arXiv.org Artificial Intelligence

2211.08636

Country: North America > United States (0.46)

Genre: Research Report > New Finding (0.34)

Industry: Transportation > Ground > Road (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.67)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.48)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.46)

Add feedback