AITopics | Shah, Ameesh

Collaborating Authors

Shah, Ameesh

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Learning Symbolic Task Decompositions for Multi-Agent Teams

Shah, Ameesh, Lauffer, Niklas, Chen, Thomas, Pitta, Nikhil, Seshia, Sanjit A.

arXiv.org Artificial IntelligenceFeb-18-2025

One approach for improving sample efficiency in cooperative multi-agent learning is to decompose overall tasks into sub-tasks that can be assigned to individual agents. We study this problem in the context of reward machines: symbolic tasks that can be formally decomposed into sub-tasks. In order to handle settings without a priori knowledge of the environment, we introduce a framework that can learn the optimal decomposition from model-free interactions with the environment. Our method uses a task-conditioned architecture to simultaneously learn an optimal decomposition and the corresponding agents' policies for each sub-task. In doing so, we remove the need for a human to manually design the optimal decomposition while maintaining the sample-efficiency benefits of improved credit assignment. We provide experimental results in several deep reinforcement learning settings, demonstrating the efficacy of our approach. Our results indicate that our approach succeeds even in environments with codependent agent dynamics, enabling synchronous multi-agent learning not achievable in previous works.

artificial intelligence, decomposition, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2502.13376

Country:

Europe (0.93)
North America > United States > Michigan (0.14)
North America > United States > Massachusetts (0.14)
North America > United States > California (0.14)

Genre: Research Report > New Finding (0.66)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.34)

Add feedback

LTL-Constrained Policy Optimization with Cycle Experience Replay

Shah, Ameesh, Voloshin, Cameron, Yang, Chenxi, Verma, Abhinav, Chaudhuri, Swarat, Seshia, Sanjit A.

arXiv.org Artificial IntelligenceMay-24-2024

Linear Temporal Logic (LTL) offers a precise means for constraining the behavior of reinforcement learning agents. However, in many tasks, LTL is insufficient for task specification; LTL-constrained policy optimization, where the goal is to optimize a scalar reward under LTL constraints, is needed. Prior methods for this constrained problem are restricted to finite state spaces. In this work, we present Cycle Experience Replay (CyclER), a reward-shaping approach to this problem that allows continuous state and action spaces and the use of function approximations. CyclER guides a policy towards satisfaction by encouraging partial behaviors compliant with the LTL constraint, using the structure of the constraint. In doing so, it addresses the optimization challenges stemming from the sparse nature of LTL satisfaction. We evaluate CyclER in three continuous control domains. On these tasks, CyclER outperforms existing reward-shaping methods at finding performant and LTL-satisfying policies.

machine learning, reinforcement learning, specification, (19 more...)

arXiv.org Artificial Intelligence

2404.11578

Country:

Europe (0.67)
North America > United States > California > Alameda County > Berkeley (0.14)

Genre: Research Report (1.00)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Learning Formal Specifications from Membership and Preference Queries

Shah, Ameesh, Vazquez-Chanlatte, Marcell, Junges, Sebastian, Seshia, Sanjit A.

arXiv.org Artificial IntelligenceJul-19-2023

Active learning is a well-studied approach to learning formal specifications, such as automata. In this work, we extend active specification learning by proposing a novel framework that strategically requests a combination of membership labels and pair-wise preferences, a popular alternative to membership labels. The combination of pair-wise preferences and membership labels allows for a more flexible approach to active specification learning, which previously relied on membership labels only. We instantiate our framework in two different domains, demonstrating the generality of our approach. Our results suggest that learning from both modalities allows us to robustly and conveniently identify specifications via membership and preferences.

artificial intelligence, logic & formal reasoning, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2307.10434

Country: North America > United States > Hawaii (0.14)

Genre: Research Report > New Finding (1.00)

Industry: Education (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (0.62)

Add feedback

Who Needs to Know? Minimal Knowledge for Optimal Coordination

Lauffer, Niklas, Shah, Ameesh, Carroll, Micah, Dennis, Michael, Russell, Stuart

arXiv.org Artificial IntelligenceJul-13-2023

If much of the information is irrelevant, it's easy to To optimally coordinate with others in cooperative imagine how this could lead to significant increases in efficiency games, it is often crucial to have information for finding optimal policies. For example, this could about one's collaborators: successful driving requires allow a focused effort on few-shot or zero-shot adaptation to understanding which side of the road to co-players (Zand et al., 2022; Albrecht & Stone, 2017; Stone drive on. However, not every feature of collaborators et al., 2010; Hu et al., 2020) or more efficient DecPOMDP is strategically relevant: the fine-grained planning algorithms (Szer & Charpillet, 2006; Seuken & acceleration of drivers may be ignored while maintaining Zilberstein, 2007). In order to leverage these benefits, we optimal coordination. We show that there build the theory, data structures, and algorithms required to is a well-defined dichotomy between strategically distinguish between relevant and irrelevant information.

artificial intelligence, game theory, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2306.09309

Country: North America > United States > California > Alameda County > Berkeley (0.14)

Genre: Research Report (0.64)

Industry: Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Game Theory (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback

Specification-Guided Data Aggregation for Semantically Aware Imitation Learning

Shah, Ameesh, DeCastro, Jonathan, Gideon, John, Yalcinkaya, Beyazit, Rosman, Guy, Seshia, Sanjit A.

arXiv.org Artificial IntelligenceMar-29-2023

Advancements in simulation and formal methods-guided environment sampling have enabled the rigorous evaluation of machine learning models in a number of safety-critical scenarios, such as autonomous driving. Application of these environment sampling techniques towards improving the learned models themselves has yet to be fully exploited. In this work, we introduce a novel method for improving imitation-learned models in a semantically aware fashion by leveraging specification-guided sampling techniques as a means of aggregating expert data in new environments. Specifically, we create a set of formal specifications as a means of partitioning the space of possible environments into semantically similar regions, and identify elements of this partition where our learned imitation behaves most differently from the expert. We then aggregate expert data on environments in these identified regions, leading to more accurate imitation of the expert's behavior semantics. We instantiate our approach in a series of experiments in the CARLA driving simulator, and demonstrate that our approach leads to models that are more accurate than those learned with other environment sampling methods.

artificial intelligence, machine learning, specification, (16 more...)

arXiv.org Artificial Intelligence

2303.1701

Country:

Europe (1.00)
North America > United States > New York (0.29)
North America > United States > California (0.28)

Genre: Research Report (1.00)

Industry:

Transportation (0.67)
Automobiles & Trucks (0.49)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.88)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (0.54)

Add feedback

Demonstration Informed Specification Search

Vazquez-Chanlatte, Marcell, Shah, Ameesh, Lederman, Gil, Seshia, Sanjit A.

arXiv.org Artificial IntelligenceDec-20-2021

This paper considers the problem of learning history dependent task specifications, e.g. automata and temporal logic, from expert demonstrations. Unfortunately, the (countably infinite) number of tasks under consideration combined with an a-priori ignorance of what historical features are needed to encode the demonstrated task makes existing approaches to learning tasks from demonstrations inapplicable. To address this deficit, we propose Demonstration Informed Specification Search (DISS): a family of algorithms parameterized by black box access to (i) a maximum entropy planner and (ii) an algorithm for identifying concepts, e.g., automata, from labeled examples. DISS works by alternating between (i) conjecturing labeled examples to make the demonstrations less surprising and (ii) sampling concepts consistent with the current labeled examples. In the context of tasks described by deterministic finite automata, we provide a concrete implementation of DISS that efficiently combines partial knowledge of the task and a single expert demonstration to identify the full task specification.

demonstration, machine learning, reinforcement learning, (17 more...)

arXiv.org Artificial Intelligence

2112.10807

Country:

Europe (0.68)
North America > United States > California (0.28)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.47)

Add feedback

Learning Differentiable Programs with Admissible Neural Heuristics

Shah, Ameesh, Zhan, Eric, Sun, Jennifer J., Verma, Abhinav, Yue, Yisong, Chaudhuri, Swarat

arXiv.org Artificial IntelligenceJul-25-2020

We study the problem of learning differentiable functions expressed as programs in a domain-specific language. Such programmatic models can offer benefits such as composability and interpretability; however, learning them requires optimizing over a combinatorial space of program "architectures". We frame this optimization problem as a search in a weighted graph whose paths encode top-down derivations of program syntax. Our key innovation is to view various classes of neural networks as continuous relaxations over the space of programs, which can then be used to complete any partial program. This relaxed program is differentiable and can be trained end-to-end, and the resulting training loss is an approximately admissible heuristic that can guide the combinatorial search. We instantiate our approach on top of the A-star algorithm and an iteratively deepened branch-and-bound search, and use these algorithms to learn programmatic classifiers in three sequence classification tasks. Our experiments show that the algorithms outperform state-of-the-art methods for program learning, and that they discover programmatic classifiers that yield natural interpretations and achieve competitive accuracy.

algorithm, artificial intelligence, neural network, (18 more...)

arXiv.org Artificial Intelligence

2007.12101

Country:

North America > United States (1.00)
North America > Canada (0.93)

Genre: Research Report (0.84)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback