 Mesbahi, Mehran


Estimation-Aware Trajectory Optimization with Set-Valued Measurement Uncertainties

arXiv.org Artificial Intelligence

In this paper, we present an optimization-based framework for generating estimation-aware trajectories in scenarios where measurement (output) uncertainties are state-dependent and set-valued. The framework leverages the concept of regularity for set-valued output maps. Specifically, we demonstrate that, for output-regular maps, one can utilize a set-valued observability measure that is concave with respect to finite-horizon state trajectories. By maximizing this measure, optimized estimation-aware trajectories can be designed for a broad class of systems, including those with locally linearized dynamics. To illustrate the effectiveness of the proposed approach, we provide a representative example in the context of trajectory planning for vision-based estimation: an estimation-aware trajectory for an uncooperative target-tracking problem in which an ego-satellite relies on a machine learning (ML)-based estimation module.
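For a rough flavor of the idea (and only that), the sketch below shapes a trajectory so that a stacked measurement-Jacobian "information" matrix stays well conditioned. The bearing-only sensor, double-integrator dynamics, and log-det surrogate are all illustrative assumptions of ours, not the paper's set-valued observability measure.

```python
import numpy as np
from scipy.optimize import minimize

# Hypothetical sketch of estimation-aware trajectory design: a planar
# double integrator measures only the bearing to a target at the origin,
# and we pick controls that keep the summed measurement-Jacobian
# information matrix well conditioned (log-det surrogate), trading this
# off against control effort. All modeling choices here are ours.
dt, N = 0.5, 20
A = np.block([[np.eye(2), dt * np.eye(2)], [np.zeros((2, 2)), np.eye(2)]])
B = np.vstack([0.5 * dt**2 * np.eye(2), dt * np.eye(2)])
x0 = np.array([5.0, 0.0, 0.0, 0.0])  # start 5 units from the target

def rollout(u_flat):
    u = u_flat.reshape(N, 2)
    xs = [x0]
    for uk in u:
        xs.append(A @ xs[-1] + B @ uk)
    return np.array(xs)

def bearing_jac(pos):
    # Jacobian of atan2(py, px) with respect to position (1 x 2 row).
    px, py = pos
    r2 = px**2 + py**2 + 1e-9
    return np.array([[-py / r2, px / r2]])

def cost(u_flat):
    xs = rollout(u_flat)
    G = sum(bearing_jac(x[:2]).T @ bearing_jac(x[:2]) for x in xs)
    _, logdet = np.linalg.slogdet(G + 1e-8 * np.eye(2))
    return -logdet + 1e-2 * np.sum(u_flat**2)  # observability vs. effort

res = minimize(cost, 1e-3 * np.ones(2 * N), method="L-BFGS-B")
print("optimized observability surrogate:", -cost(res.x))
```

A straight-line (zero-control) trajectory leaves the information matrix nearly rank-one, so the optimizer curves the path to diversify the line-of-sight directions.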


Multi Agent Reinforcement Learning for Sequential Satellite Assignment Problems

arXiv.org Artificial Intelligence

The assignment problem is a classic combinatorial optimization problem in which a group of agents must be assigned to a group of tasks so that maximum utility is achieved while the assignment constraints are satisfied. Given the utility of each agent completing each task, polynomial-time algorithms exist to solve a single assignment problem in its simplest form. However, in many modern applications such as satellite constellations, power grids, and mobile robot scheduling, assignment problems unfold over time, with the utility of a given assignment depending heavily on the state of the system. We apply multi-agent reinforcement learning to this problem, learning the value of assignments by bootstrapping from a known polynomial-time greedy solver and then learning from further experience. We then choose assignments using a distributed optimal assignment mechanism rather than selecting them directly. We demonstrate that this algorithm is theoretically justified and avoids pitfalls experienced by other RL algorithms in this setting. Finally, we show that our algorithm significantly outperforms other methods in the literature, even while scaling to realistic scenarios with hundreds of agents and tasks.
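The action-selection step can be illustrated with a toy snippet: a value table over (agent, task) pairs, random here as a stand-in for learned Q-values, is fed to an optimal assignment solver instead of letting each agent take its own argmax. The centralized Hungarian solver below is our stand-in for the paper's distributed assignment mechanism.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment

# Choose a joint assignment from per-(agent, task) values. Independent
# per-agent argmaxes can assign several agents to the same task; the
# assignment solver maximizes total utility under one-to-one constraints.
rng = np.random.default_rng(0)
q_values = rng.random((6, 8))  # 6 agents, 8 tasks; stands in for learned values

agents, tasks = linear_sum_assignment(q_values, maximize=True)
greedy = q_values.argmax(axis=1)  # naive independent choices

print("assignment:", dict(zip(agents.tolist(), tasks.tolist())))
print("total utility:", q_values[agents, tasks].sum())
print("greedy collisions:", len(greedy) - len(set(greedy)))
```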


Data-Guided Regulator for Adaptive Nonlinear Control

arXiv.org Artificial Intelligence

A critical aspect of autonomous operation in safety-critical scenarios is learning from available data for quick adaptation to new environments while maintaining safety. Examples include aircraft emergency landing in adverse weather conditions and agile quadrotor flight through low-clearance gates in the presence of dynamic and strong wind conditions [1]. From a system-theoretic perspective, this capability amounts to having the autonomous agent handle parametric model uncertainties and disturbances with control-theoretic guarantees such as stability and tracking-error convergence, common in adaptive control settings [2, 3]. A rich body of literature has analyzed the stability and convergence properties of classical adaptive control algorithms for continuous-time dynamical systems. Such studies include the use of PI (proportional-integral) controllers [4] for a class of linear time-varying systems to guarantee (I) infinite-time convergence to zero of the tracking error, i.e., the difference between the actual and nominal states, $e(t) = x(t) - \bar{x}(t)$, for any constant exogenous disturbance (denoted by $d$), and (II) infinite-time convergence of the tracking error $e(t)$ to a bound proportional to the bound on the magnitude of the rate of the exogenous signal, $\dot{d}(t)$.
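Guarantee (I) is easy to see in a toy simulation; the scalar plant, gains, and reference below are illustrative choices of ours, not the system class analyzed in the cited works.

```python
import numpy as np

# PI control rejects a constant disturbance: scalar plant
# xdot = a*x + u + d with constant d, reference x_ref = 1.
a, d, x_ref = -1.0, 0.7, 1.0
kp, ki = 4.0, 3.0
dt, T = 1e-3, 10.0
x, integ = 0.0, 0.0
for _ in range(int(T / dt)):
    e = x - x_ref
    integ += e * dt
    u = -kp * e - ki * integ  # integral action absorbs the constant d
    x += (a * x + u + d) * dt
print("final tracking error:", x - x_ref)  # ~0 despite d != 0
```

At steady state the integrator supplies exactly the input needed to cancel the disturbance and hold the reference, which is why the error converges to zero for any constant $d$.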


Data-Driven Structured Policy Iteration for Homogeneous Distributed Systems

arXiv.org Artificial Intelligence

Control of networked systems, comprised of interacting agents, is often achieved by modeling the underlying interactions. Constructing accurate models of such interactions, however, can become prohibitive in applications. Data-driven control methods avoid such complications by synthesizing a controller directly from observed data. In this paper, we propose an algorithm referred to as Data-driven Structured Policy Iteration (D2SPI) for synthesizing an efficient feedback mechanism that respects the sparsity pattern induced by the underlying interaction network. In particular, our algorithm uses temporary "auxiliary" communication links to enable the required information exchange on a (smaller) sub-network during the learning phase -- links that are removed subsequently for the final distributed feedback synthesis. We then show that the learned policy results in a stabilizing structured policy for the entire network. Our analysis then establishes the stability and convergence of the proposed distributed policies throughout the learning phase, exploiting a construct referred to as the "patterned monoid." The performance of D2SPI is demonstrated using representative simulation scenarios.
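A model-based caricature of the structured iteration may help fix ideas; the data-driven, auxiliary-link machinery of D2SPI is not reproduced here, and the banded sparsity mask, matrices, and naive projection step are all our illustrative choices.

```python
import numpy as np
from scipy.linalg import solve_discrete_lyapunov

# Structured policy iteration, model-based stand-in: alternate exact
# policy evaluation with an LQR improvement step projected onto a
# sparsity mask induced by the interaction network.
rng = np.random.default_rng(1)
n = 4
A = rng.standard_normal((n, n))
A *= 0.5 / max(abs(np.linalg.eigvals(A)))  # normalize spectral radius to 0.5
B, Q, R = np.eye(n), np.eye(n), np.eye(n)
mask = np.abs(np.subtract.outer(np.arange(n), np.arange(n))) <= 1  # banded

K = np.zeros((n, n))  # structured and stabilizing (A itself is stable)
for _ in range(30):
    Acl = A - B @ K
    P = solve_discrete_lyapunov(Acl.T, Q + K.T @ R @ K)      # policy evaluation
    K_full = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)   # LQR improvement
    K = np.where(mask, K_full, 0.0)  # naive projection onto the pattern

print("closed-loop spectral radius:", max(abs(np.linalg.eigvals(A - B @ K))))
```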


Adaptive Traffic Control with Deep Reinforcement Learning: Towards State-of-the-art and Beyond

arXiv.org Machine Learning

In this work, we study adaptive data-guided traffic planning and control using Reinforcement Learning (RL). We shift from the plain use of classic methods towards the state of the art in the deep RL community. We embed in our algorithm several recent techniques that improve the original Deep Q-Networks (DQN) for discrete control and discuss the traffic-related interpretations that follow. We propose a novel DQN-based algorithm for Traffic Control (called TC-DQN+) as a tool for fast and reliable traffic decision-making. We introduce a new form of reward function, which we discuss through illustrative examples with comparisons to traditional traffic control methods.
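One representative example of such a DQN improvement is the Double DQN target, sketched below as a generic illustration (not the TC-DQN+ recipe): the online network selects the next action and the target network evaluates it, which reduces over-estimation of action values.

```python
import numpy as np

# Double DQN target computation on a toy batch: random arrays stand in
# for the online and target networks' Q-values at the next state.
rng = np.random.default_rng(0)
n_batch, n_actions, gamma = 4, 3, 0.99
q_online_next = rng.random((n_batch, n_actions))  # online net at s'
q_target_next = rng.random((n_batch, n_actions))  # target net at s'
rewards = rng.random(n_batch)
done = np.array([0, 0, 1, 0])  # terminal transitions bootstrap nothing

a_star = q_online_next.argmax(axis=1)  # action selection by the online net
target = rewards + gamma * (1 - done) * q_target_next[np.arange(n_batch), a_star]
print(target)
```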


Global Convergence of Policy Gradient Methods for Linearized Control Problems

arXiv.org Machine Learning

Direct policy gradient methods for reinforcement learning and continuous control problems are a popular approach for a variety of reasons: 1) they are easy to implement without explicit knowledge of the underlying model; 2) they are an "end-to-end" approach, directly optimizing the performance metric of interest; and 3) they inherently allow for richly parameterized policies. A notable drawback is that even in the most basic continuous control problem (that of linear quadratic regulators), these methods must solve a non-convex optimization problem, and little is understood about their efficiency from either a computational or a statistical perspective. In contrast, system identification and model-based planning in optimal control theory have a much more solid theoretical footing, where much is known about their computational and statistical properties. This work bridges this gap, showing that (model-free) policy gradient methods globally converge to the optimal solution and are efficient (polynomially so in relevant problem-dependent quantities) with regard to their sample and computational complexities.
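The sketch below runs a zeroth-order (model-free) policy gradient on a tiny discrete-time LQR and compares the result against the Riccati solution; the system, smoothing radius, step size, and iteration budget are our illustrative choices, not the paper's analyzed quantities.

```python
import numpy as np
from scipy.linalg import solve_discrete_are, solve_discrete_lyapunov

# Zeroth-order policy gradient on a small discrete-time LQR, checked
# against the Riccati optimum. The cost oracle below uses the model only
# for evaluation; the optimizer itself sees cost values alone.
A = np.array([[0.9, 0.1], [0.0, 0.9]])
B = np.array([[0.0], [0.1]])
Q, R = np.eye(2), np.eye(1)

def lqr_cost(K):
    # Infinite-horizon cost for x0 ~ N(0, I); infinite if K destabilizes.
    Acl = A - B @ K
    if max(abs(np.linalg.eigvals(Acl))) >= 1.0:
        return np.inf
    P = solve_discrete_lyapunov(Acl.T, Q + K.T @ R @ K)
    return np.trace(P)

rng = np.random.default_rng(0)
K, radius, lr = np.zeros((1, 2)), 0.1, 2e-3
for _ in range(400):
    grad = np.zeros_like(K)
    for _ in range(8):  # average a few two-point gradient estimates
        U = rng.standard_normal(K.shape)
        U *= radius / np.linalg.norm(U)  # perturbation on a radius-r sphere
        cp, cm = lqr_cost(K + U), lqr_cost(K - U)
        if np.isfinite(cp) and np.isfinite(cm):
            grad += (K.size / (2 * radius**2)) * (cp - cm) * U / 8
    K -= lr * grad

P_opt = solve_discrete_are(A, B, Q, R)
K_opt = np.linalg.solve(R + B.T @ P_opt @ B, B.T @ P_opt @ A)
print("cost gap to Riccati optimum:", lqr_cost(K) - np.trace(P_opt))
```

A few hundred noisy steps shrink the gap to the Riccati cost, illustrating the global convergence phenomenon the paper establishes rigorously for this non-convex landscape.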