AITopics | feedback policy

Collaborating Authors

feedback policy

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

856b503e276cc491e7e6e0ac1b9f4b17-Supplemental.pdf

Neural Information Processing SystemsFeb-9-2026, 16:15:41 GMT

algorithm, nullnull null, sequence, (16 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > New Jersey (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(2 more...)

Genre: Research Report > New Finding (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)
Information Technology > Data Science > Data Mining > Big Data (0.45)

Add feedback

OnlineControlofUnknownTime-VaryingDynamical Systems

Neural Information Processing SystemsFeb-9-2026, 16:15:37 GMT

In fact, our algorithm enjoys sublinearadaptive regret bounds, which is a strictly stronger metric than standard regret and is more appropriate fortime-varying systems.

artificial intelligence, arxivpreprintarxiv, machine learning, (18 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > France > Hauts-de-France > Nord > Lille (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.89)

Add feedback

6332a8f62e3a9d5831724f2ffe55cae0-Supplemental.pdf

Neural Information Processing SystemsFeb-9-2026, 00:45:35 GMT

equation, stability, trajectory, (16 more...)

Neural Information Processing Systems

Country:

Europe > Switzerland > Zürich > Zürich (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Online Control of Unknown Time-Varying Dynamical Systems

Neural Information Processing SystemsDec-24-2025, 09:57:00 GMT

We study online control of time-varying linear systems with unknown dynamics in the nonstochastic control model. At a high level, we demonstrate that this setting is \emph{qualitatively harder} than that of either unknown time-invariant or known time-varying dynamics, and complement our negative results with algorithmic upper bounds in regimes where sublinear regret is possible. More specifically, we study regret bounds with respect to common classes of policies: Disturbance Action (SLS), Disturbance Response (Youla), and linear feedback policies. While these three classes are essentially equivalent for LTI systems, we demonstrate that these equivalences break down for time-varying systems. We prove a lower bound that no algorithm can obtain sublinear regret with respect to the first two classes unless a certain measure of system variability also scales sublinearly in the horizon. Furthermore, we show that offline planning over the state linear feedback policies is NP-hard, suggesting hardness of the online learning problem. On the positive side, we give an efficient algorithm that attains a sublinear regret bound against the class of Disturbance Response policies up to the aforementioned system variability term. In fact, our algorithm enjoys sublinear \emph{adaptive} regret bounds, which is a strictly stronger metric than standard regret and is more appropriate for time-varying systems. We sketch extensions to Disturbance Action policies and partial observation, and propose an inefficient algorithm for regret against linear state feedback policies.

name change, online control, unknown time-varying dynamical system, (8 more...)

Neural Information Processing Systems

Industry: Education (0.59)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Online Control of Unknown Time-Varying Dynamical Systems

Neural Information Processing SystemsAug-15-2025, 15:32:06 GMT

In fact, our algorithm enjoys sublinear adaptive regret bounds, which is a strictly stronger metric than standard regret and is more appropriate for time-varying systems.

artificial intelligence, data mining, machine learning, (20 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > New Jersey (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(2 more...)

Genre: Research Report > New Finding (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)
Information Technology > Data Science > Data Mining > Big Data (0.45)

Add feedback

Online Control of Unknown Time-Varying Dynamical Systems

Neural Information Processing SystemsAug-15-2025, 15:32:02 GMT

In fact, our algorithm enjoys sublinear adaptive regret bounds, which is a strictly stronger metric than standard regret and is more appropriate for time-varying systems.

artificial intelligence, data mining, machine learning, (18 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > New Jersey (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(2 more...)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.46)

Add feedback

6332a8f62e3a9d5831724f2ffe55cae0-Supplemental.pdf

Neural Information Processing SystemsAug-14-2025, 20:20:32 GMT

equation, stability, trajectory, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > New York > New York County > New York City (0.14)
Europe > Switzerland > Zürich > Zürich (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Learning Stable Deep Dynamics Models for Partially Observed or Delayed Dynamical Systems

Neural Information Processing SystemsAug-14-2025, 20:20:28 GMT

To alleviate this issue, we propose to augment the system's state with its history.

equation, stability, trajectory, (15 more...)

Neural Information Processing Systems

Country:

Europe > Switzerland > Zürich > Zürich (0.05)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.70)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.69)

Add feedback

Optimal Modified Feedback Strategies in LQ Games under Control Imperfections

Rabbani, Mahdis, Mojahed, Navid, Nazari, Shima

arXiv.org Artificial IntelligenceMar-24-2025

Game-theoretic approaches and Nash equilibrium have been widely applied across various engineering domains. However, practical challenges such as disturbances, delays, and actuator limitations can hinder the precise execution of Nash equilibrium strategies. This work explores the impact of such implementation imperfections on game trajectories and players' costs within the context of a two-player linear quadratic (LQ) nonzero-sum game. Specifically, we analyze how small deviations by one player affect the state and cost function of the other player. To address these deviations, we propose an adjusted control policy that not only mitigates adverse effects optimally but can also exploit the deviations to enhance performance. Rigorous mathematical analysis and proofs are presented, demonstrating through a representative example that the proposed policy modification achieves up to $61\%$ improvement compared to the unadjusted feedback policy and up to $0.59\%$ compared to the feedback Nash strategy.

artificial intelligence, deviation, game theory, (16 more...)

arXiv.org Artificial Intelligence

2503.192

Country:

North America > United States > California > Yolo County > Davis (0.04)
Europe > Poland > Masovia Province > Warsaw (0.04)

Genre: Research Report (0.50)

Industry: Transportation (0.68)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Robots (0.95)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.94)

Add feedback

Filters

Collaborating Authors

feedback policy

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

856b503e276cc491e7e6e0ac1b9f4b17-Supplemental.pdf

OnlineControlofUnknownTime-VaryingDynamical Systems

6332a8f62e3a9d5831724f2ffe55cae0-Supplemental.pdf

6332a8f62e3a9d5831724f2ffe55cae0-Paper.pdf

Online Control of Unknown Time-Varying Dynamical Systems

Online Control of Unknown Time-Varying Dynamical Systems

Online Control of Unknown Time-Varying Dynamical Systems

6332a8f62e3a9d5831724f2ffe55cae0-Supplemental.pdf

Learning Stable Deep Dynamics Models for Partially Observed or Delayed Dynamical Systems

Optimal Modified Feedback Strategies in LQ Games under Control Imperfections