AITopics | value propagation

Collaborating Authors

value propagation

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Probabilistic Attention for Interactive Segmentation

Neural Information Processing SystemsMay-1-2026, 01:53:12 GMT

We provide a probabilistic interpretation of attention and show that the standard dotproduct attention in transformers is a special case of Maximum APosteriori (MAP) inference. The proposed approach suggests the use of Expectation Maximization algorithms for online adaptation of key and value model parameters. This approach is useful for cases in which external agents, e.g., annotators, provide inference-time information about the correct values of some tokens, e.g., the semantic category of some pixels, and we need for this new information to propagate to other tokens in a principled manner. We illustrate the approach on an interactive semantic segmentation task in which annotators and models collaborate online to improve annotation efficiency. Using standard benchmarks, we observe that key adaptation boosts model performance ( 10% mIoU) in the low feedback regime and value propagation improves model responsiveness in the high feedback regime.

computer vision, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Genre: Overview (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Sensing and Signal Processing > Image Processing (0.95)
(2 more...)

Add feedback

Value Propagation for Decentralized Networked Deep Multi-agent Reinforcement Learning

Chao Qu, Shie Mannor, Huan Xu, Yuan Qi, Le Song, Junwu Xiong

Neural Information Processing SystemsFeb-12-2026, 20:47:50 GMT

Neural Information Processing Systems http://nips.cc/

agent, algorithm, value propagation, (14 more...)

Neural Information Processing Systems

Country: North America > Canada > Ontario > Toronto (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.93)

Add feedback

8a0e1141fd37fa5b98d5bb769ba1a7cc-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-12-2026, 20:47:34 GMT

When they optimize the squared TD-error, they do not calculate gradient w.r.t. the parameterθ of the18 target(e.g.

artificial intelligence, experiment, thanksforthecomment, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.38)

Add feedback

23937b42f9273974570fb5a56a6652ee-Supplemental.pdf

Neural Information Processing SystemsFeb-7-2026, 21:05:14 GMT

computer vision, segmentation, value propagation, (13 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Vision (0.71)

Add feedback

Value Propagation for Decentralized Networked Deep Multi-agent Reinforcement Learning

Neural Information Processing SystemsDec-25-2025, 16:37:59 GMT

We consider the networked multi-agent reinforcement learning (MARL) problem in a fully decentralized setting, where agents learn to coordinate to achieve joint success. This problem is widely encountered in many areas including traffic control, distributed control, and smart grids. We assume each agent is located at a node of a communication network and can exchange information only with its neighbors. Using softmax temporal consistency, we derive a primal-dual decentralized optimization method and obtain a principled and data-efficient iterative algorithm named {\em value propagation}. We prove a non-asymptotic convergence rate of $\mathcal{O}(1/T)$ with nonlinear function approximation. To the best of our knowledge, it is the first MARL algorithm with a convergence guarantee in the control, off-policy, non-linear function approximation, fully decentralized setting.

name change, networked deep multi-agent reinforcement learning, value propagation, (3 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.65)

Add feedback

Value Propagation for Decentralized Networked Deep Multi-agent Reinforcement Learning

Chao Qu, Shie Mannor, Huan Xu, Yuan Qi, Le Song, Junwu Xiong

Neural Information Processing SystemsOct-3-2025, 04:13:28 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Country: North America > Canada (0.28)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.93)

Add feedback

Probabilistic Attention for Interactive Segmentation: Supplementary Material

Neural Information Processing SystemsOct-2-2025, 21:57:10 GMT

Specifically, we plot the improvement in mask accuracy, i.e. mean IOU relative to ground truth, as a function of

artificial intelligence, machine learning, segmentation, (14 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Vision (0.71)

Add feedback

Reviews: Value Propagation for Decentralized Networked Deep Multi-agent Reinforcement Learning

Neural Information Processing SystemsJan-25-2025, 10:07:30 GMT

This paper tackles the problem of decentralized learning in multi-agent environments. While many recent approaches use a combination of centralized learning and decentralized execution, the decentralized learning paradigm is motivated by scenarios where a centralized agent (e.g. a value function) may be too expensive to use, or may have undesirable privacy implications. However, previous decentralized learning approaches haven't been very effective for multi-agent problems. The paper proposes a new algorithm, value propagation, and prove that it converges in the non-linear function approximation case. To my knowledge, the value propagation algorithm is novel and interesting.

algorithm, decentralized learning method, networked deep multi-agent reinforcement learning, (9 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.86)

Add feedback

Reviews: Value Propagation for Decentralized Networked Deep Multi-agent Reinforcement Learning

Neural Information Processing SystemsJan-25-2025, 10:07:19 GMT

All reviewers agree that this paper provide novel theoretical results applicable to the single-agent setting.

networked deep multi-agent reinforcement learning, reviewer, value propagation

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.85)

Add feedback

Filters

Collaborating Authors

value propagation

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Probabilistic Attention for Interactive Segmentation

Value Propagation for Decentralized Networked Deep Multi-agent Reinforcement Learning

8a0e1141fd37fa5b98d5bb769ba1a7cc-AuthorFeedback.pdf

23937b42f9273974570fb5a56a6652ee-Supplemental.pdf

23937b42f9273974570fb5a56a6652ee-Paper.pdf

Value Propagation for Decentralized Networked Deep Multi-agent Reinforcement Learning

Value Propagation for Decentralized Networked Deep Multi-agent Reinforcement Learning

Probabilistic Attention for Interactive Segmentation: Supplementary Material

Reviews: Value Propagation for Decentralized Networked Deep Multi-agent Reinforcement Learning

Reviews: Value Propagation for Decentralized Networked Deep Multi-agent Reinforcement Learning