AITopics | shaq

SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning

Neural Information Processing Systems Feb-7-2026, 23:54:18 GMT

Value factorisation is a useful technique for multi-agent reinforcement learning (MARL) in global reward games; however, its underlying mechanism is not yet fully understood.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Country:

Europe > Sweden > Stockholm > Stockholm (0.04)
North America > United States > California (0.04)
Europe > Germany > Baden-Württemberg > Freiburg (0.04)
Europe > France (0.04)

Genre: Research Report (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.87)

SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning

Neural Information Processing Systems Feb-7-2026, 23:54:14 GMT

machine learning, reinforcement learning, shaq, (12 more...)

Neural Information Processing Systems

Country:

Europe > Sweden > Stockholm > Stockholm (0.04)
Europe > Germany > Baden-Württemberg > Freiburg (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)
(5 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.83)

SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning

Neural Information Processing Systems Dec-23-2025, 22:32:49 GMT

Value factorisation is a useful technique for multi-agent reinforcement learning (MARL) in global reward games; however, its underlying mechanism is not yet fully understood. This paper studies a theoretical framework for value factorisation with interpretability via Shapley value theory. We generalise the Shapley value to Markov convex games, calling the result the Markov Shapley value (MSV), and apply it as a value factorisation method in global reward games, an application that follows from the equivalence between the two games. Based on the properties of MSV, we derive the Shapley-Bellman optimality equation (SBOE) to evaluate the optimal MSV, which corresponds to an optimal joint deterministic policy. Furthermore, we propose the Shapley-Bellman operator (SBO), which is proved to solve the SBOE. With a stochastic approximation and some transformations, a new MARL algorithm called Shapley Q-learning (SHAQ) is established, the implementation of which is guided by the theoretical results on SBO and MSV. We also discuss the relationship between SHAQ and relevant value factorisation methods. In the experiments, SHAQ exhibits not only superior performance on all tasks but also interpretability that agrees with the theoretical analysis.
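
As a rough illustration of the idea the abstract builds on, the sketch below computes the classic (static) Shapley value of a small coalitional game by averaging each player's marginal contribution over all orderings. It is not the paper's Markov Shapley value or the SHAQ algorithm; the function names, the player labels, and the toy value function `toy_value` are assumptions made up for this example.

```python
from itertools import permutations
from math import factorial


def shapley_values(players, value):
    """Exact Shapley values: average marginal contribution over all orderings."""
    totals = {p: 0.0 for p in players}
    for order in permutations(players):
        coalition = frozenset()
        for p in order:
            with_p = coalition | {p}
            # Marginal contribution of p when joining the current coalition.
            totals[p] += value(with_p) - value(coalition)
            coalition = with_p
    n_orders = factorial(len(players))
    return {p: t / n_orders for p, t in totals.items()}


def toy_value(coalition):
    # Hypothetical convex game: the value grows with the square of coalition size.
    return float(len(coalition) ** 2)


players = ["a1", "a2", "a3"]
phi = shapley_values(players, toy_value)
print(phi)
```

In this symmetric toy game every player receives the same share, and the shares add up to the value of the grand coalition, which is the sense in which a Shapley-style split "factorises" a global value into per-agent parts. The exhaustive loop over orderings is only practical for a handful of players.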

artificial intelligence, machine learning, reinforcement learning, (11 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning

Neural Information Processing Systems Oct-3-2025, 02:21:21 GMT

Value factorisation is a useful technique for multi-agent reinforcement learning (MARL) in global reward games; however, its underlying mechanism is not yet fully understood. This paper studies a theoretical framework for value factorisation with interpretability via Shapley value theory.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States (0.67)
Europe > United Kingdom > England (0.27)

Genre: Research Report (1.00)

Industry:

Government (0.67)
Information Technology (0.67)
Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.46)

27985d21f0b751b933d675930aa25022-Paper-Conference.pdf

Neural Information Processing Systems Oct-3-2025, 02:21:17 GMT

artificial intelligence, machine learning, reinforcement learning, (19 more...)

Neural Information Processing Systems

Country:

North America > United States (0.68)
Europe > United Kingdom > England (0.28)

Genre: Research Report (0.93)

Industry: Information Technology (0.46)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.98)
(2 more...)

SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning

Neural Information Processing Systems Oct-10-2024, 10:05:40 GMT

Value factorisation is a useful technique for multi-agent reinforcement learning (MARL) in global reward games; however, its underlying mechanism is not yet fully understood. This paper studies a theoretical framework for value factorisation with interpretability via Shapley value theory. We generalise the Shapley value to Markov convex games, calling the result the Markov Shapley value (MSV), and apply it as a value factorisation method in global reward games, an application that follows from the equivalence between the two games. Based on the properties of MSV, we derive the Shapley-Bellman optimality equation (SBOE) to evaluate the optimal MSV, which corresponds to an optimal joint deterministic policy. Furthermore, we propose the Shapley-Bellman operator (SBO), which is proved to solve the SBOE.

incorporating shapley value theory, multi-agent q-learning, value factorisation method, (6 more...)

Neural Information Processing Systems

Genre: Play > Prospect (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Amazon ditches Alexa's celebrity voices and will issue refunds upon request

Engadget May-30-2023, 16:55:11 GMT

If you've been saving up to integrate Shaq's voice into your Alexa devices, you've officially blown it. Amazon is ditching all of its Alexa-enabled celebrity voices, including Shaquille O'Neal, Melissa McCarthy and, say it ain't so, Samuel L. Jackson. The distinct voice options will no longer be available for purchase and will no longer function even if you made a purchase a while back, as reported by The Verge. That brings us to the topic of refunds, and it looks like there won't be any. This isn't earth-shattering news, as the voice options launched for just $1 before moving up to $5 in recent months.

alexa, celebrity voice, refund, (6 more...)

Engadget

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.35)

SHAQ: Single Headed Attention with Quasi-Recurrence

Bharwani, Nashwin, Kushner, Warren, Dandona, Sangeet, Schreiber, Ben

arXiv.org Artificial Intelligence Aug-18-2021

Natural Language Processing research has recently been dominated by large-scale transformer models. Although they achieve state-of-the-art results on many important language tasks, transformers often require expensive compute resources and take days to weeks to train. This is feasible for researchers at big tech companies and leading research universities, but not for scrappy start-up founders, students, and independent researchers. Stephen Merity's SHA-RNN, a compact, hybrid attention-RNN model, is designed for consumer-grade modeling, as it requires significantly fewer parameters and less training time to reach near state-of-the-art results. We analyze Merity's model here through an exploratory model analysis over several units of the architecture, considering both training time and overall quality in our assessment. Ultimately, we combine these findings into a new architecture, which we call SHAQ: Single Headed Attention Quasi-recurrent Neural Network. With our new architecture we achieved accuracy similar to that of the SHA-RNN while obtaining a 4x speed boost in training.

architecture, baseline, merity, (17 more...)

arXiv.org Artificial Intelligence

2108.08207

Country: South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

SHAQ: Incorporating Shapley Value Theory into Q-Learning for Multi-Agent Reinforcement Learning

Wang, Jianhong, Wang, Jinxin, Zhang, Yuan, Gu, Yunjie, Kim, Tae-Kyun

arXiv.org Artificial Intelligence May-31-2021

Value factorisation proves to be a very useful technique in multi-agent reinforcement learning (MARL), but the underlying mechanism is not yet fully understood. This paper explores a theoretic basis for value factorisation. We generalise the Shapley value in coalitional game theory to a Markov convex game (MCG) and use it to guide value factorisation in MARL. We show that the generalised Shapley value possesses several features such as (1) accurate estimation of the maximum global value, (2) fairness in the factorisation of the global value, and (3) being sensitive to dummy agents. The proposed theory yields a new learning algorithm called Shapley Q-learning (SHAQ), which inherits the important merits of ordinary Q-learning but extends it to MARL. In comparison with prior art, SHAQ has a much weaker assumption (MCG) that is more compatible with real-world problems, but has superior explainability and performance in many cases. We demonstrated SHAQ and verified the theoretic claims on Predator-Prey and the StarCraft Multi-Agent Challenge (SMAC).
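
The fairness and dummy-agent properties listed above can be seen in the classic Shapley value with a very small game. The sketch below uses the standard subset-weighted formula on a hypothetical three-player game in which only `a1` and `a2` together create value and `a3` contributes nothing. It illustrates the static Shapley value only, not the generalised version or the SHAQ algorithm from the paper; all names and the value function are assumptions for this example.

```python
from itertools import combinations
from math import factorial


def shapley_value(player, players, value):
    """Shapley value of one player via the weighted sum over subsets of the others."""
    others = [p for p in players if p != player]
    n = len(players)
    total = 0.0
    for k in range(len(others) + 1):
        # Probability that exactly this k-subset precedes the player in a random order.
        weight = factorial(k) * factorial(n - k - 1) / factorial(n)
        for subset in combinations(others, k):
            s = frozenset(subset)
            total += weight * (value(s | {player}) - value(s))
    return total


def toy_value(coalition):
    # Hypothetical game: value 10 only if both a1 and a2 are present; a3 is a dummy.
    return 10.0 if {"a1", "a2"} <= coalition else 0.0


players = ["a1", "a2", "a3"]
phi = {p: shapley_value(p, players, toy_value) for p in players}
print(phi)
```

Here the two interchangeable contributors receive equal shares, the dummy player receives zero, and the shares sum to the value of the full coalition.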

agent, coalition, shapley value, (12 more...)

arXiv.org Artificial Intelligence

2105.15013

Country:

Europe > Sweden > Stockholm > Stockholm (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(3 more...)

Genre: Research Report (0.50)

Industry: Leisure & Entertainment > Games (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

GIPHY's Open Sourced Celebrity Detector Thinks Shaq Is Terry Crews - Codesmith Development

#artificialintelligence Mar-7-2019, 21:01:52 GMT

GIPHY recently released its machine learning model, GIPHY Celebrity Detector, under the Mozilla Public License 2.0 (MPL). While there are numerous face recognition models like OpenFace out there, they don't have the quirk of being specifically trained to accurately analyze a celebrity's face. GIPHY boasts a 98% accuracy rate. Of course, Redditors tested out this claim by conducting an experiment of their own. One Redditor achieved a great outcome when submitting Will Smith.

artificial intelligence, machine learning, social media, (8 more...)

#artificialintelligence

Industry: Media > News (0.61)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.75)


Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis
