AITopics | Agents

a3f8f584febcc88ed8cdeb30b096db34-Paper-Conference.pdf

Neural Information Processing SystemsAug-17-2025, 09:01:34 GMT

artificial intelligence, machine learning, markov game, (18 more...)

Neural Information Processing Systems

Country: North America > United States > Utah > Salt Lake County > Salt Lake City (0.04)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Communications (0.67)

Add feedback

Supplementary Material Contextual Games: Multi-Agent Learning with Side Information Pier Giuseppe Sessa, Ilija Bogunovic, Andreas Krause, Maryam Kamgarpour (NeurIPS 2020)

Neural Information Processing SystemsAug-17-2025, 08:54:25 GMT

The theoretical guarantees obtained in Section 3 rely on the following two main lemmas. 's be the distributions computed using the MW rule: p Similarly to Appendix A.1, we let As it was done in proof of Theorems 1 and Appendix A.1, Also, we now explicitly consider the adaptiveness of the adversary. The second equality follows by the law of total expectation. Consider a contextual game and assume contexts are sampled i.i.d. Hoeffding's inequality [21] shows that for any > 0 P E In this section we describe the experimental setup of the contextual traffic routing game of Section 5.

inequality, probability, sequence, (11 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.83)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.64)

Add feedback

Contextual Games: Multi-Agent Learning with Side Information

Neural Information Processing SystemsAug-17-2025, 08:54:17 GMT

Motivated by these considerations, we introduce the new class of contextual games .

contextual game, data mining, machine learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States (0.14)
Europe > Switzerland > Zürich > Zürich (0.05)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(2 more...)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.64)
Information Technology > Data Science > Data Mining > Big Data (0.47)

Add feedback

c97e7a5153badb6576d8939469f58336-Supplemental.pdf

Neural Information Processing SystemsAug-17-2025, 08:53:51 GMT

artificial intelligence, machine learning, qtran, (18 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.97)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.70)

Add feedback

c97e7a5153badb6576d8939469f58336-Paper.pdf

Neural Information Processing SystemsAug-17-2025, 08:53:47 GMT

artificial intelligence, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > New York > New York County > New York City (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
(3 more...)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (1.00)
Information Technology > Data Science (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)
(2 more...)

Add feedback

Appendix A Lower Bound In this section, we establish a lower bound on the expected regret of any algorithm for our multi-agent

Neural Information Processing SystemsAug-17-2025, 08:53:27 GMT

Our goal in this section is twofold. To avoid excessive repetition of notation and proof arguments, we purposefully leave this section not self-contained and only outline these adjustments needed. We refer an interested reader to the work of Auer et al. We leave it to future work to optimize the dependence on N and K . In our MA-MAB problem, an algorithm is allowed to "pull" a distribution over the arms In the proof of their Lemma A.1, in the explanation of their Equation (30), they cite the assumption that given the rewards observed in the first Finally, in the proof of their Theorem A.2, they again consider the probability Thus, we have the following lower bound.

algorithm, artificial intelligence, nsw, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.50)

Add feedback

c96ebeee051996333b6d70b2da6191b0-Paper.pdf

Neural Information Processing SystemsAug-17-2025, 08:53:24 GMT

algorithm, artificial intelligence, machine learning, (13 more...)

Neural Information Processing Systems

Country:

North America > Canada > Ontario > Toronto (0.28)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Africa > South Sudan > Equatoria > Central Equatoria > Juba (0.04)

Genre: Research Report (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science (0.73)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.46)

Add feedback

c96c08f8bb7960e11a1239352a479053-Paper.pdf

Neural Information Processing SystemsAug-17-2025, 08:51:53 GMT

artificial intelligence, machine learning, subgame, (18 more...)

Neural Information Processing Systems

Country:

North America > United States > Texas (0.05)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > New Jersey > Mercer County > Princeton (0.04)

Genre: Research Report (0.68)

Industry: Leisure & Entertainment > Games > Chess (0.50)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Games (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.47)

Add feedback

a38df2dd882bf7059a1914dd5547af87-Paper-Conference.pdf

Neural Information Processing SystemsAug-17-2025, 08:47:57 GMT

artificial intelligence, machine learning, natural language, (20 more...)

Neural Information Processing Systems

Country:

Europe > Netherlands (0.04)
North America > United States > California > Los Angeles County > Santa Monica (0.04)

Industry:

Leisure & Entertainment > Games (1.00)
Government (0.69)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Game Theory (0.78)

Add feedback

Appendices for No-regret Learning in Price Competitions under Consumer Reference Effects A Expanded Literature Review

Neural Information Processing SystemsAug-17-2025, 07:08:09 GMT

There are also very recent works that address the dynamic pricing problem with consumer reference effects under uncertain demand. Nevertheless, these two lines of works are oblivious to consumer reference effects. In contrast to these two papers, our work studies price competitions over an infinite time horizon where reference prices adjust over time, and provides theoretical guarantees for the convergence of pricing strategies under the partial information setting. In their setting, the subgradient for each bidder's objective is a function of all bidders' decisions as well as its budget rate (i.e. total fixed budget divided by a given time horizon), which can be B.1 Proof of Theorem 3.1 (i) By first order conditions, we know that arg max We now follow a similar proof to that of Tarski's fixed point theorem: consider the set Note that convergence is monotonic because U () is nondecreasing. This implies that under Assumption 1, the interior SNE is unique.

artificial intelligence, equation, survey article, (16 more...)

Neural Information Processing Systems

Genre: Overview (0.40)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.46)

Add feedback

Filters

Collaborating Authors

Agents

a3f8f584febcc88ed8cdeb30b096db34-Paper-Conference.pdf

Supplementary Material Contextual Games: Multi-Agent Learning with Side Information Pier Giuseppe Sessa, Ilija Bogunovic, Andreas Krause, Maryam Kamgarpour (NeurIPS 2020)

Contextual Games: Multi-Agent Learning with Side Information

c97e7a5153badb6576d8939469f58336-Supplemental.pdf

c97e7a5153badb6576d8939469f58336-Paper.pdf

Appendix A Lower Bound In this section, we establish a lower bound on the expected regret of any algorithm for our multi-agent

c96ebeee051996333b6d70b2da6191b0-Paper.pdf

c96c08f8bb7960e11a1239352a479053-Paper.pdf

a38df2dd882bf7059a1914dd5547af87-Paper-Conference.pdf

Appendices for No-regret Learning in Price Competitions under Consumer Reference Effects A Expanded Literature Review