AITopics | Agents

Collaborating Authors

Agents

News Overviews Instructional Materials AI-Alerts Classics

No-regret Learning in Price Competitions under Consumer Reference Effects

Neural Information Processing SystemsAug-17-2025, 07:08:02 GMT

We study long-run market stability for repeated price competitions between two firms, where consumer demand depends on firms' posted prices and consumers'

reference price, sne, step size, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
North America > Canada (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Industry: Education (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Game Theory (0.95)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.46)

Add feedback

We first thank all reviewers for their thoughtful comments, and we wish everyone health during these hard times

Neural Information Processing SystemsAug-17-2025, 07:07:50 GMT

We first thank all reviewers for their thoughtful comments, and we wish everyone health during these hard times. We acknowledge the simplicity in our linear demand and reference price update models. These references are also discussed in Section 2 of the paper. The gradient of revenue can be calculated using estimated elasticity, observed sales (i.e. Assumption 1 is invoked in all theorems and lemmas of Section 5, and we will clearly state this in the revised paper. In the proof of Lemma 3.2, we show that This means if firms are willing to consider both prices near zero and those sufficiently large, Assumption 1 holds.

artificial intelligence, reference price, step size, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.49)

Add feedback

f50a6c02a3fc5a3a5d4d9391f05f3efc-Paper.pdf

Neural Information Processing SystemsAug-17-2025, 07:06:45 GMT

artificial intelligence, machine learning, reinforcement learning, (19 more...)

Neural Information Processing Systems

Country:

North America > United States > Oregon (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Asia > Middle East > Jordan (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.70)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.46)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.46)

Add feedback

9e3b203e72c4e058de26d02a92a81844-Paper-Conference.pdf

Neural Information Processing SystemsAug-17-2025, 06:57:04 GMT

artificial intelligence, machine learning, trajectory, (15 more...)

Neural Information Processing Systems

Country:

Africa > Central African Republic > Ombella-M'Poko > Bimbo (0.04)
Asia > China > Shanghai > Shanghai (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.30)

Add feedback

Polynomial-Time Optimal Equilibria with a Mediator in Extensive-Form Games

Neural Information Processing SystemsAug-17-2025, 06:41:53 GMT

For common notions of correlated equilibrium in extensive-form games, computing an optimal ( e.g., welfare-maximizing) equilibrium is NP-hard. Other equilibrium notions-- communication [11] and certification [12] equilibria--augment the game with a mediator that has the power to both send and receive messages to and from the players--and, in particular, to remember the messages. In this paper, we investigate both notions in extensive-form games from a computational lens. We show that optimal equilibria in both notions can be computed in polynomial time, the latter under a natural additional assumption known in the literature. Our proof works by constructing a mediator-augmented game of polynomial size that explicitly represents the mediator's decisions and actions.

artificial intelligence, machine learning, mediator, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > Canada > Alberta > Census Division No. 11 > Edmonton Metropolitan Region > Edmonton (0.04)
North America > United States > Florida > Hillsborough County > Tampa (0.04)
(4 more...)

Industry: Leisure & Entertainment > Games (0.68)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.66)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.47)

Add feedback

9c7008aff45b5d8f0973b23e1a22ada0-Paper-Conference.pdf

Neural Information Processing SystemsAug-17-2025, 06:10:54 GMT

arxiv preprint arxiv, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Country:

North America > Canada > British Columbia (0.04)
Asia > Japan > Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.04)

Genre: Research Report > New Finding (0.46)

Industry:

Materials (0.93)
Leisure & Entertainment > Games > Computer Games (0.49)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
(2 more...)

Add feedback

Supplementary Materials of The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games

Neural Information Processing SystemsAug-17-2025, 06:01:25 GMT

We assume here that all agents share critic and actor networks, for notational convenience. Gaussian Distribution, from which an action is sampled, in continuous action spaces. In the loss functions above, B refers to the batch size and n refers to the number of agents. Multi-agent Particle-World Environment (MPE) was introduced in (Lowe et al., 2017). StarCraftII Micromanagement Challenge (SMAC) tasks were introduced in (Rashid et al., 2019).

agent, artificial intelligence, mappo, (17 more...)

Neural Information Processing Systems

Country: