AITopics

Country:

North America > United States > Texas (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > Canada > Quebec > Montreal (0.04)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.94)

Neural Information Processing Systems, Dec-24-2025, 05:18:06 GMT

Learning in two-player zero-sum partially observable Markov games with perfect recall

We study the problem of learning a Nash equilibrium (NE) in an extensive game with imperfect information (EGII) through self-play. Precisely, we focus on two-player, zero-sum, episodic, tabular EGII under the perfect-recall assumption, where the only feedback is realizations of the game (bandit feedback). In particular, the dynamics of the EGII are not known: we can only access them by sampling or interacting with a game simulator. For this learning setting, we provide the Implicit Exploration Online Mirror Descent (IXOMD) algorithm. It is a model-free algorithm with a high-probability bound on the convergence rate to the NE of order $1/\sqrt{T}$, where $T$ is the number of played games. Moreover, IXOMD is computationally efficient, as it needs to perform the updates only along the sampled trajectory.
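The two ingredients named in the abstract above, an implicit-exploration loss estimate and an online mirror descent update, can be illustrated at a single decision point. The sketch below is not the paper's IXOMD, which operates over the whole game tree; it is a simplified bandit version (in the style of Exp3-IX) that uses entropic mirror descent, i.e. exponential weights. The function name `exp3_ix` and the parameters `eta` (step size) and `gamma` (implicit-exploration bias) are assumptions made for illustration.

```python
import math
import random


def exp3_ix(loss_fn, n_actions, n_rounds, eta, gamma, rng):
    """Single-state sketch: exponential weights with implicit-exploration estimates.

    loss_fn(a) returns the loss in [0, 1] of the sampled action a (bandit feedback).
    Returns the final mixed policy as a list of probabilities.
    """
    cum_est = [0.0] * n_actions  # cumulative estimated losses

    def policy():
        # Entropic mirror descent step = softmax of negative cumulative losses.
        # Subtracting the minimum keeps the exponentials numerically stable.
        m = min(cum_est)
        w = [math.exp(-eta * (c - m)) for c in cum_est]
        total = sum(w)
        return [x / total for x in w]

    for _ in range(n_rounds):
        p = policy()
        a = rng.choices(range(n_actions), weights=p)[0]
        loss = loss_fn(a)
        # Implicit exploration: divide by p[a] + gamma instead of p[a].
        # This biases the estimate slightly downward but bounds its variance.
        cum_est[a] += loss / (p[a] + gamma)

    return policy()


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical environment: action 0 is always free, the others cost 1.
    final = exp3_ix(lambda a: 0.0 if a == 0 else 1.0, 3, 2000, 0.1, 0.05, rng)
    print(final)
```

Only the sampled action's estimate is updated in each round, which mirrors the abstract's remark that updates are needed only along the sampled trajectory.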

learning, name change, observable markov game, (7 more...)

Technology:

Information Technology > Game Theory (0.61)
Information Technology > Artificial Intelligence > Machine Learning (0.59)

arXiv.org Artificial Intelligence, Dec-3-2025

Dominated Actions in Imperfect-Information Games

Ganzfried, Sam

Dominance is a fundamental concept in game theory. In normal-form games, dominated strategies can be identified in polynomial time. As a consequence, iterative removal of dominated strategies can be performed efficiently as a preprocessing step for reducing the size of a game before computing a Nash equilibrium. For imperfect-information games in extensive form, we could convert the game to normal form and then iteratively remove dominated strategies in the same way; however, this conversion may cause an exponential blowup in game size. In this paper, we define and study the concept of dominated actions in imperfect-information games. Our main result is a polynomial-time algorithm for determining whether an action is dominated (strictly or weakly) by any mixed strategy in n-player games, which can be extended to an algorithm for iteratively removing dominated actions. This allows us to efficiently reduce the size of the game tree as a preprocessing step for Nash equilibrium computation. We explore the role of dominated actions empirically in "All In or Fold" No-Limit Texas Hold'em poker.
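The normal-form preprocessing step that the abstract above takes as its starting point can be sketched for a two-player bimatrix game. This is a deliberately simplified illustration: it only detects strict domination by another pure strategy, whereas the paper's result concerns domination by mixed strategies (which typically requires a linear program) and actions in extensive-form games. The function name `iterated_strict_dominance` and the matrix layout (`A` for the row player, `B` for the column player) are assumptions.

```python
def iterated_strict_dominance(A, B):
    """Iteratively remove pure strategies strictly dominated by a pure strategy.

    A[r][c] is the row player's payoff and B[r][c] the column player's payoff
    when row r and column c are played. Returns the surviving (rows, cols).
    """
    rows = list(range(len(A)))
    cols = list(range(len(A[0])))
    changed = True
    while changed:
        changed = False
        # A row is dominated if some other surviving row is strictly better
        # against every surviving column.
        for r in rows:
            if any(all(A[r2][c] > A[r][c] for c in cols) for r2 in rows if r2 != r):
                rows.remove(r)
                changed = True
                break  # restart the scan; the list was modified
        # Same check for the column player, using B.
        for c in cols:
            if any(all(B[r][c2] > B[r][c] for r in rows) for c2 in cols if c2 != c):
                cols.remove(c)
                changed = True
                break
    return rows, cols


if __name__ == "__main__":
    # Prisoner's dilemma: index 0 = cooperate, index 1 = defect.
    A = [[3, 0], [5, 1]]
    B = [[3, 5], [0, 1]]
    print(iterated_strict_dominance(A, B))  # ([1], [1])
```

Each pass scans at most all pairs of strategies against all opposing strategies, so the procedure is polynomial in the size of the normal form; the exponential cost the abstract warns about comes from building that normal form out of an extensive-form game.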

artificial intelligence, game theory, information, (18 more...)

2504.09716

Country: North America > United States > Texas (0.24)

Genre:

Workflow (0.55)
Research Report (0.50)

Industry: Leisure & Entertainment > Games > Poker (0.34)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)

Neural Information Processing Systems, Sep-29-2025, 14:32:23 GMT

037a595e6f4f0576a9efe43154d71c18-Paper.pdf

artificial intelligence, machine learning, precision and recall, (16 more...)

Country:

North America > Canada > Ontario > Toronto (0.28)
North America > United States (0.14)
North America > Canada > Quebec > Montreal (0.04)
(2 more...)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Data Science (0.95)

Mori, Larissa, de Oliveira, Carlos Sousa, Yih, Yuehwern, Ventresca, Mario

Assessing the Performance Gap Between Lexical and Semantic Models for Information Retrieval With Formulaic Legal Language

arXiv.org Artificial Intelligence, Jun-17-2025

Legal passage retrieval is an important task that assists legal practitioners in the time-intensive process of finding relevant precedents to support legal arguments. This study investigates the task of retrieving legal passages or paragraphs from decisions of the Court of Justice of the European Union (CJEU), whose language is highly structured and formulaic, leading to repetitive patterns. Understanding when lexical or semantic models are more effective at handling the repetitive nature of legal language is key to developing retrieval systems that are more accurate, efficient, and transparent for specific legal domains. To this end, we explore when this routinized legal language is better suited for retrieval using methods that rely on lexical and statistical features, such as BM25, or dense retrieval models trained to capture semantic and contextual information. A qualitative and quantitative analysis with three complementary metrics shows that both lexical and dense models perform well in scenarios with more repetitive usage of language, whereas BM25 performs better than the dense models in more nuanced scenarios where repetition and verbatim quotes are less prevalent and in longer queries. Our experiments also show that BM25 is a strong baseline, surpassing off-the-shelf dense models in 4 out of 7 performance metrics. However, fine-tuning a dense model on domain-specific data led to improved performance, surpassing BM25 in most metrics, and we analyze the effect of the amount of data used in fine-tuning on the model's performance and temporal robustness. The code, dataset, and appendix related to this work are available at https://github.com/larimo/lexsem-legal-ir.
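For readers unfamiliar with the lexical baseline named in the abstract above, the following is a minimal sketch of BM25 scoring. It is not taken from the paper's repository. It assumes whitespace tokenization, the common defaults `k1=1.5` and `b=0.75`, and the non-negative IDF variant `ln(1 + (N - n + 0.5) / (n + 0.5))`; the function name `bm25_scores` and the sample passages are hypothetical.

```python
import math
from collections import Counter


def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Score each document in docs against query with a basic BM25 formula."""
    tokenized = [d.lower().split() for d in docs]
    n_docs = len(tokenized)
    avg_len = sum(len(d) for d in tokenized) / n_docs
    # Document frequency: in how many documents does each term occur?
    df = Counter(term for d in tokenized for term in set(d))

    scores = []
    for d in tokenized:
        tf = Counter(d)
        score = 0.0
        for term in query.lower().split():
            if term not in tf:
                continue  # terms absent from the document contribute nothing
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            # Term-frequency saturation (k1) and length normalization (b).
            norm = tf[term] + k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


if __name__ == "__main__":
    passages = [
        "the court held that the measure was lawful",
        "the applicant lodged an appeal",
        "free movement of goods",
    ]
    print(bm25_scores("appeal applicant", passages))
```

Because the score depends only on exact term overlap, verbatim and formulaic phrasing is matched directly, which is consistent with the abstract's observation that lexical models do well when language is repetitive.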

artificial intelligence, natural language, paragraph, (18 more...)

2506.12895

Country:

Europe (1.00)
North America > United States > Indiana > Tippecanoe County (0.14)

Genre: Research Report (1.00)

Industry:

Law (1.00)
Government > Regional Government > Europe Government (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)

arXiv.org Artificial Intelligence, Jan-30-2025

Can we Retrieve Everything All at Once? ARM: An Alignment-Oriented LLM-based Retrieval Method

Chen, Peter Baile, Zhang, Yi, Cafarella, Michael, Roth, Dan

Real-world open-domain questions can be complicated, particularly when answering them involves information from multiple information sources. LLMs have demonstrated impressive performance in decomposing complex tasks into simpler steps, and previous work has used them for better retrieval in support of complex questions. However, an LLM's decomposition of questions is unaware of what data is available and how that data is organized, often leading to sub-optimal retrieval performance. Recent work on agentic RAG proposes to perform retrieval in an iterative fashion, where a follow-up query is derived as an action based on previous rounds of retrieval. While this provides one way of interacting with the data collection, agentic RAG's exploration of data is inefficient because successive queries depend on previous results rather than being guided by the organization of available data in the collection. To address this problem, we propose an LLM-based retrieval method, ARM, that aims to better align the question with the organization of the data collection by exploring relationships among data objects beyond matching the utterance of the query, thus leading to a retrieve-all-at-once solution for complex queries. We evaluated ARM on two datasets, Bird and OTT-QA. On Bird, it outperforms standard RAG with query decomposition by up to 5.2 pt in execution accuracy and agentic RAG (ReAct) by up to 15.9 pt. On OTT-QA, it achieves up to 5.5 pt and 19.3 pt higher F1 match scores compared to these approaches.

information, large language model, natural language, (16 more...)

2501.18539

Country:

Europe > Portugal (0.05)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > Ireland (0.04)
(6 more...)

Genre: Research Report (0.64)

Industry: Government (1.00)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Gurov, Dilian, Jamroga, Filip, Jamroga, Wojciech, Kamiński, Mateusz, Kurpiewski, Damian, Penczek, Wojciech, Sidoruk, Teofil

Asynchronous Agents with Perfect Recall: Model Reductions, Knowledge-Based Construction, and Model Checking for Coalitional Strategies

arXiv.org Artificial Intelligence, Dec-9-2024

Model checking of strategic abilities for agents with memory is a notoriously hard problem, and very few attempts have been made to tackle it. In this paper, we present two important steps towards this goal. First, we take the partial-order reduction scheme that was recently proved to preserve individual and coalitional abilities of memoryless agents, and show that it also works for agents with memory. Second, we take the Knowledge-Based Subset Construction, which was recently studied for synchronous concurrent games, and adapt it to preserve abilities of memoryful agents in asynchronous MAS. Along the way, we also propose a new execution semantics for strategies in asynchronous MAS, which combines elements of Concurrent Game Structures and Interleaved Interpreted Systems in a natural and intuitive way.

artificial intelligence, logic & formal reasoning, logic programming, (19 more...)

2412.06706

Country: Europe > Poland (0.28)

Genre: Research Report (0.64)

Industry: Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

Neural Information Processing Systems, Oct-10-2024, 19:28:30 GMT

Learning in two-player zero-sum partially observable Markov games with perfect recall

learning, observable markov game, perfect recall, (4 more...)

Technology:

Information Technology > Game Theory (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.40)

Belardinelli, Francesco (Imperial College London, United Kingdom and Laboratoire IBISC, Université d'Evry, France) | Lomuscio, Alessio (Imperial College London, United Kingdom) | Malvone, Vadim | Yu, Emily (Johannes Kepler University Linz, Austria)

Approximating Perfect Recall when Model Checking Strategic Abilities: Theory and Applications

Journal of Artificial Intelligence Research, Mar-17-2022

The model checking problem for multi-agent systems against specifications in the alternating-time temporal logic ATL, hence ATL∗, under perfect recall and imperfect information is known to be undecidable. To tackle this problem, in this paper we investigate a notion of bounded recall under incomplete information. We present a novel three-valued semantics for ATL∗ in this setting and analyse the corresponding model checking problem. We show that the three-valued semantics here introduced is an approximation of the classic two-valued semantics, then give a sound, albeit partial, algorithm for model checking two-valued perfect recall via its approximation as three-valued bounded recall. Finally, we extend MCMAS, an open-source model checker for ATL and other agent specifications, to incorporate bounded recall; we illustrate its use and present experimental results.

bounded recall, imperfect information, perfect recall, (13 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.1.12539

AI Access Foundation

12539

Country:

Europe > Austria > Vienna (0.14)
Europe > United Kingdom > England > Greater London > London (0.04)
Europe > Middle East > Republic of Türkiye > Istanbul Province > Istanbul (0.04)
(19 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

Li, Yanjun, Wang, Yanjing

Knowing How to Plan

arXiv.org Artificial Intelligence, Jun-21-2021

Standard Epistemic Logic (EL) mainly studies reasoning patterns of knowing that ϕ, despite early contributions by Hintikka on formulating other know-wh expressions such as knowing who and why using first-order and higher-order modal logic. In recent years, there has been a resurgence of interest in epistemic logics of know-wh, powered by new techniques for fragments of first-order modal logic based on the so-called bundle modalities, which pack a quantifier and a normal epistemic modality together [26, 24, 21]. Among the varieties of logics of know-wh, the logics of know-how have received the most attention in AI (cf.

artificial intelligence, logic, logic & formal reasoning, (20 more...)

doi: 10.4204/EPTCS.335.22

2106.11504

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > China > Tianjin Province > Tianjin (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)