AITopics | conc

We consider offline reinforcement learning (RL) in $H$-horizon Markov decision processes (MDPs) under the linear $q^\pi$-realizability assumption, where the action-value function of every policy is linear with respect to a given $d$-dimensional feature function. The hope in this setting is that learning a good policy will be possible without requiring a sample size that scales with the number of states in the MDP. Foster et al. [2021] have shown this to be impossible even under $\text{\textit{concentrability}}$, a data coverage assumption where a coefficient $C_\text{conc}$ bounds the extent to which the state-action distribution of any policy can veer off the data distribution. However, the data in this previous work was in the form of a sequence of individual transitions. This leaves open the question of whether the negative result mentioned could be overcome if the data was composed of sequences of full trajectories.

artificial intelligence, machine learning, proceedings, (8 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.58)

Add feedback

Comparative Expressivity for Structured Argumentation Frameworks with Uncertain Rules and Premises

Proietti, Carlo, Yuste-Ginel, Antonio

arXiv.org Artificial IntelligenceOct-22-2025

Modelling qualitative uncertainty in formal argumentation is essential both for practical applications and theoretical understanding. Yet, most of the existing works focus on \textit{abstract} models for arguing with uncertainty. Following a recent trend in the literature, we tackle the open question of studying plausible instantiations of these abstract models. To do so, we ground the uncertainty of arguments in their components, structured within rules and premises. Our main technical contributions are: i) the introduction of a notion of expressivity that can handle abstract and structured formalisms, and ii) the presentation of both negative and positive expressivity results, comparing the expressivity of abstract and structured models of argumentation with uncertainty. These results affect incomplete abstract argumentation frameworks, and their extension with dependencies, on the abstract side, and ASPIC+, on the structured side.

argument, artificial intelligence, natural language, (17 more...)

arXiv.org Artificial Intelligence

2510.18631

Country:

Europe > Spain (0.04)
Europe > Italy (0.04)

Genre: Research Report (1.00)

Technology: Information Technology > Artificial Intelligence > Natural Language > Explanation & Argumentation (1.00)

Add feedback

0bd7c1a579459520e4d731a14b7bda7d-Paper-Conference.pdf

Neural Information Processing SystemsOct-11-2025, 00:51:01 GMT

estimator, kernel, probability, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > California (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report > Experimental Study (0.92)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)

Add feedback

Grounding Rule-Based Argumentation Using Datalog

Diller, Martin, Gaggl, Sarah Alice, Hanisch, Philipp, Monterosso, Giuseppina, Rauschenbach, Fritz

arXiv.org Artificial IntelligenceAug-18-2025

ASPIC+ is one of the main general frameworks for rule-based argumentation for AI. Although first-order rules are commonly used in ASPIC+ examples, most existing approaches to reason over rule-based argumentation only support propositional rules. To enable reasoning over first-order instances, a preliminary grounding step is required. As groundings can lead to an exponential increase in the size of the input theories, intelligent procedures are needed. However, there is a lack of dedicated solutions for ASPIC+. Therefore, we propose an intelligent grounding procedure that keeps the size of the grounding manageable while preserving the correctness of the reasoning process. To this end, we translate the first-order ASPIC+ instance into a Datalog program and query a Datalog engine to obtain ground substitutions to perform the grounding of rules and contraries. Additionally, we propose simplifications specific to the ASPIC+ formalism to avoid grounding of rules that have no influence on the reasoning process. Finally, we performed an empirical evaluation of a prototypical implementation to show scalability.

artificial intelligence, logic & formal reasoning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2508.10976

Country:

Europe > Germany (0.28)
North America > United States (0.28)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Explanation & Argumentation (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (0.91)

Add feedback

Learning the Linear Quadratic Regulator from Nonlinear Observations

Neural Information Processing SystemsAug-15-2025, 15:32:39 GMT

The learner's goal is to P AC-learn an

assumption, decoder, probability, (17 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)
North America > Canada (0.04)

Genre:

Workflow (0.46)
Overview (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Data Science (0.92)

Add feedback

Cross-Border Legal Adaptation of Autonomous Vehicle Design based on Logic and Non-monotonic Reasoning

Yu, Zhe, Lu, Yiwei, Schafer, Burkhard, Lin, Zhe

arXiv.org Artificial IntelligenceJul-31-2025

This paper focuses on the legal compliance challenges of autonomous vehicles in a transnational context. We choose the perspective of designers and try to provide supporting legal reasoning in the design process. Based on argumentation theory, we introduce a logic to represent the basic properties of argument-based practical (normative) reasoning, combined with partial order sets of natural numbers to express priority. Finally, through case analysis of legal texts, we show how the reasoning system we provide can help designers to adapt their design solutions more flexibly in the cross-border application of autonomous vehicles and to more easily understand the legal implications of their decisions.

artificial intelligence, definition 3, natural language, (13 more...)

arXiv.org Artificial Intelligence

2507.22432

Country:

North America > United States > Illinois > Cook County > Chicago (0.05)
Europe > Ireland (0.04)
Asia > China > Fujian Province > Xiamen (0.04)
(5 more...)

Genre: Research Report (0.40)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)
Transportation > Ground > Road (0.47)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)

Add feedback

Adversarial Surrogate Risk Bounds for Binary Classification

Frank, Natalie S.

arXiv.org Machine LearningJun-12-2025

A central concern in classification is the vulnerability of machine learning models to adversarial attacks. Adversarial training is one of the most popular techniques for training robust classifiers, which involves minimizing an adversarial surrogate risk. Recent work characterized when a minimizing sequence of an adversarial surrogate risk is also a minimizing sequence of the adversarial classification risk for binary classification-- a property known as adversarial consistency . However, these results do not address the rate at which the adversarial classification risk converges to its optimal value for such a sequence of functions that minimize the adversarial surrogate. This paper provides surrogate risk bounds that quantify that convergence rate. Additionally, we derive distribution-dependent surrogate risk bounds in the standard (non-adversarial) learning setting, that may be of independent interest.

artificial intelligence, machine learning, surrogate risk, (15 more...)

arXiv.org Machine Learning

2506.09348

Country:

North America > United States > New York > New York County > New York City (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.50)

Industry: Government (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

CASE -- Condition-Aware Sentence Embeddings for Conditional Semantic Textual Similarity Measurement

Zhang, Gaifan, Zhou, Yi, Bollegala, Danushka

arXiv.org Artificial IntelligenceMar-21-2025

The meaning conveyed by a sentence often depends on the context in which it appears. Despite the progress of sentence embedding methods, it remains unclear how to best modify a sentence embedding conditioned on its context. To address this problem, we propose Condition-Aware Sentence Embeddings (CASE), an efficient and accurate method to create an embedding for a sentence under a given condition. First, CASE creates an embedding for the condition using a Large Language Model (LLM), where the sentence influences the attention scores computed for the tokens in the condition during pooling. Next, a supervised nonlinear projection is learned to reduce the dimensionality of the LLM-based text embeddings. We show that CASE significantly outperforms previously proposed Conditional Semantic Textual Similarity (C-STS) methods on an existing standard benchmark dataset. We find that subtracting the condition embedding consistently improves the C-STS performance of LLM-based text embeddings. Moreover, we propose a supervised dimensionality reduction method that not only reduces the dimensionality of LLM-based embeddings but also significantly improves their performance.

artificial intelligence, large language model, natural language, (17 more...)

arXiv.org Artificial Intelligence

2503.17279

Country:

North America > United States (0.05)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(2 more...)

Genre: Research Report (0.82)

Industry: Transportation (0.54)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

Collaborating Authors

conc

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

a70145bf8b173e4496b554ce57969e24-Supplemental.pdf

0bd7c1a579459520e4d731a14b7bda7d-Paper-Conference.pdf

Trajectory Data Suffices for Statistically Efficient Learning in Offline RL with Linear q \pi -Realizability and Concentrability

Comparative Expressivity for Structured Argumentation Frameworks with Uncertain Rules and Premises

0bd7c1a579459520e4d731a14b7bda7d-Paper-Conference.pdf

Grounding Rule-Based Argumentation Using Datalog

Learning the Linear Quadratic Regulator from Nonlinear Observations

Cross-Border Legal Adaptation of Autonomous Vehicle Design based on Logic and Non-monotonic Reasoning

Adversarial Surrogate Risk Bounds for Binary Classification

CASE -- Condition-Aware Sentence Embeddings for Conditional Semantic Textual Similarity Measurement