AITopics | coeff

Collaborating Authors

coeff

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

dab1263d1e6a88c9ba5e7e294def5e8b-Supplemental.pdf

Neural Information Processing Systems | Feb-10-2026, 17:14:46 GMT

matrix, probability, singular value, (17 more...)

Neural Information Processing Systems

Genre: Workflow (0.67)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.46)

CorrSteer: Generation-Time LLM Steering via Correlated Sparse Autoencoder Features

Cho, Seonglae, Wu, Zekun, Koshiyama, Adriano

arXiv.org Artificial Intelligence | Oct-21-2025

Sparse Autoencoders (SAEs) can extract interpretable features from large language models (LLMs) without supervision. However, their effectiveness in downstream steering tasks is limited by the requirement for contrastive datasets or large activation storage. To address these limitations, we propose CorrSteer, which selects features by correlating sample correctness with SAE activations from generated tokens at inference time. This approach uses only inference-time activations to extract more relevant features, thereby reducing spurious correlations. It also obtains steering coefficients from average activations, automating the entire pipeline. Our method shows improved task performance on QA, bias mitigation, jailbreaking prevention, and reasoning benchmarks on Gemma-2 2B and LLaMA-3.1 8B, notably achieving a +3.3% improvement in MMLU performance with 4000 samples and a +27.2% improvement in HarmBench with only 108 samples. Selected features demonstrate semantically meaningful patterns aligned with each task's requirements, revealing the underlying capabilities that drive performance. Our work establishes correlation-based selection as an effective and scalable approach for automated SAE steering across language model applications.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2508.12535

Country:

Europe > Austria > Vienna (0.14)
South America > Colombia > Meta Department > Villavicencio (0.04)
Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
(8 more...)

Genre: Research Report > New Finding (0.46)

Industry:

Law (1.00)
Education (0.93)
Leisure & Entertainment (0.67)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.89)

Understanding Tool-Integrated Reasoning

Lin, Heng, Xu, Zhongwen

arXiv.org Machine Learning | Aug-27-2025

We study why Tool-Integrated Reasoning (TIR) makes Large Language Models (LLMs) more capable. While LLMs integrated with tools like Python code interpreters show great promise, a principled theory explaining why this paradigm is effective has been missing. This work provides the first formal proof that TIR fundamentally expands an LLM's capabilities. We demonstrate that tools enable a strict expansion of the model's empirical and feasible support, breaking the capability ceiling of pure-text models by unlocking problem-solving strategies that are otherwise impossible or intractably verbose. To guide model behavior without compromising training stability and performance, we also introduce Advantage Shaping Policy Optimization (ASPO), a novel algorithm that directly modifies the advantage function to guide the policy behavior. We conduct comprehensive experiments on challenging mathematical benchmarks, leveraging a Python interpreter as the external tool. Our results show that the TIR model decisively outperforms its pure-text counterpart on the pass@k metric. Crucially, this advantage is not confined to computationally intensive problems but extends to those requiring significant abstract insight. We further identify the emergent cognitive patterns that illustrate how models learn to think with tools. Finally, we report improved tool usage behavior with early code invocation and many more interactive turns with ASPO. Overall, our work provides the first principled explanation for TIR's success, shifting the focus from the mere fact that tools work to why and how they enable more powerful reasoning.

large language model, machine learning, natural language, (21 more...)

arXiv.org Machine Learning

2508.19201

Genre: Research Report > New Finding (0.53)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

dab1263d1e6a88c9ba5e7e294def5e8b-Supplemental.pdf

Neural Information Processing Systems | Aug-16-2025, 19:00:21 GMT

Supplementary Material for "Tensor Completion Made Practical". Run Jennrich's algorithm (see Section F.2.1) to decompose T ... Here we give an outline of the proof of Theorem 3.2. This is our main contribution. A robust analysis of Jennrich's algorithm implies that we can then estimate the rank one ... See Section F and Section G for details. C.1 Basic Facts. We use the following notation: ... The following claim gives us a simple relation for this. C.4 Concentration Inequalities. Claim C.8. Say we have real numbers γ x ... In particular we will prove Theorem B.1.

artificial intelligence, machine learning, singular value, (19 more...)

Neural Information Processing Systems

Genre: Workflow (0.67)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.46)

How Overconfidence in Initial Choices and Underconfidence Under Criticism Modulate Change of Mind in Large Language Models

Kumaran, Dharshan, Fleming, Stephen M, Markeeva, Larisa, Heyward, Joe, Banino, Andrea, Mathur, Mrinal, Pascanu, Razvan, Osindero, Simon, de Martino, Benedetto, Velickovic, Petar, Patraucean, Viorica

arXiv.org Artificial Intelligence | Jul-8-2025

Large language models (LLMs) exhibit strikingly conflicting behaviors: they can appear steadfastly overconfident in their initial answers whilst at the same time being prone to excessive doubt when challenged. To investigate this apparent paradox, we developed a novel experimental paradigm, exploiting the unique ability to obtain confidence estimates from LLMs without creating memory of their initial judgments -- something impossible in human participants. We show that LLMs -- Gemma 3, GPT4o and o1-preview -- exhibit a pronounced choice-supportive bias that reinforces and boosts their estimate of confidence in their answer, resulting in a marked resistance to change their mind. We further demonstrate that LLMs markedly overweight inconsistent compared to consistent advice, in a fashion that deviates qualitatively from normative Bayesian updating. Finally, we demonstrate that these two mechanisms -- a drive to maintain consistency with prior commitments and hypersensitivity to contradictory feedback -- parsimoniously capture LLM behavior in a different domain. Together, these findings furnish a mechanistic account of LLM confidence that explains both their stubbornness and excessive sensitivity to criticism.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2507.0312

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Toward Equitable Access: Leveraging Crowdsourced Reviews to Investigate Public Perceptions of Health Resource Accessibility

Xue, Zhaoqian, Liu, Guanhong, Wei, Kai, Zhang, Chong, Zeng, Qingcheng, Hu, Songhua, Hua, Wenyue, Fan, Lizhou, Zhang, Yongfeng, Li, Lingyao

arXiv.org Artificial Intelligence | Feb-14-2025

Access to health resources is a critical determinant of public well-being and societal resilience, particularly during public health crises when demand for medical services and preventive care surges. However, disparities in accessibility persist across demographic and geographic groups, raising concerns about equity. Traditional survey methods often fall short due to limitations in coverage, cost, and timeliness. This study leverages crowdsourced data from Google Maps reviews, applying advanced natural language processing techniques, specifically ModernBERT, to extract insights on public perceptions of health resource accessibility in the United States during the COVID-19 pandemic. Additionally, we employ Partial Least Squares regression to examine the relationship between accessibility perceptions and key socioeconomic and demographic factors including political affiliation, racial composition, and educational attainment. Our findings reveal that public perceptions of health resource accessibility varied significantly across the U.S., with disparities peaking during the pandemic and slightly easing post-crisis. Political affiliation, racial demographics, and education levels emerged as key factors shaping these perceptions. These findings underscore the need for targeted interventions and policy measures to address inequities, fostering a more inclusive healthcare infrastructure that can better withstand future public health challenges.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2502.10641

Country:

North America > United States > Pennsylvania (0.04)
North America > United States > Michigan (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(6 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)
Health & Medicine > Public Health (1.00)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Communications > Social Media > Crowdsourcing (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.47)

Reinforcement learning-based statistical search strategy for an axion model from flavor

Nishimura, Satsuki, Miyao, Coh, Otsuka, Hajime

arXiv.org Artificial Intelligence | Sep-16-2024

We propose a reinforcement learning-based search strategy to explore new physics beyond the Standard Model. Reinforcement learning, a machine learning method, is a powerful approach for finding model parameters under phenomenological constraints. As a concrete example, we focus on a minimal axion model with a global $U(1)$ flavor symmetry. The learning agents succeed in finding $U(1)$ charge assignments of quarks and leptons that solve the flavor and cosmological puzzles in the Standard Model, and they find more than 150 realistic solutions for the quark sector when renormalization effects are taken into account. For the solutions found by the reinforcement learning-based analysis, we discuss the sensitivity of future experiments to the detection of an axion, which is a Nambu-Goldstone boson of the spontaneously broken $U(1)$. We also examine how fast the reinforcement learning-based search method finds the best discrete parameters in comparison with conventional optimization methods. In conclusion, the efficient parameter search provided by the reinforcement learning-based strategy enables us to perform a statistical analysis of the vast parameter space associated with the axion model from flavor.

gev 0, intrinsic value, terminal state, (16 more...)

arXiv.org Artificial Intelligence

2409.10023

Country:

North America > United States > Wisconsin > Dane County > Madison (0.04)
North America > United States > New York > New York County > New York City (0.04)
Asia > Japan > Kyūshū & Okinawa > Kyūshū > Fukuoka Prefecture > Fukuoka (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Explaining Veracity Predictions with Evidence Summarization: A Multi-Task Model Approach

Cekinel, Recep Firat, Karagoz, Pinar

arXiv.org Artificial Intelligence | Feb-9-2024

The rapid dissemination of misinformation through social media has increased the importance of automated fact-checking. Furthermore, studies on what deep neural models pay attention to when making predictions have increased in recent years. While significant progress has been made in this field, it has not yet reached a level of reasoning comparable to human reasoning. To address these gaps, we propose a multi-task explainable neural model for misinformation detection. Specifically, this work formulates the process of generating an explanation for the model's veracity prediction as a text summarization problem. Additionally, the performance of the proposed model is discussed on publicly available datasets, and the findings are evaluated against related studies.

coeff, kotonya and toni, proceedings, (14 more...)

arXiv.org Artificial Intelligence

2402.06443

Country:

Europe > United Kingdom (0.14)
North America > United States > Montana (0.04)
North America > United States > California (0.04)
(2 more...)

Genre: Research Report > New Finding (0.93)

Industry:

Media > News (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Government > Regional Government > North America Government > United States Government (0.93)
Health & Medicine > Therapeutic Area > Neurology > Multiple Sclerosis (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Communications > Social Media (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
