AITopics | pay attention

SM3-Text-to-Query: Synthetic Multi-Model Medical Text-to-Query Benchmark

Neural Information Processing Systems, Feb-17-2026, 01:58:48 GMT

Text-to-Query systems have surprisingly not been investigated so far.

information retrieval, large language model, machine learning, (19 more...)

Country:

North America > United States (0.93)
Europe > Switzerland > Zürich > Zürich (0.04)
Europe > Czechia > Prague (0.04)
(5 more...)

Genre: Research Report (0.67)

Industry:

Health & Medicine > Health Care Providers & Services (0.93)
Government > Regional Government > North America Government > United States Government (0.67)
Information Technology > Security & Privacy (0.67)
Health & Medicine > Health Care Technology > Medical Record (0.46)

Technology:

Information Technology > Databases (1.00)
Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)
(4 more...)

bd31bfd4caa85bffe07a35568182cdfa-Paper-Conference.pdf

Neural Information Processing Systems, Feb-11-2026, 16:11:28 GMT

agent, coordination pattern, factorization, (14 more...)

Country:

North America > Canada > Alberta (0.14)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
Asia > China > Tianjin Province > Tianjin (0.04)
(2 more...)

Genre: Research Report > New Finding (0.88)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Pay attention to your loss: understanding misconceptions about Lipschitz neural networks

Neural Information Processing Systems, Dec-24-2025, 14:31:06 GMT

Lipschitz-constrained networks have gathered considerable attention in the deep learning community, with uses ranging from Wasserstein distance estimation to the training of certifiably robust classifiers. However, they are still commonly considered less accurate, and their learning properties are not yet fully understood. In this paper, we clarify the matter: when it comes to classification, 1-Lipschitz neural networks enjoy several advantages over their unconstrained counterparts. First, we show that these networks are as accurate as classical ones and can fit arbitrarily difficult boundaries. Then, relying on a robustness metric that reflects operational needs, we characterize the most robust classifier: the WGAN discriminator. Next, we show that 1-Lipschitz neural networks generalize well under milder assumptions. Finally, we show that hyper-parameters of the loss are crucial for controlling the accuracy-robustness trade-off. We conclude that they exhibit appealing properties that pave the way toward provably accurate and provably robust neural networks.

misconception, name change, pay attention, (4 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Pay Attention to MLPs

Neural Information Processing Systems, Dec-24-2025, 02:24:00 GMT

Transformers have become one of the most important architectural innovations in deep learning and have enabled many breakthroughs over the past few years. Here we propose a simple network architecture, gMLP, based solely on MLPs with gating, and show that it can perform as well as Transformers in key language and vision applications. Our comparisons show that self-attention is not critical for Vision Transformers, as gMLP can achieve the same accuracy. For BERT, our model achieves parity with Transformers on pretraining perplexity and is better on some downstream NLP tasks. On finetuning tasks where gMLP performs worse, making the gMLP model substantially larger can close the gap with Transformers. In general, our experiments show that gMLP can scale as well as Transformers over increased data and compute.

name change, pay attention, transformer, (3 more...)

Technology:

Information Technology > Artificial Intelligence > Vision (0.62)
Information Technology > Artificial Intelligence > Machine Learning (0.42)

Where to Pay Attention in Sparse Training for Feature Selection?

Neural Information Processing Systems, Dec-23-2025, 18:11:17 GMT

A new line of research for feature selection based on neural networks has recently emerged. Despite its superiority to classical methods, it requires many training iterations to converge and detect the informative features. For datasets with a large number of samples or a very high dimensional feature space, the computational time becomes prohibitively long. In this paper, we present a new efficient unsupervised method for feature selection based on sparse autoencoders. In particular, we propose a new sparse training algorithm that optimizes a model's sparse topology during training to quickly pay attention to informative features.

feature selection, informative feature, sparse training, (8 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.60)

a182a8e6ebc91728b6e6b6382c9f7b1e-Paper-Datasets_and_Benchmarks_Track.pdf

Neural Information Processing Systems, Oct-10-2025, 11:48:27 GMT

dae-young kim ontology synthea, query, query language, (13 more...)

Country:

North America > United States (0.93)
Europe > Switzerland > Zürich > Zürich (0.04)
Europe > Czechia > Prague (0.04)
(5 more...)

Genre: Research Report (0.67)

Industry:

Health & Medicine > Health Care Providers & Services (0.93)
Government > Regional Government > North America Government > United States Government (0.67)
Information Technology > Security & Privacy (0.67)
Health & Medicine > Health Care Technology > Medical Record (0.46)

Technology:

Information Technology > Databases (1.00)
Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)
(4 more...)

CoT-RAG: Integrating Chain of Thought and Retrieval-Augmented Generation to Enhance Reasoning in Large Language Models

Li, Feiyang, Fang, Peng, Shi, Zhan, Khan, Arijit, Wang, Fang, Wang, Weihao, Zhang, Xin, Cui, Yongjian

arXiv.org Artificial Intelligence, Sep-11-2025

Chain-of-thought (CoT) reasoning boosts large language models' (LLMs) performance on complex tasks but faces two key limitations: a lack of reliability when relying solely on LLM-generated reasoning chains, and lower reasoning performance from natural language prompts compared with code prompts. To address these issues, we propose CoT-RAG, a novel reasoning framework with three key designs: (i) Knowledge Graph-driven CoT Generation, featuring knowledge graphs to modulate reasoning chain generation of LLMs, thereby enhancing reasoning credibility; (ii) Learnable Knowledge Case-aware RAG, which incorporates retrieval-augmented generation (RAG) into knowledge graphs to retrieve relevant sub-cases and sub-descriptions, providing LLMs with learnable information; (iii) Pseudo Program Prompting Execution, which promotes greater logical rigor by guiding LLMs to execute reasoning tasks as pseudo-programs. Evaluations on nine public datasets spanning three reasoning tasks reveal significant accuracy gains, ranging from 4.0% to 44.3%, over state-of-the-art methods. Furthermore, tests on four domain-specific datasets demonstrate exceptional accuracy and efficient execution, underscoring its practical applicability and scalability. Our code and data are available at https://github.com/hustlfy123/CoT-RAG.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2504.13534

Country:

Europe (1.00)
Asia > China (0.93)
Asia > Middle East > UAE (0.28)
North America > United States > Minnesota (0.27)

Genre: Research Report > New Finding (0.67)

Industry:

Banking & Finance > Trading (1.00)
Law (0.93)
Transportation (0.68)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

A Logic of General Attention Using Edge-Conditioned Event Models (Extended Version)

Belardinelli, Gaia, Bolander, Thomas, Watzl, Sebastian

arXiv.org Artificial Intelligence, May-21-2025

In this work, we present the first general logic of attention. Attention is a powerful cognitive ability that allows agents to focus on potentially complex information, such as logically structured propositions, higher-order beliefs, or what other agents pay attention to. This ability is a strength, as it helps to ignore what is irrelevant, but it can also introduce biases when some types of information or agents are systematically ignored. Existing dynamic epistemic logics for attention cannot model such complex attention scenarios, as they only model attention to atomic formulas. Additionally, such logics quickly become cumbersome, as their size grows exponentially in the number of agents and announced literals. Here, we introduce a logic that overcomes both limitations. First, we generalize edge-conditioned event models, which we show to be as expressive as standard event models yet exponentially more succinct (generalizing both standard event models and generalized arrow updates). Second, we extend attention to arbitrary formulas, allowing agents to also attend to other agents' beliefs or attention. Our work treats attention as a modality, like belief or awareness. We introduce attention principles that impose closure properties on that modality and that can be used in its axiomatization. Throughout, we illustrate our framework with examples of AI agents reasoning about human attentional biases, demonstrating how such agents can discover attentional biases.

artificial intelligence, edge-conditioned event model, event model, (16 more...)

arXiv.org Artificial Intelligence

2505.14539

Country: Europe > United Kingdom > England (0.27)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.48)

Storybooth: Training-free Multi-Subject Consistency for Improved Visual Storytelling

Singh, Jaskirat, Chen, Junshen Kevin, Kohler, Jonas, Cohen, Michael

arXiv.org Artificial Intelligence, Apr-9-2025

Training-free consistent text-to-image generation depicting the same subjects across different images is a topic of widespread recent interest. Existing works in this direction predominantly rely on cross-frame self-attention, which improves subject consistency by allowing tokens in each frame to pay attention to tokens in other frames during self-attention computation. While useful for single subjects, we find that it struggles when scaling to multiple characters. In this work, we first analyze the reason for these limitations. Our exploration reveals that the primary issue stems from self-attention leakage, which is exacerbated when trying to ensure consistency across multiple characters. This happens when tokens from one subject pay attention to other characters, causing them to appear like each other (e.g., a dog appearing like a duck). Motivated by these findings, we propose StoryBooth: a training-free approach for improving multi-character consistency. In particular, we first leverage multi-modal chain-of-thought reasoning and region-based generation to a priori localize the different subjects across the desired story outputs. The final outputs are then generated using a modified diffusion model which consists of two novel layers: 1) a bounded cross-frame self-attention layer for reducing inter-character attention leakage, and 2) a token-merging layer for improving consistency of fine-grain subject details. Through both qualitative and quantitative results, we find that the proposed approach surpasses prior state-of-the-art, exhibiting improved consistency across both multiple characters and fine-grain subject details.

consistency, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2504.058

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.94)

Can YOU tell what this dog is thinking? Take the test - as study reveals humans are terrible at reading canine emotions

Daily Mail - Science & tech, Mar-11-2025, 13:57:44 GMT

If you have a dog, you might think you have a strong connection with them. But according to a new study, you've probably been reading your pet's emotions all wrong. Although humans and dogs have a unique bond, scientists from Arizona State University say that we are terrible at understanding canine emotions. Participants were shown videos of a dog reacting to positive situations, such as seeing their lead, or negative situations, such as being presented with the dreaded vacuum cleaner. Instead of actually trying to understand what the dog is feeling, the researchers found that people tend to 'project human emotions onto their pets'.

artificial intelligence, emotion, vacuum cleaner, (14 more...)

Country: North America > United States > Arizona (0.26)

Genre: Research Report > New Finding (0.92)

Industry: Health & Medicine (0.32)

Technology: Information Technology > Artificial Intelligence > Cognitive Science > Emotion (0.36)
