As described in Section 3.2, we implement categorical attention by associating each attention head [...]. In this example, an attention head (left) calculates the histogram for each position. An MLP (top right) reads the histogram values and outputs a value of 0 if the histogram value is greater than one, and 4 otherwise. Inspecting the corresponding classifier weights (bottom right), we see that an output value of 0 (meaning a histogram count greater than 1) increases the likelihood that the double-histogram value is 1 or 2, and decreases the likelihood of larger values. Because the input length is limited to 8, this reflects the fact that if one number appears many times, it is unlikely that another number appears the same number of times. An output of 4 (meaning a histogram count of 1) increases the likelihood that the double-histogram value is greater than 1.
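To make the caption concrete, here is a minimal plain-Python sketch (ours, not the paper's code) of the two quantities involved: hist maps each position to the count of its token, and double-hist maps each position to the number of distinct tokens that share that count.

```python
from collections import Counter

def hist(tokens):
    """For each position, how often that position's token appears in the sequence."""
    counts = Counter(tokens)
    return [counts[t] for t in tokens]

def double_hist(tokens):
    """For each position, how many distinct tokens share its histogram value."""
    h = hist(tokens)
    # Map each histogram value to the set of distinct tokens that have it.
    by_count = {}
    for t, c in zip(tokens, h):
        by_count.setdefault(c, set()).add(t)
    return [len(by_count[c]) for c in h]

tokens = list("aabbc")        # short input, as in the paper (length <= 8)
print(hist(tokens))           # [2, 2, 2, 2, 1]
print(double_hist(tokens))    # [2, 2, 2, 2, 1]  ('a' and 'b' both occur twice)
```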
Learning Transformer Programs
Recent research in mechanistic interpretability has attempted to reverse-engineer Transformer models by carefully inspecting network weights and activations. However, these approaches require considerable manual effort and still fall short of providing complete, faithful descriptions of the underlying algorithms. In this work, we introduce a procedure for training Transformers that are mechanistically interpretable by design. We build on RASP [Weiss et al., 2021], a programming language that can be compiled into Transformer weights. Instead of compiling human-written programs into Transformers, we design a modified Transformer that can be trained using gradient-based optimization and then automatically converted into a discrete, human-readable program. We refer to these models as Transformer Programs. To validate our approach, we learn Transformer Programs for a variety of problems, including an in-context learning task and a suite of algorithmic problems (e.g., sorting and recognizing Dyck languages).
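The abstract's central move, learning discrete structure with gradient-based optimization, is commonly implemented with a Gumbel-softmax relaxation. Below is a minimal, hypothetical sketch of that pattern, not the authors' code: a categorical choice (e.g., which predicate an attention head uses, here just 4 unnamed candidates with made-up supervision) is relaxed during training and read off by argmax afterward.

```python
import torch
import torch.nn.functional as F

num_choices = 4                  # hypothetical: 4 candidate predicates a head could use
logits = torch.zeros(num_choices, requires_grad=True)
opt = torch.optim.Adam([logits], lr=0.1)

target = 2                       # hypothetical supervision: choice 2 is the "right" predicate
for step in range(200):
    # hard=True yields a one-hot sample in the forward pass with a
    # straight-through gradient in the backward pass.
    sample = F.gumbel_softmax(logits, tau=1.0, hard=True)
    loss = 1.0 - sample[target]  # reward picking the target choice
    opt.zero_grad()
    loss.backward()
    opt.step()

# Discretization step: after training, the choice is read off as a program.
print("learned discrete choice:", logits.argmax().item())  # -> 2
```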
Entropy testing and its application to testing Bayesian networks
This paper studies the problem of entropy identity testing: given sample access to a distribution p and a fully described distribution q (both discrete distributions over a domain of size k), and the promise that either p = q or |H(p) - H(q)| ≥ ε, where H(·) denotes the Shannon entropy, a tester needs to distinguish between the two cases with high probability.
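For intuition, here is a minimal plug-in sketch of the quantities involved. This is illustrative only: the naive plug-in estimator is biased, and the paper's contribution concerns the sample complexity of a sound tester, not this simple rule.

```python
import math
from collections import Counter

def shannon_entropy(dist):
    """H(p) = -sum_i p_i * log(p_i), in nats; terms with p_i = 0 contribute 0."""
    return -sum(p * math.log(p) for p in dist if p > 0)

def empirical_entropy(samples):
    """Plug-in entropy estimate from the empirical distribution of the samples."""
    counts = Counter(samples)
    n = len(samples)
    return shannon_entropy([c / n for c in counts.values()])

def entropy_identity_test(samples, q, eps):
    """Naive decision rule: declare p = q if the entropy gap looks small."""
    gap = abs(empirical_entropy(samples) - shannon_entropy(q))
    return "p = q" if gap < eps / 2 else "|H(p) - H(q)| >= eps"

q = [0.25, 0.25, 0.25, 0.25]
samples = [0, 1, 2, 3] * 250                        # 1000 draws matching q exactly
print(entropy_identity_test(samples, q, eps=0.2))   # -> "p = q"
```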
Jungo Kasai
Q: How many home runs has Shohei Ohtani hit?
Why was the dataset created? RealTime QA was created to provide a dynamic platform to benchmark question answering about the present time: it asks questions about the current world, and the answers (e.g., the number of Shohei Ohtani's home runs) change in real time, challenging QA systems to keep up. RealTime QA may identify areas of potential research, such as improving how QA systems deal with unanswerable questions.
What are the instances?
Right this way: Can VLMs Guide Us to See More to Answer Questions?
In question-answering scenarios, humans can assess whether the available information is sufficient and seek additional information if necessary, rather than providing a forced answer. In contrast, Vision Language Models (VLMs) typically generate direct, one-shot responses without evaluating the sufficiency of the information. To investigate this gap, we identify a critical and challenging task in the Visual Question Answering (VQA) scenario: can VLMs indicate how to adjust an image when the visual information is insufficient to answer a question? This capability is especially valuable for assisting visually impaired individuals who often need guidance to capture images correctly. To evaluate this capability of current VLMs, we introduce a human-labeled dataset as a benchmark for this task. Additionally, we present an automated framework that generates synthetic training data by simulating "where to know" scenarios. Our empirical results show significant performance improvements in mainstream VLMs when fine-tuned with this synthetic data. This study demonstrates the potential to narrow the gap between information assessment and acquisition in VLMs, bringing their performance closer to humans.
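As a rough illustration of what such an automated framework might look like, here is a hypothetical sketch; the function and field names are ours, not the paper's. The idea is to crop an image so that the evidence region needed to answer a question falls outside the frame, and to record the direction the camera should move as the supervision label.

```python
from PIL import Image

def make_guidance_example(image_path, answer_box, direction="left"):
    """Hypothetical generator: answer_box = (x0, y0, x1, y1) is the region
    containing the visual evidence needed to answer the question."""
    img = Image.open(image_path)
    w, h = img.size
    x0, y0, x1, y1 = answer_box
    if direction == "left":
        # Keep only the part of the image to the right of the evidence,
        # so the correct guidance becomes "move the camera left".
        crop = img.crop((min(x1, w - 1), 0, w, h))
    else:
        # Symmetric case: keep only the part left of the evidence.
        crop = img.crop((0, 0, max(x0, 1), h))
        direction = "right"
    return {"image": crop, "label": f"move {direction}"}
```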