
5d0d5594d24f0f955548f0fc0ff83d10-Supplemental.pdf

Neural Information Processing Systems

One might consider "2V 7 V" and "V 84V" to be different patterns or invariants, but at a higher level of abstraction they can both represent the concept of a repeated symbol, irrespective of the position of the repeating item.
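As a minimal illustration of this higher-level invariant, a predicate that tests for a repeated symbol regardless of its position accepts both patterns. The helper name and the sequences below are illustrative, not from the paper:

```python
def has_repeated_symbol(seq):
    """Return True if any symbol occurs more than once in seq,
    irrespective of where the repeats appear."""
    seen = set()
    for sym in seq:
        if sym in seen:
            return True
        seen.add(sym)
    return False

# Both patterns satisfy the same abstract invariant, even though
# the repeated symbol sits at different positions in each.
print(has_repeated_symbol(["2", "V", "7", "V"]))  # True: 'V' repeats
print(has_repeated_symbol(["V", "8", "4", "V"]))  # True: 'V' repeats
print(has_repeated_symbol(["a", "b", "c"]))       # False: no repeats
```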




Neural Path Features and Neural Path Kernel: Understanding the role of gates in deep learning Chandrashekar Lakshminarayanan and Amit Vikram Singh

Neural Information Processing Systems

A deep neural network (DNN) with ReLU activations has many gates, and the on/off status of each gate changes across input examples as well as network weights. For a given input example, only a subset of gates are active, i.e., on, and the sub-network of weights connected to these active gates is responsible for producing the output.


mhealth_ood_neurips_2021.pdf

Neural Information Processing Systems

In this section, we provide screenshots and a list of the examples that were used in the user study. Note that the name of the institution is redacted for review. The first screenshot shows an example interface for a skin cancer classifier. Figure 4: Interface to display different input data types. Figure 5: List of input examples used in the user study.


Program Semantic Inequivalence Game with Large Language Models

Miceli-Barone, Antonio Valerio, Belle, Vaishak, Payani, Ali

arXiv.org Artificial Intelligence

Large Language Models (LLMs) can achieve strong performance on everyday coding tasks, but they can fail on complex tasks that require non-trivial reasoning about program semantics. Finding training examples to teach LLMs to solve these tasks can be challenging. In this work, we explore a method to synthetically generate code-reasoning training data based on a semantic inequivalence game, SInQ: a generator agent creates program variants that are semantically distinct, derived from a dataset of real-world programming tasks, while an evaluator agent has to identify input examples that cause the original programs and the generated variants to diverge in their behaviour, with the agents training each other semi-adversarially. We prove that this setup enables theoretically unlimited improvement through self-play in the limit of infinite computational resources. We evaluated our approach on multiple code generation and understanding benchmarks, including cross-language vulnerability detection (Lu et al., 2021), where our method improves vulnerability detection in C/C++ code despite being trained exclusively on Python code, and the challenging Python builtin identifier swap benchmark (Miceli-Barone et al., 2023); whereas modern LLMs still struggle with this benchmark, our approach yields substantial improvements. We release the code needed to replicate the experiments, as well as the generated synthetic data, which can be used to fine-tune LLMs.
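The evaluator's role described above, finding an input on which an original program and a generated variant diverge, can be sketched as a simple randomized search. Everything below (the example task, the variant, the search parameters) is a hypothetical stand-in, not the paper's actual agents:

```python
import random

def original(xs):
    # Hypothetical example task: sum of squares of a list of ints.
    return sum(x * x for x in xs)

def variant(xs):
    # A semantically distinct variant: behaves identically except
    # on the empty list, where it returns None instead of 0.
    if not xs:
        return None
    return sum(x * x for x in xs)

def find_diverging_input(f, g, trials=1000, seed=0):
    """Evaluator role: search for an input on which f and g disagree."""
    rng = random.Random(seed)
    for _ in range(trials):
        xs = [rng.randint(-5, 5) for _ in range(rng.randint(0, 4))]
        if f(xs) != g(xs):
            return xs
    return None  # no witness found: the programs may be equivalent

witness = find_diverging_input(original, variant)
print(witness)  # the empty list is the only input where the two differ
```

A real evaluator would of course face much harder programs; the point is only that semantic inequivalence is witnessed by a concrete diverging input.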


Neural Path Features and Neural Path Kernel : Understanding the role of gates in deep learning

Neural Information Processing Systems

Rectified linear unit (ReLU) activations can also be thought of as 'gates', which either pass or stop their pre-activation input when they are 'on' (when the pre-activation input is positive) or 'off' (when the pre-activation input is negative), respectively. A deep neural network (DNN) with ReLU activations has many gates, and the on/off status of each gate changes across input examples as well as network weights. For a given input example, only a subset of gates are 'active', i.e., on, and the sub-network of weights connected to these active gates is responsible for producing the output. At randomised initialisation, the active sub-network corresponding to a given input example is random. During training, as the weights are learnt, the active sub-networks are also learnt, and could hold valuable information. To this end, we encode the on/off state of the gates for a given input in a novel 'neural path feature' (NPF), and the weights of the DNN are encoded in a novel 'neural path value' (NPV).
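The gating view of ReLU can be illustrated with a small NumPy sketch (the toy layer sizes and random weights are assumptions for illustration, not the paper's setup): the network's output equals the output of the active sub-network obtained by zeroing the weights that feed the 'off' gates.

```python
import numpy as np

rng = np.random.default_rng(0)

# A toy two-layer ReLU network with hypothetical sizes.
W1 = rng.standard_normal((4, 3))   # layer-1 weights
W2 = rng.standard_normal((1, 4))   # layer-2 weights

def forward_with_gates(x):
    """Return the network output together with the on/off gate states."""
    pre = W1 @ x
    gates = (pre > 0).astype(float)  # 1 where a gate is 'on', 0 where 'off'
    hidden = gates * pre             # ReLU viewed as gating the pre-activation
    return W2 @ hidden, gates

x = rng.standard_normal(3)
out, gates = forward_with_gates(x)

# The same output is produced by the active sub-network alone:
# zero out the rows of W1 that feed gates that are 'off'.
W1_active = W1 * gates[:, None]
out_active = W2 @ (W1_active @ x)
print(np.allclose(out, out_active))  # True
```

Changing `x` changes which gates fire, and hence which sub-network produces the output, which is exactly the input-dependence described above.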


Do Llamas Work in English? On the Latent Language of Multilingual Transformers

Wendler, Chris, Veselovsky, Veniamin, Monea, Giovanni, West, Robert

arXiv.org Artificial Intelligence

We ask whether multilingual language models trained on unbalanced, English-dominated corpora use English as an internal pivot language -- a question of key importance for understanding how language models function and the origins of linguistic bias. Focusing on the Llama-2 family of transformer models, our study uses carefully constructed non-English prompts with a unique correct single-token continuation. From layer to layer, transformers gradually map an input embedding of the final prompt token to an output embedding from which next-token probabilities are computed. Tracking intermediate embeddings through their high-dimensional space reveals three distinct phases, whereby intermediate embeddings (1) start far away from output token embeddings; (2) already allow for decoding a semantically correct next token in the middle layers, but give higher probability to its version in English than in the input language; (3) finally move into an input-language-specific region of the embedding space. We cast these results into a conceptual model where the three phases operate in "input space", "concept space", and "output space", respectively. Crucially, our evidence suggests that the abstract "concept space" lies closer to English than to other languages, which may have important consequences regarding the biases held by multilingual language models.
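The layer-by-layer decoding described above can be sketched in the style of a 'logit lens' probe: project each intermediate embedding through the output unembedding as if it were the final-layer embedding, and read off next-token probabilities. The matrices and sizes below are toy stand-ins, not Llama-2's actual weights:

```python
import numpy as np

rng = np.random.default_rng(1)

vocab, d = 5, 8
U = rng.standard_normal((vocab, d))  # toy unembedding matrix

# Hypothetical intermediate embeddings of the final prompt token,
# one per layer (in a real model these come from the residual stream).
layer_embeddings = [rng.standard_normal(d) for _ in range(3)]

def logit_lens(h):
    """Decode an intermediate embedding by projecting it onto the
    output vocabulary, as if it were the final-layer embedding."""
    logits = U @ h
    probs = np.exp(logits - logits.max())  # stable softmax
    return probs / probs.sum()

for layer, h in enumerate(layer_embeddings):
    probs = logit_lens(h)
    print(f"layer {layer}: argmax token = {probs.argmax()}")
```

Tracking which token (and which language's token) receives the highest probability at each layer is what reveals the three phases the abstract describes.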


MaNtLE: Model-agnostic Natural Language Explainer

Menon, Rakesh R., Zaman, Kerem, Srivastava, Shashank

arXiv.org Artificial Intelligence

Understanding the internal reasoning behind the predictions of machine learning systems is increasingly vital, given their rising adoption and acceptance. While previous approaches, such as LIME, generate algorithmic explanations by attributing importance to input features for individual examples, recent research indicates that practitioners prefer examining language explanations that explain sub-groups of examples. In this paper, we introduce MaNtLE, a model-agnostic natural language explainer that analyzes multiple classifier predictions and generates faithful natural language explanations of classifier rationale for structured classification tasks. MaNtLE uses multi-task training on thousands of synthetic classification tasks to generate faithful explanations. Simulated user studies indicate that, on average, MaNtLE-generated explanations are at least 11% more faithful compared to LIME and Anchors explanations across three tasks. Human evaluations demonstrate that users can better predict model behavior using explanations from MaNtLE compared to other techniques.