AITopics | taxnodes:Technology: Instructional Materials

Online Adaptation of Language Models with a Memory of Amortized Contexts Jihoon Tack, Eric Mitchell

Neural Information Processing SystemsMar-27-2025, 13:36:57 GMT

Due to the rapid generation and dissemination of information, large language models (LLMs) quickly run out of date despite enormous development costs. To address the crucial need to keep models updated, online learning has emerged as a critical tool when utilizing LLMs for real-world applications. However, given the ever-expanding corpus of unseen documents and the large parameter space of modern LLMs, efficient adaptation is essential. To address these challenges, we propose Memory of Amortized Contexts (MAC), an efficient and effective online adaptation framework for LLMs with strong knowledge retention. We propose a feature extraction and memory-augmentation approach to compress and extract information from new documents into compact modulations stored in a memory bank.

large language model, machine learning, natural language, (17 more...)

Neural Information Processing Systems

Country: Europe (0.14)

Genre:

Research Report > Experimental Study (1.00)
Instructional Material (0.67)

Industry: Education > Educational Setting > Online (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

combined

James Vuckovic

Neural Information Processing SystemsMar-27-2025, 13:32:47 GMT

artificial intelligence, bayesian inference, machine learning, (18 more...)

Neural Information Processing Systems

Country: North America > United States (0.28)

Genre:

Research Report > New Finding (0.46)
Instructional Material > Course Syllabus & Notes (0.46)

Industry: Energy (0.46)

Add feedback

Towards Multi-dimensional Explanation Alignment for Medical Classification

Neural Information Processing SystemsMar-27-2025, 13:28:09 GMT

The lack of interpretability in the field of medical image analysis has significant ethical and legal implications. Existing interpretable methods in this domain encounter several challenges, including dependency on specific models, difficulties in understanding and visualization, as well as issues related to efficiency. To address these limitations, we propose a novel framework called Med-MICN (Medical Multidimensional Interpretable Concept Network). Med-MICN provides interpretability alignment for various angles, including neural symbolic reasoning, concept semantics, and saliency maps, which are superior to current interpretable methods. Its advantages include high prediction accuracy, interpretability across multiple dimensions, and automation through an end-to-end concept labeling process that reduces the need for extensive human training effort when working with new datasets. To demonstrate the effectiveness and interpretability of Med-MICN, we apply it to four benchmark datasets and compare it with baselines. The results clearly demonstrate the superior performance and interpretability of our Med-MICN.

data mining, machine learning, natural language, (20 more...)

Neural Information Processing Systems

Country: Asia > China (0.14)

Genre:

Research Report > Experimental Study (0.93)
Instructional Material (0.87)

Industry:

Health & Medicine > Therapeutic Area (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
(4 more...)

Add feedback

Towards General Loop Invariant Generation: A Benchmark of Programs with Memory Manipulation Chang Liu

Neural Information Processing SystemsMar-27-2025, 13:21:27 GMT

Program verification is vital for ensuring software reliability, especially in the context of increasingly complex systems. Loop invariants, remaining true before and after each iteration of loops, are crucial for this verification process. Traditional provers and machine learning based methods for generating loop invariants often require expert intervention or extensive labeled data, and typically only handle numerical property verification.

large language model, loop invariant, machine learning, (19 more...)

Neural Information Processing Systems

Country:

North America > United States (0.68)
Europe (0.67)

Genre:

Instructional Material > Course Syllabus & Notes (0.67)
Research Report > New Finding (0.46)

Technology:

Information Technology > Software Engineering (1.00)
Information Technology > Software (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(2 more...)

Add feedback

A Evaluation Tasks Sample one iteration of FEEDBACK - REFINE Sentiment Reversal

Neural Information Processing SystemsMar-27-2025, 12:39:18 GMT

Add an additional paragraph after the existing text with the following text: "We also offer a variety

large language model, machine learning, reinforcement learning, (18 more...)

Neural Information Processing Systems

Genre:

Personal (0.46)
Instructional Material (0.46)

Industry: Leisure & Entertainment (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.67)

Add feedback

Identifying Latent State-Transition Processes for Individualized Reinforcement Learning

Neural Information Processing SystemsMar-27-2025, 12:13:54 GMT

The application of reinforcement learning (RL) involving interactions with individuals has grown significantly in recent years. These interactions, influenced by factors such as personal preferences and physiological differences, causally influence state transitions, ranging from health conditions in healthcare to learning progress in education. As a result, different individuals may exhibit different state-transition processes. Understanding individualized state-transition processes is essential for optimizing individualized policies. In practice, however, identifying these state-transition processes is challenging, as individual-specific factors often remain latent. In this paper, we establish the identifiability of these latent factors and introduce a practical method that effectively learns these processes from observed state-action trajectories. Experiments on various datasets show that the proposed method can effectively identify latent state-transition processes and facilitate the learning of individualized RL policies.

data mining, machine learning, reinforcement learning, (18 more...)

Neural Information Processing Systems

Country: North America > United States > California (0.14)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Instructional Material (0.67)

Industry:

Health & Medicine > Consumer Health (1.00)
Education (1.00)
Information Technology > Security & Privacy (0.92)
Health & Medicine > Therapeutic Area > Endocrinology (0.67)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(3 more...)

Add feedback

Prospective Learning: Learning for a Dynamic Future Ashwin De Silva,1 Rubing Yang,2

Neural Information Processing SystemsMar-27-2025, 11:48:58 GMT

In real-world applications, the distribution of the data, and our goals, evolve over time. The prevailing theoretical framework for studying machine learning, namely probably approximately correct (PAC) learning, largely ignores time. As a consequence, existing strategies to address the dynamic nature of data and goals exhibit poor real-world performance. This paper develops a theoretical framework called "Prospective Learning" that is tailored for situations when the optimal hypothesis changes over time. In PAC learning, empirical risk minimization (ERM) is known to be consistent.

large language model, learner, machine learning, (18 more...)

Neural Information Processing Systems

Country:

North America > United States (0.67)
Europe > United Kingdom > England (0.45)

Genre:

Research Report (1.00)
Instructional Material (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(2 more...)

Add feedback

8e69a97cbdd91ac0808603fa589d6c17-Supplemental-Conference.pdf

Neural Information Processing SystemsMar-27-2025, 11:43:41 GMT

artificial intelligence, evolutionary algorithm, machine learning, (18 more...)

Neural Information Processing Systems

Genre: Instructional Material (0.45)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.67)

Add feedback

Variance-Reduced Gradient Estimation via Noise-Reuse in Online Evolution Strategies

Neural Information Processing SystemsMar-27-2025, 11:43:37 GMT

Unrolled computation graphs are prevalent throughout machine learning but present challenges to automatic differentiation (AD) gradient estimation methods when their loss functions exhibit extreme local sensitivtiy, discontinuity, or blackbox characteristics. In such scenarios, online evolution strategies methods are a more capable alternative, while being more parallelizable than vanilla evolution strategies (ES) by interleaving partial unrolls and gradient updates. In this work, we propose a general class of unbiased online evolution strategies methods. We analytically and empirically characterize the variance of this class of gradient estimators and identify the one with the least variance, which we term Noise-Reuse Evolution Strategies (NRES).

evolutionary algorithm, machine learning, variance, (17 more...)

Neural Information Processing Systems

Genre: Instructional Material (0.46)

Technology: