AITopics | current task

Gradient-Guided Epsilon Constraint Method for Online Continual Learning

Neural Information Processing SystemsJun-22-2026, 02:12:27 GMT

Online Continual Learning (OCL) requires models to learn sequentially from data streams with limited memory. Rehearsal-based methods, particularly Experience Replay (ER), are commonly used in OCL scenarios. This paper revisits ER through the lens of ϵ-constraint optimization, revealing that ER implicitly employs a soft constraint on past task performance, with its weighting parameter post-hoc defining a slack variable. While effective, ER's implicit and fixed slack strategy has limitations: it can inadvertently lead to updates that negatively impact generalization, and its fixed trade-off between plasticity and stability may not optimally balance current streaming with memory retention, potentially overfitting to the memory buffer. To address these shortcomings, we propose the Gradient-Guided Epsilon Constraint (GEC) method for online continual learning. GEC explicitly formulates the OCL update as an ϵ-constraint optimization problem, which minimize the loss on the current task data and transform the stability objective as constraints and propose a gradient-guided method to dynamically adjusts the update direction based on whether the performance on memory samples violates a predefined slack tolerance ε: if forgetting exceeds this tolerance, GEC prioritizes constraint satisfaction; otherwise, it focuses on the current task while controlling the rate of increase in memory loss. Empirical evaluations on standard OCL benchmarks demonstrate GEC's ability to achieve a superior trade-off, leading to improved overall performance.

artificial intelligence, constraint-based reasoning, optimization problem, (15 more...)

Neural Information Processing Systems

Country: Asia > China (0.28)

Genre: Research Report > Experimental Study (1.00)

Industry:

Health & Medicine (1.00)
Education > Educational Setting > Online (0.92)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (1.00)

Add feedback

RECAP: Recursive Context-Aware Reasoning and Planning for Large Language Model Agents

Neural Information Processing SystemsJun-19-2026, 11:01:33 GMT

Long-horizon tasks requiring multi-step reasoning and dynamic re-planning remain challenging for large language models (LLMs). Sequential prompting methods are prone to context drift, loss of goal information, and recurrent failure cycles, while hierarchical prompting methods often weaken cross-level continuity or incur substantial runtime overhead. We introduce ReCAP (Recursive Context-Aware Reasoning and Planning), a hierarchical framework with shared context for reasoning and planning in LLMs. ReCAP combines three key mechanisms: (i) plan-ahead decomposition, in which the model generates a full subtask list, executes the first item, and refines the remainder; (ii) structured re-injection of parent plans, maintaining consistent multi-level context during recursive return; and (iii) memory-efficient execution, bounding the active prompt so costs scale linearly with task depth. Together these mechanisms align high-level goals with low-level actions, reduce redundant prompting, and preserve coherent context updates across recursion. Experiments demonstrate that ReCAP substantially improves subgoal alignment and success rates on various long-horizon reasoning benchmarks, achieving a 32% gain on synchronous Robotouille and a 29% improvement on asynchronous Robotouille under the strict pass@1 protocol.

large language model, machine learning, natural language, (21 more...)

Neural Information Processing Systems

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.66)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

e991e5587c1daa49bbf9a818b3f02f9a-Paper-Conference.pdf

Neural Information Processing SystemsApr-30-2026, 04:09:49 GMT

artificial intelligence, machine learning, trire, (18 more...)

Neural Information Processing Systems

Country: Europe (0.28)

Genre: Research Report (0.93)

Industry:

Health & Medicine (0.93)
Education (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

d5cd70b708f726737e2ebace18c3f71b-Paper-Conference.pdf

Neural Information Processing SystemsFeb-18-2026, 07:22:01 GMT

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country: Asia > China > Jiangsu Province (0.04)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.93)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Communications (0.95)
(2 more...)

Add feedback

e991e5587c1daa49bbf9a818b3f02f9a-Paper-Conference.pdf

Neural Information Processing SystemsFeb-17-2026, 18:03:50 GMT

artificial intelligence, machine learning, trire, (18 more...)

Neural Information Processing Systems

Country: Europe > Netherlands > North Brabant > Eindhoven (0.04)

Genre: Research Report (0.93)

Industry:

Health & Medicine (0.93)
Education (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

6b44ee74539ea77d6a0d50d468724371-Paper-Conference.pdf

Neural Information Processing SystemsFeb-15-2026, 15:12:49 GMT

artificial intelligence, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Country: Asia > China > Jiangsu Province > Nanjing (0.04)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Instructional Material > Online (0.71)

Industry:

Information Technology (1.00)
Education > Educational Setting > Online (0.68)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)

Add feedback

Retrospective Adversarial Replay for Continual Learning

Neural Information Processing SystemsFeb-11-2026, 13:07:03 GMT

To avoid these problems, this paper proposes a method, "Retrospective Adversarial

artificial intelligence, continual learning, machine learning, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > Maryland (0.04)
Europe > France (0.04)

Industry:

Health & Medicine (0.68)
Information Technology > Security & Privacy (0.68)
Education > Educational Setting > Online (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)

Add feedback

dc1913d422398c25c5f0b81cab94cc87-Paper.pdf

Neural Information Processing SystemsFeb-10-2026, 18:00:39 GMT

agent, auxiliary reward, side effect, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Asia > Middle East > Jordan (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)

Add feedback

LifelongPolicyGradientLearning ofFactoredPolicies forFasterTrainingWithoutForgetting

Neural Information Processing SystemsFeb-9-2026, 16:35:58 GMT

We provide a novel method for lifelong policy gradient learning that trains lifelong function approximators directly via policygradients, allowing the agent to benefit from accumulated knowledge throughout the entire training process.

artificial intelligence, information, machine learning, (16 more...)

Neural Information Processing Systems

Country: