AITopics | pg-ella

Collaborating Authors

pg-ella

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

max

Neural Information Processing SystemsFeb-9-2026, 16:36:04 GMT

Theonlyexception wasthenumber oflatentcomponents used for the Walker-2Dbody-partsdomain, as we found empirically thatk = 5led to saturation of the learning process early on.

ewc1, ewc2, machine learning, (17 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.34)

Add feedback

Review for NeurIPS paper: Lifelong Policy Gradient Learning of Factored Policies for Faster Training Without Forgetting

Neural Information Processing SystemsNov-14-2025, 22:34:36 GMT

This should be emphasised similar to the argument in the introduction.

artificial intelligence, learning, machine learning, (15 more...)

Neural Information Processing Systems

Genre: Summary/Review (0.49)

Industry: Education (0.32)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

a58149d355f02887dfbe55ebb2b64ba3-AuthorFeedback.pdf

Neural Information Processing SystemsSep-25-2025, 13:48:06 GMT

artificial intelligence, machine learning, pg-ella, (18 more...)

Neural Information Processing Systems

Industry:

Materials > Chemicals > Industrial Gases > Liquified Gas (0.71)
Materials > Chemicals > Commodity Chemicals > Petrochemicals > LNG (0.71)
Energy > Oil & Gas > Midstream (0.71)
Energy > Oil & Gas > Downstream (0.71)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.33)

Add feedback

Reviews: Lifelong Inverse Reinforcement Learning

Neural Information Processing SystemsOct-7-2024, 07:24:06 GMT

Summary: This paper considers the problem of lifelong inverse reinforcement learning, where the goal is to learn a set of reward functions (from demonstrations) that can be applied to a series of tasks. The authors propose to do this by learning and continuously updating a shared latent space of reward components, which are combined with task specific coefficients to reconstruct the reward for a particular task. The derivation of the algorithm basically mirrors the Efficient Lifelong Learning Algorithm (ELLA) (citation [33]). Although ELLA was formulated for supervised learning, variants such as PG-ELLA (not cited in this paper, by Ammar et al. "Online Multi-task Learning for Policy Gradient Methods") have applied the same derivation procedure to extend the original ELLA algorithm to the reinforcement learning setting. This paper is another extension of ELLA, to the inverse reinforcement learning setting, where instead of sharing policies via a latent space, they are sharing reward functions.

lifelong inverse reinforcement learning, reward function, transition model, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Lifelong Policy Gradient Learning of Factored Policies for Faster Training Without Forgetting

Mendez, Jorge A., Wang, Boyu, Eaton, Eric

arXiv.org Artificial IntelligenceJul-14-2020

Policy gradient methods have shown success in learning control policies for high-dimensional dynamical systems. Their biggest downside is the amount of exploration they require before yielding high-performing policies. In a lifelong learning setting, in which an agent is faced with multiple consecutive tasks over its lifetime, reusing information from previously seen tasks can substantially accelerate the learning of new tasks. We provide a novel method for lifelong policy gradient learning that trains lifelong function approximators directly via policy gradients, allowing the agent to benefit from accumulated knowledge throughout the entire training process. We show empirically that our algorithm learns faster and converges to better policies than single-task and lifelong learning baselines, and completely avoids catastrophic forgetting on a variety of challenging domains.

artificial intelligence, lpg-ftw, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2007.07011

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > Pennsylvania (0.04)
North America > United States > New York > New York County > New York City (0.04)
(2 more...)

Genre: Research Report (0.70)

Industry: Education > Educational Setting (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Scalable Multitask Policy Gradient Reinforcement Learning

Bsat, Salam El (Rafik Hariri University) | Ammar, Haitham Bou (American University of Beirut) | Taylor, Matthew E. (Washington State University)

AAAI ConferencesFeb-14-2017

Policy search reinforcement learning (RL) allows agents to learn autonomously with limited feedback. However, such methods typically require extensive experience for successful behavior due to their tabula rasa nature. Multitask RL is an approach, which aims to reduce data requirements by allowing knowledge transfer between tasks. Although successful, current multitask learning methods suffer from scalability issues when considering large number of tasks. The main reasons behind this limitation is the reliance on centralized solutions. This paper proposes to a novel distributed multitask RL framework, improving the scalability across many different types of tasks. Our framework maps multitask RL to an instance of general consensus and develops an efficient decentralized solver. We justify the correctness of the algorithm both theoretically and empirically: we first proof an improvement of convergence speed to an order of O(1/k) with k being the number of iterations, and then show our algorithm surpassing others on multiple dynamical system benchmarks.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

AAAI Conferences

Thirty-First AAAI Conference on Artificial Intelligence

Country: North America > United States (0.28)

Industry: Health & Medicine (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.85)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)

Add feedback