AITopics | rar

Collaborating Authors

rar

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Thinking in Character: Advancing Role-playing Agents with Role-Aware Reasoning

Neural Information Processing SystemsJun-21-2026, 14:08:03 GMT

The advancement of Large Language Models (LLMs) has spurred significant interest in Role-Playing Agents (RPAs) for applications such as emotional companionship and virtual interaction. However, recent RPAs are often built on explicit dialogue data, lacking deep, human-like internal thought processes, resulting in superficial knowledge and style expression. While Large Reasoning Models (LRMs) can be employed to simulate character thought, their direct application is hindered by attention diversion (i.e., RPAs forget their role) and style drift (i.e., overly formal and rigid reasoning rather than character-consistent reasoning). To address these challenges, this paper introduces a novel Role-Aware Reasoning (RAR) method, which consists of two important stages: Role Identity Activation (RIA) and Reasoning Style Optimization (RSO). RIA explicitly guides the model with character profiles during reasoning to counteract attention diversion, and then RSO aligns reasoning style with the character and scene via LRM distillation to mitigate style drift. Extensive experiments demonstrate that the proposed RAR significantly enhances the performance of RPAs by effectively addressing attention diversion and style drift.

arxiv preprint arxiv, large language model, machine learning, (19 more...)

Neural Information Processing Systems

Country:

Asia > China (0.46)
North America > United States (0.28)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Media (0.67)
Information Technology (0.46)
Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
(2 more...)

Add feedback

GradiVeQ: Vector Quantization for Bandwidth-Efficient Gradient Aggregation in Distributed CNN Training

Mingchao Yu, Zhifeng Lin, Krishna Narra, Songze Li, Youjie Li, Nam Sung Kim, Alexander Schwing, Murali Annavaram, Salman Avestimehr

Neural Information Processing SystemsFeb-14-2026, 16:01:21 GMT

Neural Information Processing Systems http://nips.cc/

gradient, gradiv eq, iteration, (11 more...)

Neural Information Processing Systems

Country:

North America > United States > California (0.14)
North America > United States > Illinois (0.04)
North America > Canada > Quebec > Montreal (0.04)
North America > Canada > Ontario > Toronto (0.04)

Industry: Government > Regional Government (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Add feedback

Retrospective Adversarial Replay for Continual Learning

Neural Information Processing SystemsFeb-11-2026, 13:07:03 GMT

To avoid these problems, this paper proposes a method, "Retrospective Adversarial

artificial intelligence, continual learning, machine learning, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > Maryland (0.04)
Europe > France (0.04)

Industry:

Health & Medicine (0.68)
Information Technology > Security & Privacy (0.68)
Education > Educational Setting > Online (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)

Add feedback

isanunbiasedstochasticgradientdescentupdateruleforthefollowingempiricalrisk: R(θ) = X

Neural Information Processing SystemsFeb-9-2026, 08:15:19 GMT

This section contains the theoretical analysis of the loss functions of offline experience replay (Proposition 2),augmented experience replay (Proposition 3),andonline experience replay with reservoirsampling(Proposition1). For all experiments, we use the learning rate of 0.1 following the same setting as in Aljundi et al. [2019], Shimetal.[2021], This paper uses Randaugment [Cubuk et al., 2020], which is an auto augmentation method. It randomly selectsP augmentation operators from a set of 14 operators and applies them to the images. ToapplyBPGintheOCLenvironment,weproposeto determine the better/worse action set based on the feedback in the form of current memory batch accuracyAM,which reflects the memory overfitting level of the CL agent.

artificial intelligence, iter, machine learning, (15 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.70)

Add feedback

5ebbbac62b968254093023f1c95015d3-Paper-Conference.pdf

Neural Information Processing SystemsFeb-9-2026, 08:15:16 GMT

augmentation, continual learning, rehearsal, (14 more...)

Neural Information Processing Systems

Country:

Oceania > New Zealand > North Island > Waikato (0.05)
Europe > France > Île-de-France > Paris > Paris (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Education > Educational Setting > Online (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

234b941e88b755b7a72a1c1dd5022f30-Supplemental.pdf

Neural Information Processing SystemsFeb-7-2026, 19:45:08 GMT

That is,we optimize for theα and β hyperparameters while fixing theσ to a negligible amount (σ = 2 32 specifically).

artificial intelligence, machine learning, seefiguredescriptionabove, (17 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.52)

Add feedback

Retrospective Adversarial Replay for Continual Learning

Neural Information Processing SystemsDec-25-2025, 02:03:16 GMT

Continual learning is an emerging research challenge in machine learning that addresses the problem where models quickly fit the most recently trained-on data but suffer from catastrophic forgetting of previous data due to distribution shifts --- it does this by maintaining a small historical replay buffer in replay-based methods. To avoid these problems, this paper proposes a method, ``Retrospective Adversarial Replay (RAR)'', that synthesizes adversarial samples near the forgetting boundary. RAR perturbs a buffered sample towards its nearest neighbor drawn from the current task in a latent representation space. By replaying such samples, we are able to refine the boundary between previous and current tasks, hence combating forgetting and reducing bias towards the current task. To mitigate the severity of a small replay buffer, we develop a novel MixUp-based strategy to increase replay variation by replaying mixed augmentations. Combined with RAR, this achieves a holistic framework that helps to alleviate catastrophic forgetting. We show that this excels on broadly-used benchmarks and outperforms other continual learning baselines especially when only a small buffer is available. We conduct a thorough ablation study over each key component as well as a hyperparameter sensitivity analysis to demonstrate the effectiveness and robustness of RAR.

continual learning, name change, retrospective adversarial replay, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.40)

Add feedback

GradiVeQ: Vector Quantization for Bandwidth-Efficient Gradient Aggregation in Distributed CNN Training

Mingchao Yu, Zhifeng Lin, Krishna Narra, Songze Li, Youjie Li, Nam Sung Kim, Alexander Schwing, Murali Annavaram, Salman Avestimehr

Neural Information Processing SystemsNov-20-2025, 20:11:32 GMT

Data parallelism can boost the training speed of convolutional neural networks (CNN), but could suffer from significant communication costs caused by gradient aggregation.

gradient, gradiv eq, iteration, (11 more...)

Neural Information Processing Systems

Country: