panacea
- North America > United States (0.28)
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
- Health & Medicine > Therapeutic Area > Endocrinology > Diabetes (0.96)
- Government (0.68)
Panacea: Pareto Alignment via Preference Adaptation for LLMs
Conventional alignment methods rely on scalar human preference labels, a convention that tends to oversimplify the multi-dimensional and heterogeneous nature of human preferences, leading to reduced expressivity and even misalignment. This paper presents Panacea, an innovative approach that reframes alignment as a multi-dimensional preference optimization problem. Panacea trains a single model capable of adapting online and Pareto-optimally to diverse sets of preferences without the need for further tuning. A major challenge here is using a low-dimensional preference vector to guide the model's behavior, even though the model is governed by an overwhelmingly large number of parameters. To address this, Panacea uses singular value decomposition (SVD)-based low-rank adaptation, which allows the preference vector to be injected online simply as singular values. Theoretically, we prove that Panacea recovers the entire Pareto front with common loss aggregation methods under mild conditions. Moreover, our experiments demonstrate, for the first time, the feasibility of aligning a single LLM to represent an exponentially vast spectrum of human preferences through various optimization methods. Our work marks a step forward in effectively and efficiently aligning models to diverse and intricate human preferences in a controllable and Pareto-optimal manner.
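The abstract's core mechanism, injecting a preference vector as singular values of an SVD-style low-rank adapter, can be sketched in a few lines. The snippet below is a minimal illustration under assumptions, not the authors' implementation: the module name `PreferenceSVDLoRA`, the rank, the number of preference dimensions, and the scaling factor are all hypothetical.

```python
# Minimal sketch (assumed names/shapes): a frozen linear layer plus a low-rank
# update W0 + U diag(s) V^T, where the trailing entries of s are overwritten
# online with the user's preference vector instead of being learned.
import torch
import torch.nn as nn


class PreferenceSVDLoRA(nn.Module):
    def __init__(self, in_features, out_features, rank=8, pref_dim=2, scaling=1.0):
        super().__init__()
        self.base = nn.Linear(in_features, out_features, bias=False)
        self.base.weight.requires_grad_(False)                # frozen pretrained weight
        self.U = nn.Parameter(torch.randn(out_features, rank) * 0.01)
        self.V = nn.Parameter(torch.randn(in_features, rank) * 0.01)
        self.learned_s = nn.Parameter(torch.zeros(rank - pref_dim))  # learned singular values
        self.pref_dim, self.scaling = pref_dim, scaling
        # The preference vector is set at run time, not trained.
        self.register_buffer("pref", torch.zeros(pref_dim))

    def set_preference(self, pref):
        """Inject a preference vector, e.g. [w_helpful, w_harmless] summing to 1."""
        self.pref.copy_(torch.as_tensor(pref, dtype=self.pref.dtype))

    def forward(self, x):
        s = torch.cat([self.learned_s, self.scaling * self.pref])  # full diagonal
        delta = self.U @ torch.diag(s) @ self.V.t()                # low-rank update
        return x @ (self.base.weight + delta).t()


layer = PreferenceSVDLoRA(16, 16)
layer.set_preference([0.7, 0.3])          # 70% weight on objective 1, 30% on objective 2
print(layer(torch.randn(4, 16)).shape)    # torch.Size([4, 16])
```

In this reading, changing the injected vector moves the adapted weights along the learned low-rank directions, which is how a single model can serve many preference trade-offs without retuning.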
- Asia > China > Hong Kong (0.04)
- South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
- (2 more...)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.98)
- North America > Canada > Alberta (0.14)
- North America > United States > Massachusetts (0.04)
- Europe (0.04)
- Information Technology > Security & Privacy (1.00)
- Health & Medicine > Therapeutic Area > Endocrinology > Diabetes (1.00)
- Government > Military (0.69)
- Government > Regional Government (0.68)
- Information Technology > Security & Privacy (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.70)
- Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.46)
The Panaceas for Improving Low-Rank Decomposition in Communication-Efficient Federated Learning
Li, Shiwei, Luo, Xiandi, Wang, Haozhao, Tang, Xing, Xu, Shijie, Luo, Weihong, Li, Yuhua, He, Xiuqiang, Li, Ruixuan
To improve the training efficiency of federated learning (FL), previous research has employed low-rank decomposition techniques to reduce communication overhead. In this paper, we seek to enhance the performance of these low-rank decomposition methods. Specifically, we focus on three key issues related to decomposition in FL: what to decompose, how to decompose, and how to aggregate. We then introduce three novel techniques, Model Update Decomposition (MUD), Block-wise Kronecker Decomposition (BKD), and Aggregation-Aware Decomposition (AAD), each targeting a specific issue. These techniques are complementary and can be applied simultaneously to achieve optimal performance. Additionally, we provide a rigorous theoretical analysis establishing the convergence of the proposed MUD. Extensive experimental results show that our approach achieves faster convergence and superior accuracy compared to relevant baseline methods. The code is available at https://github.com/Leopold1423/fedmud-icml25.
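To make the "what to decompose" question concrete, here is a minimal sketch of transmitting a truncated-SVD factorization of the model *update* (in the spirit of MUD) rather than the full weights, with the server reconstructing and averaging. The function names, rank, and FedAvg-style aggregation are assumptions for illustration, not the repository's code.

```python
# Sketch (assumed names): client sends low-rank factors of (w_local - w_global);
# server rebuilds each update and averages, so only rank*(m+n) numbers travel
# per matrix instead of m*n.
import numpy as np


def compress_update(w_local, w_global, rank=4):
    """Truncated-SVD factorization of the client's model update."""
    delta = w_local - w_global
    U, s, Vt = np.linalg.svd(delta, full_matrices=False)
    return U[:, :rank] * s[:rank], Vt[:rank, :]


def server_aggregate(w_global, client_factors):
    """Reconstruct each low-rank update, then average (FedAvg-style)."""
    deltas = [A @ B for A, B in client_factors]
    return w_global + np.mean(deltas, axis=0)


rng = np.random.default_rng(0)
w_global = rng.standard_normal((64, 64))
clients = [w_global + 0.01 * rng.standard_normal((64, 64)) for _ in range(3)]
factors = [compress_update(w, w_global) for w in clients]
print(server_aggregate(w_global, factors).shape)  # (64, 64)
```

Reconstructing before averaging avoids the pitfall of averaging factor matrices directly, which is the kind of aggregation mismatch the paper's AAD technique is concerned with.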
- Asia > China > Guangdong Province > Shenzhen (0.04)
- North America > United States > District of Columbia > Washington (0.04)
- North America > Canada (0.04)
- Asia > China > Hubei Province > Wuhan (0.04)
Panacea: Mitigating Harmful Fine-tuning for Large Language Models via Post-fine-tuning Perturbation
Wang, Yibo, Huang, Tiansheng, Shen, Li, Yao, Huanjin, Luo, Haotian, Liu, Rui, Tan, Naiqiang, Huang, Jiaxing, Tao, Dacheng
Harmful fine-tuning attacks introduce significant security risks to fine-tuning services. Mainstream defenses aim to vaccinate the model so that a later harmful fine-tuning attack is less effective. However, our evaluation results show that such defenses are fragile: with a few fine-tuning steps, the model can still learn the harmful knowledge. To this end, we conduct further experiments and find that an embarrassingly simple solution, adding purely random perturbations to the fine-tuned model, can recover the model from harmful behavior, though it degrades the model's fine-tuning performance. To address this degradation, we further propose Panacea, which optimizes an adaptive perturbation that is applied to the model after fine-tuning. Panacea maintains the model's safety alignment without compromising downstream fine-tuning performance. Comprehensive experiments across different harmful ratios, fine-tuning tasks, and mainstream LLMs show that average harmful scores are reduced by up to 21.5% while fine-tuning performance is maintained. As a by-product, we analyze the optimized perturbation and show that different layers in various LLMs have distinct safety coefficients. Source code is available at https://github.com/w-yibo/Panacea
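The "embarrassingly simple solution" referenced in the abstract, adding purely random perturbations to the fine-tuned model, can be illustrated with a short sketch. The noise scale and function name below are assumptions; Panacea itself optimizes an adaptive perturbation rather than drawing one at random.

```python
# Sketch of the random-perturbation baseline (not Panacea's optimized
# perturbation): add i.i.d. Gaussian noise to every parameter of the
# fine-tuned model. `sigma` is an assumed, untuned noise scale.
import torch
import torch.nn as nn


@torch.no_grad()
def perturb_model(model: nn.Module, sigma: float = 0.01, seed: int = 0):
    """Apply a post-fine-tuning random perturbation to all parameters."""
    gen = torch.Generator().manual_seed(seed)
    for param in model.parameters():
        noise = torch.randn(param.shape, generator=gen, dtype=param.dtype)
        param.add_(sigma * noise)
    return model


# Toy usage; in practice the input would be the fine-tuned LLM.
model = nn.Sequential(nn.Linear(8, 8), nn.ReLU(), nn.Linear(8, 2))
perturb_model(model, sigma=0.01)
```

The abstract's layer-wise "safety coefficients" suggest that a single global `sigma` is exactly what Panacea improves on by learning where and how strongly to perturb.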
- North America > Canada > Ontario > Toronto (0.04)
- Asia > Middle East > Republic of Türkiye (0.04)
- Asia > China > Guangdong Province > Shenzhen (0.04)