AITopics | user edit

Collaborating Authors

user edit

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Aligning LLM Agents by Learning Latent Preference from User Edits Ge Gao Alexey T aymanov Eduardo Salinas Paul Mineiro Dipendra Misra Department of Computer Science, Cornell University

Neural Information Processing SystemsFeb-18-2026, 18:03:29 GMT

The inferred user preference descriptions are used to define prompts for generating responses in the future.

large language model, machine learning, natural language, (20 more...)

Neural Information Processing Systems

Country:

North America > Dominican Republic > Puerto Plata > Puerto Plata (0.05)
North America > United States > Maryland (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Atlantic Ocean (0.04)

Genre: Research Report > New Finding (0.46)

Industry:

Media (0.68)
Government (0.67)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.67)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Principled Fine-tuning of LLMs from User-Edits: A Medley of Preference, Supervision, and Reward

Misra, Dipendra, Pacchiano, Aldo, Chi, Ta-Chung, Gao, Ge

arXiv.org Machine LearningJan-28-2026

We study how to fine-tune LLMs using user-edit deployment data consisting of a set of context, an agent's response, and user edits. This deployment data is naturally generated by users in applications such as LLMs-based writing assistants and coding agents. The _natural_ origin of user edits makes it a desired source for adapting and personalizing LLMs. In this setup, there emerges a unification of various feedback types namely preferences, supervised labels, and cost that are typically studied separately in the literature. In this paper, we initiate the theoretical investigation of learning from user edits. We first derive bounds for learning algorithms that learn from each of these feedback types. We prove that these algorithms have different trade-offs depending upon the user, data distribution, and model class. We then propose a simple ensembling procedure to jointly learn from these feedback types. On two domains adapted from Gao et al. 2024, we show our ensembling procedure outperforms these methods that learn from individual feedback. Further, we show that our proposed procedure can robustly adapt to different user-edit distributions at test time.

large language model, machine learning, natural language, (20 more...)

arXiv.org Machine Learning

2601.19055

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.82)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Aligning LLM Agents by Learning Latent Preference from User Edits

Neural Information Processing SystemsDec-27-2025, 14:01:07 GMT

We study interactive learning of language agents based on user edits made to the agent's output. In a typical setting such as writing assistants, the user interacts with a language agent to generate a response given a context, and may optionally edit the agent response to personalize it based on their latent preference, in addition to improving the correctness. The edit feedback is naturally generated, making it a suitable candidate for improving the agent's alignment with the user's preference, and for reducing the cost of user edits over time. We propose a learning framework, PRELUDE that infers a description of the user's latent preference based on historic edit data and using it to define a prompt policy that drives future response generation. This avoids fine-tuning the agent, which is costly, challenging to scale with the number of users, and may even degrade its performance on other tasks.

large language model, machine learning, natural language, (13 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.54)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.42)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.36)

Add feedback

Aligning LLM Agents by Learning Latent Preference from User Edits Ge Gao Alexey T aymanov Eduardo Salinas Paul Mineiro Dipendra Misra Department of Computer Science, Cornell University

Neural Information Processing SystemsOct-10-2025, 21:48:04 GMT

The inferred user preference descriptions are used to define prompts for generating responses in the future.

agent, user edit, user preference, (16 more...)

Neural Information Processing Systems

Country:

North America > Dominican Republic > Puerto Plata > Puerto Plata (0.05)
North America > United States > Maryland (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Atlantic Ocean (0.04)

Genre: Research Report > New Finding (0.46)

Industry:

Media (0.68)
Government (0.67)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.67)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Aligning LLM Agents by Learning Latent Preference from User Edits

Neural Information Processing SystemsMay-27-2025, 21:33:12 GMT

aligning llm agent, learning latent preference, user edit, (7 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.72)

Add feedback

Memory Augmented Cross-encoders for Controllable Personalized Search

Mysore, Sheshera, Dhanania, Garima, Patil, Kishor, Kallumadi, Surya, McCallum, Andrew, Zamani, Hamed

arXiv.org Artificial IntelligenceNov-4-2024

Personalized search represents a problem where retrieval models condition on historical user interaction data in order to improve retrieval results. However, personalization is commonly perceived as opaque and not amenable to control by users. Further, personalization necessarily limits the space of items that users are exposed to. Therefore, prior work notes a tension between personalization and users' ability for discovering novel items. While discovery of novel items in personalization setups may be resolved through search result diversification, these approaches do little to allow user control over personalization. Therefore, in this paper, we introduce an approach for controllable personalized search. Our model, CtrlCE presents a novel cross-encoder model augmented with an editable memory constructed from users historical items. Our proposed memory augmentation allows cross-encoder models to condition on large amounts of historical user data and supports interaction from users permitting control over personalization. Further, controllable personalization for search must account for queries which don't require personalization, and in turn user control. For this, we introduce a calibrated mixing model which determines when personalization is necessary. This allows system designers using CtrlCE to only obtain user input for control when necessary. In multiple datasets of personalized search, we show CtrlCE to result in effective personalization as well as fulfill various key goals for controllable personalized search.

data mining, information retrieval, machine learning, (22 more...)

arXiv.org Artificial Intelligence

2411.0279

Country:

North America > United States > District of Columbia > Washington (0.05)
North America > United States > Massachusetts > Hampshire County > Amherst (0.04)
North America > United States > New York > New York County > New York City (0.04)
(5 more...)

Genre: Research Report (1.00)

Industry: Information Technology > Security & Privacy (0.66)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.68)
(2 more...)

Add feedback

Aligning LLM Agents by Learning Latent Preference from User Edits

Gao, Ge, Taymanov, Alexey, Salinas, Eduardo, Mineiro, Paul, Misra, Dipendra

arXiv.org Artificial IntelligenceJun-9-2024

We study interactive learning of LLM-based language agents based on user edits made to the agent's output. In a typical setting such as writing assistants, the user interacts with a language agent to generate a response given a context, and may optionally edit the agent response to personalize it based on their latent preference, in addition to improving the correctness. The edit feedback is naturally generated, making it a suitable candidate for improving the agent's alignment with the user's preference, and for reducing the cost of user edits over time. We propose a learning framework, PRELUDE that infers a description of the user's latent preference based on historic edit data. The inferred user preference descriptions are used to define prompts for generating responses in the future. This avoids fine-tuning the agent, which is costly, challenging to scale with the number of users, and may even degrade its performance on other tasks. Furthermore, learning descriptive preference improves interpretability, allowing the user to view and modify the learned preference. However, user preference can be complex, subtle, and vary based on context, making it challenging to learn. To address this, we propose a simple yet effective algorithm named CIPHER that leverages the LLM to infer the user preference for a given context based on user edits. In the future, CIPHER retrieves inferred preferences from the k-closest contexts in the history, and forms an aggregate preference for response generation. We introduce two interactive environments -- summarization and email writing, and use a GPT-4 simulated user for evaluation. On both tasks, CIPHER outperforms several baselines by achieving the lowest edit distance cost while only having a small overhead in LLM query cost. Our analysis reports that user preferences learned by CIPHER show significant similarity to the ground truth latent preferences.

agent, user edit, user preference, (15 more...)

arXiv.org Artificial Intelligence

2404.15269

Country:

North America > Dominican Republic > Puerto Plata > Puerto Plata (0.04)
North America > United States > Maryland (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
(2 more...)

Genre: Research Report (1.00)

Industry:

Media (0.68)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.67)
Education (0.66)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback