Beyond Reactivity: Measuring Proactive Problem Solving in LLM Agents

Pasternak, Gil, Rajagopal, Dheeraj, White, Julia, Atreja, Dhruv, Thomas, Matthew, Hurn-Maloney, George, Lewis, Ash

Oct-31-2025–arXiv.org Artificial Intelligence

From these personas, we synthetically construct comprehensive world models that encode: Workplace hierarchy and relationship context Work patterns and communication styles Available action space A with corresponding parameter spaces P Pain points and operational constraints For instance, given a senior account manager with 20 years of client-facing experience as shown in figure 2, the world model might identify "client documentation upkeep" as a pain point, while also modeling specific client relationships and their respective engagement contexts. Bottleneck Generation: Using the contextualized world model, we generate bottleneck b: a persona-relevant, actionable user-need that satisfies our formal definition (see Section 2). Each bottleneck b is designed to be identifiable through evidence T in the document set D and resolvable through exactly one action a A. User Datastore: For each sample S, we construct the document set D = T K. The True positives T - documents where f(d) = 1 - collectively provide sufficient evidence to identify bottleneck b. Distractors K are documents where f(d) = 0, introducing realistic noise with respect to the bottleneck. In our current datastore setup, all the generated documents are either emails, calendar events, or text documents, as exemplified in Figures 1 and 2. To mirror real-world complexity, we employ two key design principles: (i) Evidence distribution: We often distribute evidence for b across multiple documents in T, requiring agents to synthesize information from t different sources.

bottleneck, large language model, machine learning, (20 more...)

arXiv.org Artificial Intelligence

Oct-31-2025

arXiv.org PDF

Add feedback

Genre:
- Research Report (1.00)

Industry:
- Information Technology (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Agents (1.00)
  - Natural Language > Large Language Model (1.00)
  - Cognitive Science > Problem Solving (1.00)
  - Machine Learning
    - Neural Networks > Deep Learning (0.50)
    - Performance Analysis > Accuracy (0.34)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found