Beyond Reactivity: Measuring Proactive Problem Solving in LLM Agents

Pasternak, Gil, Rajagopal, Dheeraj, White, Julia, Atreja, Dhruv, Thomas, Matthew, Hurn-Maloney, George, Lewis, Ash

arXiv.org Artificial Intelligence 

From these personas, we synthetically construct comprehensive world models that encode: Workplace hierarchy and relationship context Work patterns and communication styles Available action space A with corresponding parameter spaces P Pain points and operational constraints For instance, given a senior account manager with 20 years of client-facing experience as shown in figure 2, the world model might identify "client documentation upkeep" as a pain point, while also modeling specific client relationships and their respective engagement contexts. Bottleneck Generation: Using the contextualized world model, we generate bottleneck b: a persona-relevant, actionable user-need that satisfies our formal definition (see Section 2). Each bottleneck b is designed to be identifiable through evidence T in the document set D and resolvable through exactly one action a A. User Datastore: For each sample S, we construct the document set D = T K. The True positives T - documents where f(d) = 1 - collectively provide sufficient evidence to identify bottleneck b. Distractors K are documents where f(d) = 0, introducing realistic noise with respect to the bottleneck. In our current datastore setup, all the generated documents are either emails, calendar events, or text documents, as exemplified in Figures 1 and 2. To mirror real-world complexity, we employ two key design principles: (i) Evidence distribution: We often distribute evidence for b across multiple documents in T, requiring agents to synthesize information from t different sources.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found