LM Agents for Coordinating Multi-User Information Gathering
Harsh Jhamtani, Jacob Andreas, Benjamin Van Durme
This paper introduces PeopleJoin, a benchmark for evaluating LM-mediated collaborative problem solving. Given a user request, PeopleJoin agents must identify teammates who might be able to assist, converse with these teammates to gather information, and finally compile a useful answer or summary for the original user. PeopleJoin comprises two evaluation domains: PeopleJoin-QA, focused on questions about tabular data, and PeopleJoin-DocCreation, focused on document creation tasks. The two domains are adapted from existing NLP benchmarks for database question answering and multi-document summarization; here, however, the information needed to complete these tasks is distributed across synthetic "organizations" of 2-20 users, simulating natural multi-user collaboration scenarios. We implement several popular LM agent architectures, evaluate their accuracy and efficiency at completing tasks, and highlight new research questions that can be studied using PeopleJoin.
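To make the described workflow concrete, here is a minimal sketch of what a PeopleJoin-style agent loop could look like: identify potentially helpful teammates, converse with each to gather information, then compile an answer for the original user. All names in the sketch (`Teammate`, `lm_generate`, `select_teammates`, `gather_and_answer`) are illustrative assumptions for exposition, not the benchmark's actual API or the authors' implementation.

```python
from dataclasses import dataclass, field

@dataclass
class Teammate:
    name: str
    # Private knowledge visible only to this user, e.g. a few table rows
    # or a document fragment (assumed representation).
    knowledge: str
    dialogue: list = field(default_factory=list)

def lm_generate(prompt: str) -> str:
    """Placeholder for a call to a language model backend (assumed interface)."""
    raise NotImplementedError("plug in an LM backend here")

def select_teammates(request: str, org: list[Teammate]) -> list[Teammate]:
    """Ask the LM which organization members might hold relevant information."""
    roster = ", ".join(t.name for t in org)
    reply = lm_generate(
        f"User request: {request}\nOrganization members: {roster}\n"
        "List the members most likely to have relevant information."
    )
    return [t for t in org if t.name in reply]

def gather_and_answer(request: str, org: list[Teammate]) -> str:
    """Identify helpers, converse with them, then compile an answer or summary."""
    evidence = []
    for teammate in select_teammates(request, org):
        # One question-answer turn per teammate for simplicity; real agents
        # may hold multi-turn conversations before compiling an answer.
        question = lm_generate(
            f"To answer '{request}', what should we ask {teammate.name}?"
        )
        answer = lm_generate(
            f"You are {teammate.name}. You know: {teammate.knowledge}\n"
            f"Question: {question}"
        )
        teammate.dialogue += [question, answer]
        evidence.append(f"{teammate.name}: {answer}")
    return lm_generate(
        f"Original request: {request}\nGathered information:\n"
        + "\n".join(evidence)
        + "\nCompile a useful answer or summary for the original user."
    )
```

This sketch reflects one plausible pipeline architecture; the paper's point is that the information is split across 2-20 users, so agent designs can differ in whom they contact, how many conversational turns they spend, and how they aggregate the gathered evidence.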
arXiv.org Artificial Intelligence
Feb-17-2025
- Country:
- North America > United States (1.00)
- Genre:
- Research Report > New Finding (0.66)
- Industry:
- Banking & Finance > Economy (1.00)
- Education (0.67)
- Government > Regional Government
- Health & Medicine (1.00)
- Law (0.67)
- Transportation (0.67)