Parachute: Evaluating Interactive Human-LM Co-writing Systems

Mar-24-2023–arXiv.org Artificial Intelligence

A surge of advances in language models (LMs) has led to significant interest in using LMs to build co-writing systems, in which humans and LMs interactively contribute to a shared writing artifact. However, there is a lack of studies assessing co-writing systems in interactive settings. We propose a human-centered evaluation framework, Parachute, for interactive co-writing systems. Parachute showcases an integrative view of interaction evaluation, where each evaluation aspect consists of categorized practical metrics. Furthermore, we present Parachute with a use case to demonstrate how to evaluate and compare co-writing systems using Parachute.

artificial intelligence, natural language, parachute, (16 more...)

arXiv.org Artificial Intelligence

Mar-24-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Pennsylvania > Allegheny County > Pittsburgh (0.04)
- Europe > Germany
  - Hamburg (0.05)
- Asia > Middle East
  - Jordan (0.04)

Genre:
- Research Report (0.50)
- Questionnaire & Opinion Survey (0.48)

Industry:
- Transportation > Air (1.00)

Technology:
- Information Technology
  - Artificial Intelligence > Natural Language (1.00)
  - Human Computer Interaction (0.95)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found