OASST-ETC Dataset: Alignment Signals from Eye-tracking Analysis of LLM Responses
Lopez-Cardona, Angela, Idesis, Sebastian, Barreda-Ángeles, Miguel, Abadal, Sergi, Arapakis, Ioannis
–arXiv.org Artificial Intelligence
While Large Language Models (LLMs) have significantly advanced natural language processing, aligning them with human preferences remains an open challenge. Although current alignment methods rely primarily on explicit feedback, eye-tracking (ET) data offers insights into real-time cognitive processing during reading. In this paper, we present OASST-ETC, a novel eye-tracking corpus capturing reading patterns from 24 participants, while evaluating LLM-generated responses from the OASST1 dataset. Our analysis reveals distinct reading patterns between preferred and non-preferred responses, which we compare with synthetic eye-tracking data. Furthermore, we examine the correlation between human reading measures and attention patterns from various transformer-based models, discovering stronger correlations in preferred responses. This work introduces a unique resource for studying human cognitive processing in LLM evaluation and suggests promising directions for incorporating eye-tracking data into alignment methods. The dataset and analysis code are publicly available.
arXiv.org Artificial Intelligence
Mar-13-2025
- Country:
- South America
- Brazil (0.04)
- Paraguay > Asunción
- Asunción (0.04)
- Colombia > Meta Department
- Villavicencio (0.04)
- Oceania > Australia
- North America > United States
- Virginia (0.04)
- New York > New York County
- New York City (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Europe
- Austria > Vienna (0.14)
- Switzerland > Zürich
- Zürich (0.04)
- Italy > Piedmont
- Turin Province > Turin (0.04)
- Germany
- Berlin (0.04)
- Baden-Württemberg > Tübingen Region
- Tübingen (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Spain
- Catalonia > Barcelona Province
- Barcelona (0.04)
- Aragón > Zaragoza Province
- Zaragoza (0.04)
- Catalonia > Barcelona Province
- Denmark > Capital Region
- Copenhagen (0.04)
- Croatia > Dubrovnik-Neretva County
- Dubrovnik (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- United Kingdom
- Scotland > City of Glasgow
- Glasgow (0.04)
- England > Oxfordshire
- Oxford (0.04)
- Scotland > City of Glasgow
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia
- South America
- Genre:
- Research Report
- New Finding (1.00)
- Experimental Study (1.00)
- Research Report
- Industry:
- Health & Medicine (0.67)
- Information Technology > Security & Privacy (0.46)
- Technology: