ACL-rlg: A Dataset for Reading List Generation
Aubert-Béduchaud, Julien, Boudin, Florian, Daille, Béatrice, Dufour, Richard
–arXiv.org Artificial Intelligence
Familiarizing oneself with a new scientific field and its existing literature can be daunting due to the large amount of available articles. Curated lists of academic references, or reading lists, compiled by experts, offer a structured way to gain a comprehensive overview of a domain or a specific scientific challenge. In this work, we introduce ACL-rlg, the largest open expert-annotated reading list dataset. We also provide multiple baselines for evaluating reading list generation and formally define it as a retrieval task. Our qualitative study highlights the fact that traditional scholarly search engines and indexing methods perform poorly on this task, and GPT-4o, despite showing better results, exhibits signs of potential data contamination.
arXiv.org Artificial Intelligence
Dec-30-2024
- Country:
- Asia > Japan
- Honshū (0.14)
- Europe (1.00)
- North America > United States
- Texas (0.14)
- Asia > Japan
- Genre:
- Instructional Material > Course Syllabus & Notes (0.94)
- Research Report (1.00)
- Technology: