CLAPNQ: Cohesive Long-form Answers from Passages in Natural Questions for RAG systems
Rosenthal, Sara, Sil, Avirup, Florian, Radu, Roukos, Salim
–arXiv.org Artificial Intelligence
Large (NQ) (Kwiatkowski et al., 2019) and SQuAD (Rajpurkar scale research in this area began with the tasks et al., 2016, 2018) which are just a few of Machine Reading Comprehension (Rajpurkar words. It is grounded on a single gold passage, et al., 2016; Rogers et al., 2023; Fisch et al., in contrast to other long-form question answering 2021), and Information Retrieval (Manning et al., (LFQA) datasets such as ELI5 (Fan et al., 2019) 2008; Voorhees and Harman, 2005; Thakur et al., where gold passages are not available. It is built 2021) and has more recently been come to be from a subset of the highly successful Natural Questions known as Retrieval Augmented Generation (Lewis (Kwiatkowski et al., 2019) dataset for extractive et al., 2021; Guu et al., 2020) which encompasses QA from Wikipedia documents based on users both tasks. The recent popularity of generative real web search queries - specifically, the subset of AI with Large Language models (LLM), such as NQ that has long answers (passages) but no short GPT (Brown et al., 2020), Llama (Touvron et al., extractive answers.
arXiv.org Artificial Intelligence
Apr-2-2024
- Country:
- Asia (1.00)
- Europe > United Kingdom
- England (0.28)
- North America > United States (1.00)
- Oceania > New Zealand
- North Island (0.28)
- Genre:
- Overview (0.67)
- Research Report (0.64)
- Industry:
- Energy > Power Industry (0.47)
- Government
- Leisure & Entertainment (1.00)
- Materials > Metals & Mining (0.46)
- Media (0.67)
- Technology: