AITopics | original passage

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question "What is Artificial Intelligence?" and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

What Has Been Lost with Synthetic Evaluation?

Gill, Alexander, Ravichander, Abhilasha, Marasović, Ana

arXiv.org Artificial Intelligence, Oct-7-2025

Large language models (LLMs) are increasingly used for data generation. However, creating evaluation benchmarks raises the bar for this emerging paradigm. Benchmarks must target specific phenomena, penalize exploiting shortcuts, and be challenging. Through two case studies, we investigate whether LLMs can meet these demands by generating reasoning-over-text benchmarks and comparing them to those created through careful crowdsourcing. Specifically, we evaluate both the validity and difficulty of LLM-generated versions of two high-quality reading comprehension datasets: CondaQA, which evaluates reasoning about negation, and DROP, which targets reasoning about quantities. We find that prompting LLMs can produce variants of these datasets that are often valid according to the annotation guidelines, at a fraction of the cost of the original crowdsourcing effort. However, we show that they are less challenging for LLMs than their human-authored counterparts. This finding sheds light on what may have been lost by generating evaluation data with LLMs, and calls for critically reassessing the immediate use of this increasingly prevalent approach to benchmark creation.

computational linguistic, large language model, machine learning, (21 more...)

arXiv.org Artificial Intelligence

2505.2283

Country:

Europe > United Kingdom (1.00)
Asia > Middle East > UAE (0.46)
North America > United States > New Mexico (0.28)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.93)

Industry:

Leisure & Entertainment > Sports > Football (1.00)
Government > Regional Government > Europe Government > United Kingdom Government (1.00)
Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

PrismRAG: Boosting RAG Factuality with Distractor Resilience and Strategized Reasoning

Kachuee, Mohammad, Gollapudi, Teja, Kim, Minseok, Huang, Yin, Sun, Kai, Yang, Xiao, Wang, Jiaqi, Shah, Nirav, Liu, Yue, Colak, Aaron, Kumar, Anuj, Yih, Wen-tau, Dong, Xin Luna

arXiv.org Artificial Intelligence, Jul-28-2025

Retrieval-augmented generation (RAG) often falls short when retrieved context includes confusing semi-relevant passages, or when answering questions requires deep contextual understanding and reasoning. We propose an efficient fine-tuning framework, called PrismRAG, that (i) trains the model with distractor-aware QA pairs mixing gold evidence with subtle distractor passages, and (ii) instills reasoning-centric habits that make the LLM plan, rationalize, and synthesize without relying on extensive human-engineered instructions. Evaluated across 12 open-book RAG QA benchmarks spanning diverse application domains and scenarios, PrismRAG improves average factuality by 5.4%, outperforming state-of-the-art solutions.
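
As a rough illustration of what a "distractor-aware QA pair" could look like, the sketch below mixes one gold passage with a few distractor passages and shuffles them into a single context. This is not the PrismRAG pipeline: the names `RagExample` and `build_distractor_example`, the parameter `k`, and the fixed seed are all hypothetical, and the sketch assumes the distractors have already been collected and differ from the gold passage.

```python
import random
from dataclasses import dataclass


@dataclass
class RagExample:
    """One training example (hypothetical structure, not from the paper)."""
    question: str
    answer: str
    passages: list[str]  # gold and distractor passages, shuffled together
    gold_index: int      # position of the gold passage after shuffling


def build_distractor_example(question, answer, gold_passage, distractors, k=3, seed=0):
    """Mix the gold passage with up to k distractors and shuffle the result.

    Assumes every distractor string differs from gold_passage.
    """
    rng = random.Random(seed)  # seeded so the example is reproducible
    chosen = rng.sample(distractors, min(k, len(distractors)))
    passages = [gold_passage] + chosen
    rng.shuffle(passages)
    return RagExample(question, answer, passages, passages.index(gold_passage))
```

Shuffling keeps the gold passage from always appearing first, so a model trained on such examples cannot rely on position alone to find the evidence.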

information, large language model, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2507.18857

Country: North America > United States (1.00)

Genre: Research Report > New Finding (0.67)

Industry:

Media (0.68)
Leisure & Entertainment > Sports (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)

A Linguistic Analysis of Student-Generated Paraphrases

Rus, Vasile (The University of Memphis) | Feng, Shi (The University of Memphis) | Brandon, Russell (The University of Memphis) | Crossley, Scott (Georgia State University) | McNamara, Danielle S. (The University of Memphis)

AAAI Conferences, May-18-2011

Paraphrase identification is a core Natural Language Processing task that involves assessing the semantic similarity of two texts. To foster systematic studies of this task, standardized datasets were created on which various approaches could be compared more fairly. However, a better understanding and more precise operational definition of a paraphrase are needed before any further datasets or systematic evaluations of the task of paraphrase identification are proposed. This study develops the concept of paraphrasing as a writing strategy. Six types of paraphrases are defined through the creation of a relatively large corpus of student-generated paraphrases. These paraphrases are analyzed along several dozen linguistic dimensions ranging from cohesion to lexical diversity. The most significant indices from these dimensions were then used to build a prediction model that could identify true and false paraphrases and each of the six paraphrase types.
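
To make "lexical diversity" and similarity between two texts more concrete, here is a minimal sketch of two common surface-level measures: the type-token ratio of a text and the Jaccard word overlap between an original passage and a paraphrase. These are stand-in measures chosen for illustration; the abstract does not say which indices the study used, and the function names and the simple regex tokenizer are assumptions.

```python
import re


def tokens(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def type_token_ratio(text):
    """Lexical diversity: distinct words divided by total words."""
    toks = tokens(text)
    return len(set(toks)) / len(toks) if toks else 0.0


def jaccard_overlap(a, b):
    """Word overlap between two texts: shared words over all distinct words."""
    sa, sb = set(tokens(a)), set(tokens(b))
    union = sa | sb
    return len(sa & sb) / len(union) if union else 0.0
```

For instance, `type_token_ratio("the cat saw the dog")` is 0.8 (four distinct words out of five), and `jaccard_overlap("the cat sat", "the cat ran")` is 0.5 (two shared words out of four distinct words). Word overlap alone is a weak signal of semantic similarity, since a good paraphrase may share few words with its source.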

original passage, original text, student, (17 more...)

AAAI Conferences

Twenty-Fourth International FLAIRS Conference

Country:

North America > United States > New Jersey > Bergen County > Mahwah (0.04)
North America > United States > Missouri (0.04)
North America > United States > Tennessee > Shelby County > Memphis (0.04)
(4 more...)

Genre: Research Report > New Finding (0.48)

Industry: Education (0.48)

Technology: Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.89)
