AITopics | edited sentence

Collaborating Authors

edited sentence

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

HalluVerse25: Fine-grained Multilingual Benchmark Dataset for LLM Hallucinations

Abdaljalil, Samir, Kurban, Hasan, Serpedin, Erchin

arXiv.org Artificial IntelligenceMar-10-2025

Large Language Models (LLMs) are increasingly used in various contexts, yet remain prone to generating non-factual content, commonly referred to as "hallucinations". The literature categorizes hallucinations into several types, including entity-level, relation-level, and sentence-level hallucinations. However, existing hallucination datasets often fail to capture fine-grained hallucinations in multilingual settings. In this work, we introduce HalluVerse25, a multilingual LLM hallucination dataset that categorizes fine-grained hallucinations in English, Arabic, and Turkish. Our dataset construction pipeline uses an LLM to inject hallucinations into factual biographical sentences, followed by a rigorous human annotation process to ensure data quality. We evaluate several LLMs on HalluVerse25, providing valuable insights into how proprietary models perform in detecting LLM-generated hallucinations across different contexts.

dataset, edited sentence, hallucination, (14 more...)

arXiv.org Artificial Intelligence

2503.07833

Country:

North America > United States > Texas > Brazos County > College Station (0.14)
Asia > Japan (0.14)
Europe > Austria > Vienna (0.14)
(15 more...)

Genre: Personal > Honors (0.95)

Industry: Government > Regional Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Add feedback

Paraphrasing in Affirmative Terms Improves Negation Understanding

Rezaei, MohammadHossein, Blanco, Eduardo

arXiv.org Artificial IntelligenceJun-11-2024

Negation is a common linguistic phenomenon. Yet language models face challenges with negation in many natural language understanding tasks such as question answering and natural language inference. In this paper, we experiment with seamless strategies that incorporate affirmative interpretations (i.e., paraphrases without negation) to make models more robust against negation. Crucially, our affirmative interpretations are obtained automatically. We show improvements with CondaQA, a large corpus requiring reasoning with negation, and five natural language understanding tasks.

affirmative interpretation, interpretation, negation, (14 more...)

arXiv.org Artificial Intelligence

2406.07492

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.15)
North America > United States > Washington > King County > Seattle (0.14)
North America > Nicaragua > Managua > Managua (0.04)
(18 more...)

Genre: Research Report > New Finding (0.46)

Industry: Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.46)

Add feedback

DEFT: Data Efficient Fine-Tuning for Large Language Models via Unsupervised Core-Set Selection

Das, Devleena, Khetan, Vivek

arXiv.org Artificial IntelligenceNov-15-2023

Recent advances have led to the availability of many pre-trained language models (PLMs); however, a question that remains is how much data is truly needed to fine-tune PLMs for downstream tasks? In this work, we introduce DEFT, a data-efficient fine-tuning framework that leverages unsupervised core-set selection to minimize the amount of data needed to fine-tune PLMs for downstream tasks. We demonstrate the efficacy of our DEFT framework in the context of text-editing LMs, and compare to the state-of-the art text-editing model, CoEDIT. Our quantitative and qualitative results demonstrate that DEFT models are just as accurate as CoEDIT while being finetuned on ~70% less data.

arxiv preprint arxiv, coedit, dataset, (13 more...)

arXiv.org Artificial Intelligence

2310.16776

Country:

Europe > France (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.46)

Add feedback