AITopics | Personal

Collaborating Authors

Personal

MAGIC: A Multi-Hop and Graph-Based Benchmark for Inter-Context Conflicts in Retrieval-Augmented Generation

arXiv.org Artificial IntelligenceOct-10-2025

Knowledge conflict often arises in retrieval-augmented generation (RAG) systems, where retrieved documents may be inconsistent with one another or contradict the model's parametric knowledge. Existing benchmarks for investigating the phenomenon have notable limitations, including a narrow focus on the question answering setup, heavy reliance on entity substitution techniques, and a restricted range of conflict types. To address these issues, we propose a knowledge graph (KG)-based framework that generates varied and subtle conflicts between two similar yet distinct contexts, while ensuring interpretability through the explicit relational structure of KGs. Experimental results on our benchmark, MAGIC, provide intriguing insights into the inner workings of LLMs regarding knowledge conflict: both open-source and proprietary models struggle with conflict detection -- especially when multi-hop reasoning is required -- and often fail to pinpoint the exact source of contradictions. Finally, we present in-depth analyses that serve as a foundation for improving LLMs in integrating diverse, sometimes even conflicting, information.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2507.21544

Country:

Asia (1.00)
Africa (0.97)
North America > United States (0.93)
Europe > United Kingdom > England (0.46)

Genre:

Research Report > New Finding (1.00)
Personal (0.94)

Industry:

Government (0.46)
Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

67496dfa96afddab795530cc7c69b57a-Supplemental-Conference.pdf

Neural Information Processing SystemsOct-9-2025, 15:57:39 GMT

baseline, higher corpus level diversity, intensity, (13 more...)

Neural Information Processing Systems

Country:

Asia > Myanmar (0.14)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
Asia > Middle East > Israel (0.04)
(37 more...)

Genre:

Personal (1.00)
Research Report (0.67)

Industry:

Leisure & Entertainment > Sports > Soccer (1.00)
Leisure & Entertainment > Sports > Football (1.00)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
(6 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Security & Privacy (0.70)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback

Policy design for two-sided platforms with participation dynamics: Interview with Haruka Kiyohara

AIHubOct-9-2025, 09:41:12 GMT

In their paper Policy Design for Two-sided Platforms with Participation Dynamics, which was presented at ICML 2025, and investigated the the participation dynamics in two-sided markets. In this interview, Haruka tells us more about such two-sided platforms, the main contributions of the work, and the experiments carried out to test the method. What is the topic of the research in your paper and why is it an interesting area for study? Our paper studied the long-term impacts of decision-making algorithms on two-sided platforms like e-commerce or music streaming applications. In two-sided platforms, multiple stakeholders, such as viewers and content creators, are involved.

algorithm, participation dynamic, two-sided platform, (12 more...)

AIHub

Country: Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.05)

Genre: Personal > Interview (0.35)

Industry:

Leisure & Entertainment (0.49)
Energy (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.71)
Information Technology > Communications > Social Media (0.49)

Add feedback

A API Details

Neural Information Processing SystemsOct-9-2025, 08:52:42 GMT

API calls for each position identified in a piece of text. Question Answering We use the Atlas model of Izacard et al. (2022) finetuned on Natural Questions Calculator Our calculator is based on a simple Python script and only supports the operators " It does not return any result for syntactically invalid equations. "=", "equals", "equal to", "total of", "average of" followed by a number, or (iii) contain at least three English text before generating API calls. Below, we list the prompts used to sample API calls for each tool considered. Your task is to add calls to a Question Answering API to a piece of text. Input: Joe Biden was born in Scranton, Pennsylvania. Output: Joe Biden was born in [QA("Where was Joe Biden born?")] Scranton, [QA("In Output: Coca-Cola, or [QA("What other name is Coca-Cola known by?")] Coke, is Your task is to add calls to a Calculator API to a piece of text.

artificial intelligence, machine learning, natural language, (21 more...)

Neural Information Processing Systems

Country:

North America > United States > Pennsylvania > Lackawanna County > Scranton (0.24)
Africa > Ghana (0.05)
Asia > China > Jiangsu Province > Nanjing (0.04)
Europe > United Kingdom (0.04)

Genre: Personal (0.54)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Leisure & Entertainment (0.68)
Health & Medicine (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.51)

Add feedback

China honing abilities for a possible future attack, Taiwan warns

The Japan TimesOct-9-2025, 05:00:00 GMT

A China Coast Guard vessel is seen on a giant screen showing news footage about the coast guard's law enforcement patrols in waters around Taiwan, outside a shopping mall in Beijing on April 1. | REUTERS TAIPEI - China is increasing military activities near Taiwan and honing its ability to stage a surprise attack, as well as seeking to undermine trust in the government with hybrid online warfare tactics, the island's defense ministry said on Thursday. Democratically-governed Taiwan, which China views as its own territory, has faced increased military pressure from Beijing over the past five years, including at least seven rounds of major war games around the island since 2022. China has been using artificial intelligence tools to weaken Taiwan's cybersecurity and to scan for weak points in critical infrastructure, the defense ministry said in a report released every two years. Beijing is also using hybrid warfare to weaken people's trust in the government and support for defense spending, and stepping up grey zone harassment, it added, referring to non-combat operations such as coast guard patrols designed to pressure Taiwan. Through both conventional and unconventional military actions, it aims to test its capabilities for attacking Taiwan and confronting foreign forces, the ministry said.

china, ministry, taiwan, (12 more...)

The Japan Times

Country:

Asia > China > Beijing > Beijing (0.68)
Asia > Taiwan > Taiwan Province > Taipei (0.25)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.06)
(5 more...)

Genre: Personal > Honors (0.85)

Industry:

Government > Military (1.00)
Government > Regional Government > Asia Government (0.30)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence (1.00)

Add feedback

Large Language Model as Attributed Training Data Generator: A T ale of Diversity and Bias Yue Y u

Neural Information Processing SystemsOct-9-2025, 04:43:35 GMT

Large language models (LLMs) have been recently leveraged as training data generators for various natural language processing (NLP) tasks. While previous research has explored different approaches to training models using generated data, they generally rely on simple class-conditional prompts, which may limit the diversity of the generated data and inherit systematic biases of LLM. Thus, we investigate training data generation with diversely attributed prompts (e.g.,

large language model, machine learning, natural language, (20 more...)

Neural Information Processing Systems

Country:

North America > United States > District of Columbia > Washington (0.04)
North America > Mexico (0.04)
Africa (0.04)
(6 more...)

Genre:

Research Report > New Finding (0.92)
Personal (0.67)

Industry:

Media > Film (1.00)
Leisure & Entertainment > Games > Computer Games (1.00)
Law (1.00)
(14 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

DICES Dataset: Supplementary Material

Neural Information Processing SystemsOct-9-2025, 03:53:11 GMT

artificial intelligence, machine learning, rater, (18 more...)

Neural Information Processing Systems

Country: North America > United States > California > Santa Clara County (0.04)

Genre:

Research Report > Experimental Study (0.98)
Research Report > New Finding (0.70)
Personal (0.68)

Industry:

Law (1.00)
Information Technology > Security & Privacy (0.69)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.69)
Information Technology > Security & Privacy (0.69)

Add feedback

Energy firms snap up weather services for trading edge in Japan

The Japan TimesOct-9-2025, 03:03:00 GMT

Power traders are fueling a boom in weather data, which helps them to anticipate sudden price swings. Weather forecasters are finding a lucrative niche in Japan's power-trading boom, selling hyper-specialized data to firms seeking an edge in one of the world's most volatile electricity markets. Weathernews is among a handful of companies cashing in on demand for meteorological data. The Tokyo-listed company's shares have surged 50% in the last year as investors bet on its expanded use of artificial intelligence, among other factors. The firm says it's supplying -- or is in talks to provide -- data to several dozen power traders, about a third of which are based outside Japan.

crime & legal science, trading edge, weather service, (10 more...)

The Japan Times

Country:

North America > United States (0.52)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.30)
Asia > Philippines (0.05)
(4 more...)

Genre: Personal > Honors (0.90)

Industry:

Energy > Power Industry (0.72)
Government > Regional Government > North America Government > United States Government (0.41)

Technology:

Information Technology > Communications > Social Media (0.78)
Information Technology > Artificial Intelligence (0.74)

Add feedback

91edff07232fb1b55a505a9e9f6c0ff3-Supplemental-Conference.pdf

Neural Information Processing SystemsOct-9-2025, 01:30:47 GMT

large language model, machine learning, reinforcement learning, (19 more...)

Neural Information Processing Systems

Country: Oceania > Australia (0.04)

Genre:

Personal (0.46)
Instructional Material (0.46)

Industry: Leisure & Entertainment (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.67)

Add feedback

ELF-R

Neural Information Processing SystemsOct-9-2025, 01:30:43 GMT

This mostly includes tasks with multifaceted objectives, such as dialogue response generation, or tasks with hard-to-define goals, such as enhancing program readability.

large language model, machine learning, reinforcement learning, (18 more...)

Neural Information Processing Systems

Country: