AITopics | Kielce

Collaborating Authors

Kielce

Exploring Explanations Improves the Robustness of In-Context Learning

arXiv.org Artificial IntelligenceJun-4-2025

In-context learning (ICL) has emerged as a successful paradigm for leveraging large language models (LLMs). However, it often struggles to generalize beyond the distribution of the provided demonstrations. A recent advancement in enhancing robustness is ICL with explanations (X-ICL), which improves prediction reliability by guiding LLMs to understand and articulate the reasoning behind correct labels. Building on this approach, we introduce an advanced framework that extends X-ICL by systematically exploring explanations for all possible labels (X$^2$-ICL), thereby enabling more comprehensive and robust decision-making. Experimental results on multiple natural language understanding datasets validate the effectiveness of X$^2$-ICL, demonstrating significantly improved robustness to out-of-distribution data compared to the existing ICL approaches.

computational linguistic, large language model, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2506.02378

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > Canada > Ontario > Toronto (0.04)
(18 more...)

Genre: Research Report > New Finding (0.93)

Industry:

Media > Film (1.00)
Leisure & Entertainment > Sports > Football (1.00)
Media > Music (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Beacon: A Naturalistic Driving Dataset During Blackouts for Benchmarking Traffic Reconstruction and Control

Sarker, Supriya, Islam, Iftekharul, Poudel, Bibek, Li, Weizi

arXiv.org Artificial IntelligenceDec-17-2024

Extreme weather events and other vulnerabilities are causing blackouts with increasing frequency, disrupting traffic control systems and posing significant challenges to urban mobility. To address this growing concern, we introduce \model{}, a naturalistic driving dataset collected during blackouts at complex intersections. Beacon provides detailed traffic data from two unsignalized intersections in Memphis, TN, including timesteps, origin, and destination lanes for each vehicle over four hours. We analyze traffic demand, vehicle trajectories, and density across different scenarios. We also use the dataset to reconstruct unsignalized, signalized and mixed traffic conditions, demonstrating its utility for benchmarking traffic reconstruction techniques and control methods. To the best of our knowledge, Beacon could be the first public available traffic dataset that captures naturalistic driving behaviors at complex intersections.

artificial intelligence, intersection, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2412.14208

Country:

North America > United States > Tennessee > Shelby County > Memphis (0.25)
North America > United States > Tennessee > Knox County > Knoxville (0.14)
North America > United States > California (0.04)
(7 more...)

Genre: Research Report > New Finding (0.46)

Industry:

Transportation > Infrastructure & Services (1.00)
Transportation > Ground > Road (1.00)
Energy (0.95)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(2 more...)

Add feedback

Make a Choice! Knowledge Base Question Answering with In-Context Learning

Tan, Chuanyuan, Chen, Yuehe, Shao, Wenbiao, Chen, Wenliang

arXiv.org Artificial IntelligenceMay-23-2023

Question answering over knowledge bases (KBQA) aims to answer factoid questions with a given knowledge base (KB). Due to the large scale of KB, annotated data is impossible to cover all fact schemas in KB, which poses a challenge to the generalization ability of methods that require a sufficient amount of annotated data. Recently, LLMs have shown strong few-shot performance in many NLP tasks. We expect LLM can help existing methods improve their generalization ability, especially in low-resource situations. In this paper, we present McL-KBQA, a framework that incorporates the few-shot ability of LLM into the KBQA method via ICL-based multiple choice and then improves the effectiveness of the QA tasks. Experimental results on two KBQA datasets demonstrate the competitive performance of McL-KBQA with strong improvements in generalization. We expect to explore a new way to QA tasks from KBQA in conjunction with LLM, how to generate answers normatively and correctly with strong generalization.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2305.13972

Country:

North America > United States > Missouri > Jackson County > Kansas City (0.05)
Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)
Asia > China (0.04)
(7 more...)

Genre: Research Report (0.64)

Industry: Education (0.36)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Keyword Extraction from Short Texts with a Text-To-Text Transfer Transformer

Pęzik, Piotr, Mikołajczyk-Bareła, Agnieszka, Wawrzyński, Adam, Nitoń, Bartłomiej, Ogrodniczuk, Maciej

arXiv.org Artificial IntelligenceOct-17-2022

The paper explores the relevance of the Text-To-Text Transfer Transformer language model (T5) for Polish (plT5) to the task of intrinsic and extrinsic keyword extraction from short text passages. The evaluation is carried out on the new Polish Open Science Metadata Corpus (POSMAC), which is released with this paper: a collection of 216,214 abstracts of scientific publications compiled in the CURLICAT project. We compare the results obtained by four different methods, i.e. plT5kw, extremeText, TermoPL, KeyBERT and conclude that the plT5kw model yields particularly promising results for both frequent and sparsely represented keywords. Furthermore, a plT5kw keyword generation model trained on the POSMAC also seems to produce highly useful results in cross-domain text labelling scenarios. We discuss the performance of the model on news stories and phone-based dialog transcripts which represent text genres and domains extrinsic to the dataset of scientific abstracts. Finally, we also attempt to characterize the challenges of evaluating a text-to-text model on both intrinsic and extrinsic keyword extraction.

artificial intelligence, information retrieval, natural language, (13 more...)

arXiv.org Artificial Intelligence

2209.14008

Country:

Europe > Germany (0.14)
Europe > Poland > Świętokrzyskie Province > Kielce (0.04)
Europe > Italy > Sicily (0.04)
(10 more...)

Genre: Research Report (1.00)

Industry: Media > News (0.35)

Technology: Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.94)

Add feedback