AITopics | Snooker

Snooker

Searching for Difficult-to-Translate Test Examples at Scale

Xu, Wenda, Zouhar, Vilém, Riley, Parker, Finkelstein, Mara, Freitag, Markus, Deutsch, Daniel

arXiv.org Artificial Intelligence, Oct-1-2025

NLP models require test data that are sufficiently challenging. The difficulty of an example is linked to the topic it originates from (the "seed topic"). The relationship between a topic and the difficulty of its instances is stochastic: an example about a difficult topic can happen to be easy, and vice versa. At the scale of the Internet, there are tens of thousands of potential topics, and finding the most difficult one by drawing and evaluating a large number of examples across all topics is computationally infeasible. We formalize this task and treat it as a multi-armed bandit problem. In this framework, each topic is an "arm," and pulling an arm (at a cost) involves drawing a single example, evaluating it, and measuring its difficulty. The goal is to efficiently identify the most difficult topics within a fixed computational budget. We illustrate the bandit problem setup of finding difficult examples for the task of machine translation. We find that various bandit strategies vastly outperform baseline methods such as brute-force search for the most challenging topics.
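The bandit framing in the abstract can be illustrated with a small sketch. The abstract does not name the specific strategies evaluated, so the code below uses UCB1, a standard bandit algorithm, purely as an example. The function names (`ucb1_find_hard_topics`, `draw_difficulty`), the simulated topic difficulties, and the noise model are all hypothetical and not taken from the paper.

```python
import math
import random


def ucb1_find_hard_topics(draw_difficulty, num_topics, budget, top_k=1):
    """Spend `budget` pulls across topics with UCB1 (an assumed strategy),
    then return the top_k topics by mean observed difficulty and the pull counts."""
    counts = [0] * num_topics    # number of examples drawn per topic
    totals = [0.0] * num_topics  # summed difficulty scores per topic

    for t in range(budget):
        if t < num_topics:
            arm = t  # pull every arm once before using the UCB index
        else:
            # UCB1 index: empirical mean plus an exploration bonus
            arm = max(
                range(num_topics),
                key=lambda a: totals[a] / counts[a]
                + math.sqrt(2.0 * math.log(t) / counts[a]),
            )
        # One pull: draw a single example from the topic and score its difficulty
        counts[arm] += 1
        totals[arm] += draw_difficulty(arm)

    means = [
        totals[a] / counts[a] if counts[a] else float("-inf")
        for a in range(num_topics)
    ]
    ranked = sorted(range(num_topics), key=lambda a: means[a], reverse=True)
    return ranked[:top_k], counts


# Simulated environment (hypothetical numbers): each topic has a hidden mean
# difficulty, and each drawn example is a noisy observation of it.
rng = random.Random(0)
true_difficulty = [0.1, 0.2, 0.9, 0.3, 0.15]


def draw_difficulty(topic):
    return true_difficulty[topic] + rng.uniform(-0.1, 0.1)


top, counts = ucb1_find_hard_topics(draw_difficulty, len(true_difficulty), budget=500)
print("hardest topic:", top, "pulls per topic:", counts)
```

In a real setting, `draw_difficulty` would sample a source text from the topic, translate it, and score the translation; here it is only a simulation. The pull counts show how the budget concentrates on the topics that look hardest instead of being spread uniformly.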

artificial intelligence, data mining, natural language, (19 more...)

arXiv.org Artificial Intelligence

2509.26619

Genre: Research Report > Experimental Study (0.46)

Industry:

Leisure & Entertainment > Sports > Snooker (1.00)
Law (1.00)

Technology:

Information Technology > Data Science > Data Mining > Big Data (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

ECLeKTic: a Novel Challenge Set for Evaluation of Cross-Lingual Knowledge Transfer

Goldman, Omer, Shaham, Uri, Malkin, Dan, Eiger, Sivan, Hassidim, Avinatan, Matias, Yossi, Maynez, Joshua, Gilady, Adi Mayrav, Riesa, Jason, Rijhwani, Shruti, Rimell, Laura, Szpektor, Idan, Tsarfaty, Reut, Eyal, Matan

arXiv.org Artificial Intelligence, Mar-3-2025

To achieve equitable performance across languages, multilingual large language models (LLMs) must be able to abstract knowledge beyond the language in which it was acquired. However, the current literature lacks reliable ways to measure LLMs' capability of cross-lingual knowledge transfer. To that end, we present ECLeKTic, a multilingual closed-book QA (CBQA) dataset that Evaluates Cross-Lingual Knowledge Transfer in a simple, black-box manner. We detected information with uneven coverage across languages by controlling for presence and absence of Wikipedia articles in 12 languages. We generated knowledge-seeking questions in a source language, for which the answer appears in a relevant Wikipedia article, and translated them into the other 11 languages, for which the respective Wikipedias lack equivalent articles. Assuming that Wikipedia reflects the prominent knowledge in the LLM's training data, to solve ECLeKTic's CBQA task the model is required to transfer knowledge between languages. Experimenting with 8 LLMs, we show that SOTA models struggle to effectively share knowledge across languages, even if they can predict the answer well for queries in the same language the knowledge was acquired in.

computational linguistic, knowledge, kt ic, (16 more...)

arXiv.org Artificial Intelligence

2502.21228

Country:

Asia > Singapore (0.04)
Pacific Ocean > South Pacific Ocean > Coral Sea (0.04)
Oceania > Australia > Queensland (0.04)
(7 more...)

Genre: Research Report (0.82)

Industry: Leisure & Entertainment > Sports > Snooker (1.00)

Technology:

Information Technology > Knowledge Management (1.00)
Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)

Semantically Aligned Question and Code Generation for Automated Insight Generation

Singha, Ananya, Chopra, Bhavya, Khatry, Anirudh, Gulwani, Sumit, Henley, Austin Z., Le, Vu, Parnin, Chris, Singh, Mukul, Verbruggen, Gust

arXiv.org Artificial Intelligence, Mar-21-2024

Automated insight generation is a common tactic for helping knowledge workers, such as data scientists, to quickly understand the potential value of new and unfamiliar data. Unfortunately, automated insights produced by large language models can generate code that does not correctly correspond (or align) to the insight. In this paper, we leverage the semantic knowledge of large language models to generate targeted and insightful questions about data and the corresponding code to answer those questions. Then, through an empirical study on data from Open-WikiTable, we show that embeddings can be effectively used for filtering out semantically unaligned pairs of question and code. Additionally, we found that generating questions and code together yields more diverse questions.
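The embedding-based filtering mentioned in the abstract might look roughly like the sketch below. The abstract does not say which embedding model, similarity measure, or threshold the authors used, so everything here is an assumption: `toy_embed` is a bag-of-words stand-in for a real embedding model, cosine similarity is an assumed measure, and the `threshold` value and example pairs are invented for illustration.

```python
import math
import re
from collections import Counter


def toy_embed(text):
    """Stand-in for a real embedding model (assumption): lowercase word counts."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(u, v):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(u[k] * v[k] for k in u.keys() & v.keys())
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def filter_aligned(pairs, threshold=0.3):
    """Keep (question, code) pairs whose embeddings are at least `threshold` similar."""
    return [
        (question, code)
        for question, code in pairs
        if cosine(toy_embed(question), toy_embed(code)) >= threshold
    ]


# Hypothetical question/code pairs; the second pair is deliberately mismatched.
pairs = [
    ("What is the average score per player?",
     "df.groupby('player')['score'].mean()"),
    ("Which year had the most wins?",
     "df['country'].nunique()"),
]

kept = filter_aligned(pairs)
for question, code in kept:
    print(question, "->", code)
```

With a learned embedding model in place of `toy_embed`, the same structure applies: embed both sides of each pair, score them, and drop pairs that fall below a threshold tuned on held-out data.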

aligned question and code generation, participant, semantically aligned question, (12 more...)

arXiv.org Artificial Intelligence

2405.01556

Country:

Europe > Portugal > Lisbon > Lisbon (0.05)
Asia > India (0.05)
North America > United States > New York > New York County > New York City (0.04)
(6 more...)

Genre: Questionnaire & Opinion Survey (1.00)

Industry: Leisure & Entertainment > Sports > Snooker (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
