AITopics | difficult question

Collaborating Authors

difficult question

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Skill-Targeted Adaptive Training

He, Yinghui, Panigrahi, Abhishek, Lin, Yong, Arora, Sanjeev

arXiv.org Artificial IntelligenceOct-14-2025

Language models often show little to no improvement (i.e., "saturation") when trained via vanilla supervised fine-tuning (SFT) on data similar to what they saw in their training set (e.g., MATH). We introduce a new fine-tuning strategy, STAT, to train such a student model by using the metacognition ability of a stronger large language model (LLM) as the teacher. The teacher uses the task dataset to create a list of skills needed for the task, and then labels each data point with its required skills (Didolkar et al., 2024). By monitoring the student's answers, the teacher creates a Missing-Skill-Profile for the student, tracking how often they failed to apply each skill in their responses. We use this idea to build a modified training set in one of two ways. In STAT-Sel, the teacher uses an existing set of training examples but adaptively reweights them according to the Missing-Skill-Profile. In STAT-Syn, the teacher synthesizes additional examples involving missing skills. Across extensive experiments on Llama and Qwen models, our methods yield improvements of up to 7.5% on MATH, whereas SFT provides only limited gains. Furthermore, STAT enhances performance on out-of-distribution benchmarks (e.g., AIME24/25, AMC23, etc.) by an average of 4.6%. Crucially, we find that STAT is complementary to RL via GRPO (Shao et al., 2024): after the model is improved using STAT to address skill gaps, GRPO continues to add further gains. We conclude that skill-targeted adaptive training should broadly improve current training pipelines. Our code is available at: https://github.com/princeton-pli/STAT.

large language model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2510.10023

Genre: Research Report (0.50)

Industry: Education (0.88)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

AdaptMI: Adaptive Skill-based In-context Math Instruction for Small Language Models

He, Yinghui, Panigrahi, Abhishek, Lin, Yong, Arora, Sanjeev

arXiv.org Artificial IntelligenceSep-12-2025

In-context learning (ICL) allows a language model to improve its problem-solving capability when provided with suitable information in context. Since the choice of in-context information can be determined based on the problem itself, in-context learning is analogous to human learning from teachers in a classroom. Recent works (Didolkar et al., 2024a; 2024b) show that ICL performance can be improved by leveraging a frontier large language model's (LLM) ability to predict required skills to solve a problem, popularly referred to as an LLM's metacognition, and using the recommended skills to construct necessary in-context examples. While this skill-based strategy boosts ICL performance in larger models, its gains on small language models (SLMs) have been minimal, highlighting a performance gap in ICL capabilities. We investigate this gap and show that skill-based prompting can hurt SLM performance on easy questions by introducing unnecessary information, akin to cognitive overload. To address this, we introduce AdaptMI, an adaptive approach to selecting skill-based in-context Math Instructions for SLMs. Inspired by cognitive load theory from human pedagogy, our method only introduces skill-based examples when the model performs poorly. We further propose AdaptMI+, which adds examples targeted to the specific skills missing from the model's responses. On 5-shot evaluations across popular math benchmarks and five SLMs (1B--7B; Qwen, Llama), AdaptMI+ improves accuracy by up to 6% over naive skill-based strategies.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2505.00147

Country:

South America > Colombia > Meta Department > Villavicencio (0.04)
Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
(2 more...)

Genre: Research Report (1.00)

Industry: Education > Curriculum (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Add feedback

DistRAG: Towards Distance-Based Spatial Reasoning in LLMs

Schneider, Nicole R, Ramachandran, Nandini, O'Sullivan, Kent, Samet, Hanan

arXiv.org Artificial IntelligenceJun-5-2025

Many real world tasks where Large Language Models (LLMs) can be used require spatial reasoning, like Point of Interest (POI) recommendation and itinerary planning. However, on their own LLMs lack reliable spatial reasoning capabilities, especially about distances. To address this problem, we develop a novel approach, DistRAG, that enables an LLM to retrieve relevant spatial information not explicitly learned during training. Our method encodes the geodesic distances between cities and towns in a graph and retrieves a context subgraph relevant to the question. Using this technique, our method enables an LLM to answer distance-based reasoning questions that it otherwise cannot answer. Given the vast array of possible places an LLM could be asked about, DistRAG offers a flexible first step towards providing a rudimentary `world model' to complement the linguistic knowledge held in LLMs.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2506.03424

Country:

North America > United States (0.50)
Europe (0.47)
Oceania > Australia > New South Wales > Sydney (0.29)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.30)

Add feedback

A Study on Educational Data Analysis and Personalized Feedback Report Generation Based on Tags and ChatGPT

Zhou, Yizhou, Zhang, Mengqiao, Jiang, Yuan-Hao, Gao, Xinyu, Liu, Naijie, Jiang, Bo

arXiv.org Artificial IntelligenceJan-12-2025

This study introduces a novel method that employs tag annotation coupled with the ChatGPT language model to analyze student learning behaviors and generate personalized feedback. Central to this approach is the conversion of complex student data into an extensive set of tags, which are then decoded through tailored prompts to deliver constructive feedback that encourages rather than discourages students. This methodology focuses on accurately feeding student data into large language models and crafting prompts that enhance the constructive nature of feedback. The effectiveness of this approach was validated through surveys conducted with over 20 mathematics teachers, who confirmed the reliability of the generated reports. This method can be seamlessly integrated into intelligent adaptive learning systems or provided as a tool to significantly reduce the workload of teachers, providing accurate and timely feedback to students. By transforming raw educational data into interpretable tags, this method supports the provision of efficient and timely personalized learning feedback that offers constructive suggestions tailored to individual learner needs.

large language model, machine learning, natural language, (14 more...)

arXiv.org Artificial Intelligence

2501.06819

Country: Asia > China (0.30)

Genre: Research Report > Promising Solution (0.34)

Industry:

Education > Educational Setting (1.00)
Education > Curriculum > Subject-Specific Education (0.49)
Education > Educational Technology > Educational Software > Computer Based Training (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)

Add feedback

AI's Influence on Music Is Raising Some Difficult Questions

TIME - TechDec-4-2023, 18:38:03 GMT

Earlier this year, Bad Bunny emphatically rejected rumors that he was about to release a new song with Justin Bieber. "That's fake," he told TIME in an interview for a cover story on his meteoric rise. "You never know what I'm going to do." But last month, a song featuring what sounded like his and Bieber's voices started circulating on TikTok, garnering millions of likes. Bad Bunny hadn't lied in the interview, though: the song was created with AI.

artist, difficult question, music, (14 more...)

TIME - Tech

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Communications > Social Media (0.93)
Information Technology > Artificial Intelligence > Natural Language (0.73)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Add feedback

Continuations by Albert Wenger : Thinking About AI: Part 3 - Existential Risk...

#artificialintelligenceApr-10-2023, 00:15:13 GMT

Now we are getting to the biggest and weirdest risk of AI: a super intelligence emerging and wiping out humanity in pursuit of its own goals. To a lot of people this seems like a totally absurd idea, held only by a tiny fringe of people who appear weird and borderline culty. It seems so far out there and also so huge that most people wind up dismissing it and/or forgetting about shortly after hearing it. There is a big similarity here to the climate crisis, where the more extreme views are widely dismissed. In case you have not encountered the argument yet, let me give a very brief summary (Nick Bostrom has an entire book on the topic and Eliezer Yudkowsky has been blogging about it for two decades, so this will be super compressed by comparison): A superintelligence when it emerges will be pursuing its own set of goals.

artificial general intelligence, existential risk, superintelligence, (14 more...)

#artificialintelligence

Industry: Health & Medicine (0.31)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.70)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.70)

Add feedback

Not Quite 'Ask a Librarian': AI on the Nature, Value, and Future of LIS

Dinneen, Jesse David, Bubinger, Helen

arXiv.org Artificial IntelligenceJul-7-2021

AI language models trained on Web data generate prose that reflects human knowledge and public sentiments, but can also contain novel insights and predictions. We asked the world's best language model, GPT-3, fifteen difficult questions about the nature, value, and future of library and information science (LIS), topics that receive perennial attention from LIS scholars. We present highlights from its 45 different responses, which range from platitudes and caricatures to interesting perspectives and worrisome visions of the future, thus providing an LIS-tailored demonstration of the current performance of AI language models. We also reflect on the viability of using AI to forecast or generate research ideas in this way today. Finally, we have shared the full response log online for readers to consider and evaluate for themselves.

information, information science, lis, (15 more...)

arXiv.org Artificial Intelligence

2107.05383

Country:

North America > Canada > Quebec > Montreal (0.04)
Europe > Germany > Berlin (0.04)
Europe > United Kingdom (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.51)

Add feedback

Getting Factual Answers to More Difficult Questions

#artificialintelligenceJun-24-2021, 11:25:51 GMT

Every day, businesses and organizations are tasked with making more decisions than any human could ever hope to handle. Often, enterprises need to make complex business decisions with limited information on hand. With the help of AI-based products and AI-driven enterprise search solutions as a critical enabling technology, leaders can make better strategic and informed decisions by gaining insight from a vast amount of data in a short period. With the assistance of custom dashboards and 360-degree views of data, employees with different roles can each have a single view into all the information they need at its most appropriate level. The key message for companies is that making the right decisions and fast decisions are not either-or anymore.

difficult question, digital twin, information, (11 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

A Study on the Manifestation of Trust in Speech

Gauder, Lara, Pepino, Leonardo, Riera, Pablo, Brussino, Silvina, Vidal, Jazmín, Gravano, Agustín, Ferrer, Luciana

arXiv.org Artificial IntelligenceFeb-9-2021

Research has shown that trust is an essential aspect of human-computer interaction directly determining the degree to which the person is willing to use a system. An automatic prediction of the level of trust that a user has on a certain system could be used to attempt to correct potential distrust by having the system take relevant actions like, for example, apologizing or explaining its decisions. In this work, we explore the feasibility of automatically detecting the level of trust that a user has on a virtual assistant (VA) based on their speech. We developed a novel protocol for collecting speech data from subjects induced to have different degrees of trust in the skills of a VA. The protocol consists of an interactive session where the subject is asked to respond to a series of factual questions with the help of a virtual assistant. In order to induce subjects to either trust or distrust the VA's skills, they are first informed that the VA was previously rated by other users as being either good or bad; subsequently, the VA answers the subjects' questions consistently to its alleged abilities. All interactions are speech-based, with subjects and VAs communicating verbally, which allows the recording of speech produced under different trust conditions. Using this protocol, we collected a speech corpus in Argentine Spanish. We show clear evidence that the protocol effectively succeeded in influencing subjects into the desired mental state of either trusting or distrusting the agent's skills, and present results of a perceptual study of the degree of trust performed by expert listeners. Finally, we found that the subject's speech can be used to detect which type of VA they were using, which could be considered a proxy for the user's trust toward the VA's abilities, with an accuracy up to 76%, compared to a random baseline of 50%.

experiment, speech, system error, (17 more...)

arXiv.org Artificial Intelligence

2102.0937

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
South America > Argentina > Pampas > Buenos Aires F.D. > Buenos Aires (0.04)
North America > Canada > Quebec > Montreal (0.04)
(5 more...)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.93)

Industry:

Health & Medicine (0.46)
Government (0.46)

Technology:

Information Technology > Human Computer Interaction (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)

Add feedback

Making AI Systems Fairer Will Require Time, Guidelines

#artificialintelligenceSep-10-2019, 16:21:53 GMT

Christoph Lutge, director of the Institute for Ethics in Artificial Intelligence at Germany's Technical University of Munich, said there is "a chance that these AI systems might be fairer eventually, but they will need guidelines." In January, the Institute for Ethics in Artificial Intelligence was established at Germany's Technical University of Munich (TUM), with initial funding from a five-year, $7.5-million grant from Facebook. The Institute has issued its first call for proposals, and an advisory board was recently appointed. The Institute's director, Christoph Lütge, holds the Peter Löscher Chair in Business Ethics at TUM. Lütge recently spoke about ethics in artificial intelligence (AI) generally, and the new Institute specifically. Can you give an example of the type of ethical question in AI that the Center might be dealing with?

artificial intelligence, facebook, social media, (16 more...)

#artificialintelligence

Country: Europe > Germany > Bavaria > Upper Bavaria > Munich (0.48)

Industry:

Information Technology > Services (0.73)
Media (0.54)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)

Add feedback