AITopics | critical investigation

critical investigation

On the Planning Abilities of Large Language Models - A Critical Investigation

Neural Information Processing Systems | Dec-27-2025, 04:14:29 GMT

Intrigued by the claims of emergent reasoning capabilities in LLMs trained on general web corpora, in this paper, we set out to investigate their planning capabilities. We aim to evaluate (1) the effectiveness of LLMs in generating plans autonomously in commonsense planning tasks and (2) the potential of LLMs as a source of heuristic guidance for other agents (AI planners) in their planning tasks. We conduct a systematic study by generating a suite of instances on domains similar to the ones employed in the International Planning Competition and evaluate LLMs in two distinct modes: autonomous and heuristic. Our findings reveal that LLMs' ability to generate executable plans autonomously is rather limited, with the best model (GPT-4) having an average success rate of ~12% across the domains.
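To make "executable plan" concrete, here is a minimal sketch of how a candidate plan could be checked against a STRIPS-style domain. It is not the paper's evaluation pipeline; the `Action` class, the `is_executable` and `achieves_goal` helpers, and the toy Blocksworld-like facts are all hypothetical names introduced for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Action:
    """A ground STRIPS-style action (hypothetical representation)."""
    name: str
    preconditions: frozenset
    add_effects: frozenset
    del_effects: frozenset


def is_executable(initial_state, plan):
    """Apply actions in order; fail as soon as a precondition is unmet."""
    state = set(initial_state)
    for action in plan:
        if not action.preconditions <= state:
            return False, state
        state = (state - action.del_effects) | action.add_effects
    return True, state


def achieves_goal(initial_state, plan, goal):
    """A plan is valid if it is executable and its final state contains the goal."""
    ok, state = is_executable(initial_state, plan)
    return ok and set(goal) <= state


# Toy two-block instance (assumed facts, for illustration only).
init = {"ontable A", "ontable B", "clear A", "clear B", "handempty"}
pickup_a = Action(
    "pickup A",
    frozenset({"ontable A", "clear A", "handempty"}),
    frozenset({"holding A"}),
    frozenset({"ontable A", "clear A", "handempty"}),
)
stack_a_b = Action(
    "stack A B",
    frozenset({"holding A", "clear B"}),
    frozenset({"on A B", "clear A", "handempty"}),
    frozenset({"holding A", "clear B"}),
)
goal = {"on A B"}

valid = achieves_goal(init, [pickup_a, stack_a_b], goal)
invalid = achieves_goal(init, [stack_a_b], goal)  # skips the pickup step
```

A success rate such as the one quoted in the abstract would then be the fraction of generated plans for which a check of this kind passes.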

language model, name change, planning ability, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Critical Investigation of Failure Modes in Physics-informed Neural Networks

Basir, Shamsulhaq, Senocak, Inanc

arXiv.org Artificial Intelligence | Jun-28-2022

Several recent works in scientific machine learning have revived interest in the application of neural networks to partial differential equations (PDEs). A popular approach is to aggregate the residual form of the governing PDE and its boundary conditions as soft penalties into a composite objective/loss function for training neural networks, which is commonly referred to as physics-informed neural networks (PINNs). In the present study, we visualize the loss landscapes and distributions of learned parameters and explain the ways this particular formulation of the objective function may hinder or even prevent convergence when dealing with challenging target solutions. We construct a purely data-driven loss function composed of both the boundary loss and the domain loss. Using this data-driven loss function and, separately, a physics-informed loss function, we then train two neural network models with the same architecture. We show that incomparable scales between boundary and domain loss terms are the culprit behind the poor performance. Additionally, we assess the performance of both approaches on two elliptic problems with increasingly complex target solutions. Based on our analysis of their loss landscapes and learned parameter distributions, we observe that a physics-informed neural network with a composite objective function formulation produces highly non-convex loss surfaces that are difficult to optimize and are more prone to the problem of vanishing gradients.
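The composite objective described in the abstract can be written generically as follows. This is a sketch in assumed notation, not the paper's own formulation: $u_\theta$ denotes the network, $\mathcal{N}$ the PDE operator on the domain $\Omega$ with source term $f$, $\mathcal{B}$ the boundary operator on $\partial\Omega$ with boundary data $g$, and $\lambda$ a hypothetical penalty weight.

```latex
% Assumed notation (not taken from the paper).
\begin{align}
\mathcal{L}_{\Omega}(\theta) &= \frac{1}{N_{\Omega}} \sum_{i=1}^{N_{\Omega}}
  \left| \mathcal{N}[u_\theta](x_i) - f(x_i) \right|^2, \\
\mathcal{L}_{\partial\Omega}(\theta) &= \frac{1}{N_{\partial\Omega}} \sum_{j=1}^{N_{\partial\Omega}}
  \left| \mathcal{B}[u_\theta](x_j) - g(x_j) \right|^2, \\
\mathcal{L}(\theta) &= \mathcal{L}_{\Omega}(\theta) + \lambda\, \mathcal{L}_{\partial\Omega}(\theta).
\end{align}
```

Because the domain term involves derivatives of $u_\theta$ while the boundary term may involve only $u_\theta$ itself, the two sums can differ by orders of magnitude, which is the "incomparable scales" issue the abstract refers to.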

deep learning, machine learning, physics-informed neural network, (3 more...)

arXiv.org Artificial Intelligence

2206.09961

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

MRes RCA Show: Critical Investigations into the Future of Art and Design Research

#artificialintelligence | Sep-10-2018, 21:05:57 GMT

The MRes RCA programme provides early and mid-career art and design researchers with the intellectual, technical and professional tools with which to complete high-quality research projects. The programme is a uniquely interdisciplinary degree, and the first to be taught across all four Schools of the RCA. Over a full-time year it offers training in practice and theory-led research methods for critical studies in art and design. As demonstrated by the graduating students' work, MRes RCA supports students from diverse backgrounds. Students come from previous study both in art and design and in related disciplines such as history, political sciences and psychology, and with experience working in the creative industries, as practising architects, designers and artists.

art and design research, artificial intelligence, student, (11 more...)

#artificialintelligence

Country: Asia > Middle East > Syria > Aleppo Governorate > Aleppo (0.05)

Industry: Education (0.91)

Technology: Information Technology > Artificial Intelligence (0.31)

How Much Reading Does Reading Comprehension Require? A Critical Investigation of Popular Benchmarks

Kaushik, Divyansh, Lipton, Zachary C.

arXiv.org Artificial Intelligence | Aug-21-2018

Many recent papers address reading comprehension, where examples consist of (question, passage, answer) tuples. Presumably, a model must combine information from both questions and passages to predict corresponding answers. However, despite intense interest in the topic, with hundreds of published papers vying for leaderboard dominance, basic questions about the difficulty of many popular benchmarks remain unanswered. In this paper, we establish sensible baselines for the bAbI, SQuAD, CBT, CNN, and Who-did-What datasets, finding that question- and passage-only models often perform surprisingly well. On 14 out of 20 bAbI tasks, passage-only models achieve greater than 50% accuracy, sometimes matching the full model. Interestingly, while CBT provides 20-sentence stories, only the last sentence is needed for comparably accurate prediction. By comparison, SQuAD and CNN appear better-constructed.
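The question-only and passage-only baselines mentioned above can be illustrated with a small ablation harness. This is a sketch under stated assumptions, not the authors' code: the `Example` tuple, the `ablate` and `accuracy` helpers, the `last_word` toy predictor, and the two sample items are all hypothetical, and blanking an input is only one possible way to withhold it from a model.

```python
from typing import NamedTuple


class Example(NamedTuple):
    question: str
    passage: str
    answer: str


def ablate(example: Example, keep: str) -> Example:
    """Blank out one input so a model sees only the question or only the passage."""
    if keep == "question":
        return example._replace(passage="")
    if keep == "passage":
        return example._replace(question="")
    if keep == "both":
        return example
    raise ValueError(f"unknown mode: {keep!r}")


def accuracy(predict, examples, keep):
    """Fraction of examples answered correctly under the given ablation."""
    correct = sum(predict(ablate(ex, keep)) == ex.answer for ex in examples)
    return correct / len(examples)


def last_word(example: Example) -> str:
    """Toy predictor that ignores the question and returns the passage's last word."""
    words = example.passage.split()
    return words[-1] if words else ""


# Made-up items in which the answer is always the last word of the passage.
examples = [
    Example("Where is Mary?", "Mary went to the kitchen", "kitchen"),
    Example("Where is John?", "John moved to the garden", "garden"),
]

passage_only = accuracy(last_word, examples, "passage")
question_only = accuracy(last_word, examples, "question")
```

On data like this, a predictor that never reads the question still scores perfectly in the passage-only setting, which is the kind of benchmark weakness such baselines are meant to expose.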

artificial intelligence, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

1808.04926

Country: North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.05)

Genre: Research Report > New Finding (0.68)

Industry: Education > Assessment & Standards > Student Performance (0.72)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)
