AITopics | arnold

Collaborating Authors

arnold

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question "What is Artificial Intelligence?" and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

The real storm chasers of the Great Plains

Popular Science | May-7-2026, 13:00:00 GMT

Storm chasers took this photo of a rotating wall cloud in Clovis, New Mexico, in May 2023. Flying cows, SUVs soaring through the air like toys, quaint towns that are virtually wiped off the map. Hollywood certainly makes the very real world of chasing tornadoes appear exciting on the big screen.

artificial intelligence, storm, tornado, (11 more...)

Popular Science

Country: North America > United States > New Mexico > Curry County > Clovis (0.24)

Industry:

Government > Regional Government > North America Government > United States Government (0.49)
Media > Film (0.35)

Technology: Information Technology > Artificial Intelligence (0.49)

Online child safety advocates urge California lawmakers to increase protections

Los Angeles Times | Dec-7-2025, 11:00:00 GMT

While child safety advocates agree progress was made at the state capital this year to protect children online, they argue there’s still a long way to go and plan to fight for more protections when legislators reconvene in January.

advertisement, artificial intelligence, social media, (11 more...)

Los Angeles Times

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.06)
South America > Venezuela (0.04)
North America > United States > New York (0.04)
(6 more...)

Industry: Government > Regional Government > North America Government > United States Government (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence (1.00)

ChessArena: A Chess Testbed for Evaluating Strategic Reasoning Capabilities of Large Language Models

Liu, Jincheng, He, Sijun, Wu, Jingjing, Wang, Xiangsen, Chen, Yang, Kuang, Zhaoqi, Bao, Siqi, Yao, Yuan

arXiv.org Artificial Intelligence | Dec-3-2025

Recent large language models (LLMs) have shown strong reasoning capabilities. However, a critical question remains: do these models possess genuine reasoning skills, particularly complex strategic reasoning, or are they primarily excelling at sophisticated pattern recognition within their training data? To address this question, this paper presents a chess testbed, ChessArena, to evaluate the strategic reasoning capabilities of LLMs. Chess requires complex strategic reasoning capabilities including long-term planning, strict rule comprehension, and multi-turn conversation memorization. Specifically, ChessArena is a competitive framework where LLMs play against each other under four different play modes. The testbed is equipped with a ranking algorithm and a leaderboard. The testbed can also evaluate fine-grained capabilities including basic understanding, move selection, and puzzle solving. Over 13 LLMs with different modes are evaluated in ChessArena, playing over 800 games. The results reveal significant shortcomings in current LLMs: no model can beat Maia-1100 (a chess engine at human amateur level), while some even failed to defeat a random player that selects moves arbitrarily. We also present a strong baseline to the testbed: our fine-tuned Qwen3-8B substantially improved performance, approaching much larger state-of-the-art reasoning models.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2509.24239

Genre: Research Report > New Finding (0.45)

Industry: Leisure & Entertainment > Games > Chess (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)
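
The ChessArena abstract above mentions a ranking algorithm and a leaderboard but does not say which algorithm is used. As a rough illustration only, the sketch below uses the classic Elo update, which is an assumption and not necessarily the paper's method; the function names, the starting rating of 1500, and the K-factor of 32 are all hypothetical choices.

```python
from typing import Dict, Iterable, List, Tuple


def expected_score(rating_a: float, rating_b: float) -> float:
    """Expected score of player A against player B under the Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_elo(rating_a: float, rating_b: float, score_a: float,
               k: float = 32.0) -> Tuple[float, float]:
    """Return updated ratings after one game.

    score_a is 1.0 for a win by A, 0.5 for a draw, 0.0 for a loss.
    The K-factor of 32 is an illustrative default, not a value from the paper.
    """
    exp_a = expected_score(rating_a, rating_b)
    new_a = rating_a + k * (score_a - exp_a)
    new_b = rating_b + k * ((1.0 - score_a) - (1.0 - exp_a))
    return new_a, new_b


def build_leaderboard(players: Iterable[str],
                      games: Iterable[Tuple[str, str, float]],
                      initial: float = 1500.0,
                      k: float = 32.0) -> List[Tuple[str, float]]:
    """Replay games in order and return (player, rating) sorted best first."""
    ratings: Dict[str, float] = {p: initial for p in players}
    for a, b, score_a in games:
        ratings[a], ratings[b] = update_elo(ratings[a], ratings[b], score_a, k)
    return sorted(ratings.items(), key=lambda kv: kv[1], reverse=True)


if __name__ == "__main__":
    # Hypothetical players and results, for illustration only.
    results = [("model_a", "model_b", 1.0), ("model_b", "random_player", 0.5)]
    for name, rating in build_leaderboard(
            ["model_a", "model_b", "random_player"], results):
        print(f"{name}: {rating:.1f}")
```

Elo is order-dependent and assumes a fixed K-factor; a real testbed may prefer a rating system that also tracks uncertainty, so treat this as a sketch of the general idea rather than a reproduction.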

What does 'chance of precipitation' really mean? A meteorologist explains.

Popular Science | Dec-2-2025, 14:00:00 GMT

Here's how to figure out if you can leave the umbrella at home. It's not always "when it rains, it pours." Understanding the weather forecast can sometimes feel like reading tea leaves.

artificial intelligence, forecast, precipitation, (12 more...)

Popular Science

Country: North America > United States > New York (0.05)

Industry: Government (0.48)

Technology: Information Technology > Artificial Intelligence (0.68)

EEFSUVA: A New Mathematical Olympiad Benchmark

Khatibi, Nicole N, Radamovich, Daniil A., Brenner, Michael P.

arXiv.org Artificial Intelligence | Oct-6-2025

Recent breakthroughs have spurred claims that large language models (LLMs) match gold-medal Olympiad to graduate-level proficiency on mathematics benchmarks. In this work, we examine these claims in detail and assess the extent to which current benchmarks capture genuine LLM mathematical reasoning. The composition of these benchmarks, primarily drawing from the International Mathematical Olympiad (IMO) and related competitions, may overstate models' reasoning ability due to potential data contamination and a narrow focus on familiar problem types. To enable a more holistic assessment of mathematical understanding, we introduce EEFSUVA, a novel benchmark curated from under-circulated regional and national Olympiads of Eastern Europe and the countries of the former Soviet Union. These contests feature problems of comparable difficulty to the IMO and are renowned for demanding nonstandard problem-solving techniques, yet their problems are far less prevalent in online corpora. Preliminary results suggest that even state-of-the-art LLMs exhibit a notable performance decline on EEFSUVA relative to other Olympiad-style benchmarks. These findings also suggest the potential importance of broader evaluation datasets for a fuller assessment of mathematical reasoning and for guiding future model development.

benchmark, large language model, machine learning, (21 more...)

arXiv.org Artificial Intelligence

2510.01227

Country:

Europe > Russia (0.35)
Asia > Russia (0.25)
Europe > Eastern Europe (0.24)

Genre: Research Report (0.70)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.31)

Researchers Are Already Leaving Meta's New Superintelligence Lab

WIRED | Aug-26-2025, 18:00:37 GMT

At least three artificial intelligence researchers have resigned from Meta's new superintelligence lab, just two months after CEO Mark Zuckerberg first announced the initiative. Two of the staffers have returned to OpenAI, where they both previously worked, after less than one-month stints at Meta, WIRED has confirmed. Ethan Knight worked at the ChatGPT maker earlier in his career but joined Meta from Elon Musk's xAI. A third researcher, Rishabh Agarwal, announced publicly on Monday he was leaving Meta's lab as well. He joined the tech giant in April to work on generative AI projects before switching to a role at Meta Superintelligence Labs (MSL), according to his LinkedIn profile.

large language model, machine learning, natural language, (20 more...)

WIRED

Country:

North America > United States > California > San Mateo County > Menlo Park (0.06)
North America > Canada (0.06)

Industry: Information Technology (0.59)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

Arnold: a generalist muscle transformer policy

Chiappa, Alberto Silvio, An, Boshi, Simos, Merkourios, Li, Chengkun, Mathis, Alexander

arXiv.org Artificial Intelligence | Aug-26-2025

Controlling high-dimensional and nonlinear musculoskeletal models of the human body is a foundational scientific challenge. Recent machine learning breakthroughs have heralded policies that master individual skills like reaching, object manipulation and locomotion in musculoskeletal systems with many degrees of freedom. However, these agents are merely "specialists", achieving high performance for a single skill. In this work, we develop Arnold, a generalist policy that masters multiple tasks and embodiments. Arnold combines behavior cloning and fine-tuning with PPO to achieve expert or super-expert performance in 14 challenging control tasks from dexterous object manipulation to locomotion. A key innovation is Arnold's sensorimotor vocabulary, a compositional representation of the semantics of heterogeneous sensory modalities, objectives, and actuators. Arnold leverages this vocabulary via a transformer architecture to deal with the variable observation and action spaces of each task. This framework supports efficient multi-task, multi-embodiment learning and facilitates rapid adaptation to novel tasks. Finally, we analyze Arnold to provide insights into biological motor control, corroborating recent findings on the limited transferability of muscle synergies across tasks.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2508.18066

Country: North America > United States (0.67)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine > Therapeutic Area (0.67)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(3 more...)

Bandit Theory and Thompson Sampling Guided Directed Evolution for Sequence Optimization

Hui Yuan

Neural Information Processing Systems | Aug-19-2025, 21:18:35 GMT

There is a growing interest in machine learning-assisted directed evolution (DE) for accelerating protein optimization. Yet the theoretical understanding of DE, as well as the use of machine learning in DE, remains limited.

algorithm, evolution, sequence, (16 more...)

Neural Information Processing Systems

Country:

North America > Canada > Alberta (0.14)
Europe > France (0.05)
North America > United States > Oregon (0.04)
(3 more...)

Genre: Research Report (0.46)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)
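
The entry above names Thompson sampling in its title, but the snippet does not describe how the paper applies it to directed evolution. For readers unfamiliar with the general technique, here is a minimal sketch on a plain Bernoulli bandit with Beta posteriors. This is a textbook setting chosen for illustration and is not the paper's algorithm; the function name, the arm probabilities, and the uniform Beta(1, 1) prior are assumptions.

```python
import random
from typing import List, Sequence


def thompson_sampling(true_probs: Sequence[float], n_rounds: int,
                      seed: int = 0) -> List[int]:
    """Run Beta-Bernoulli Thompson sampling and return pull counts per arm.

    true_probs holds each arm's hidden success probability (hypothetical
    values supplied by the caller; the sampler only sees the 0/1 rewards).
    """
    rng = random.Random(seed)
    n_arms = len(true_probs)
    successes = [0] * n_arms
    failures = [0] * n_arms

    for _ in range(n_rounds):
        # Draw one plausible success rate per arm from its Beta posterior,
        # starting from a uniform Beta(1, 1) prior.
        samples = [rng.betavariate(successes[i] + 1, failures[i] + 1)
                   for i in range(n_arms)]
        # Play the arm whose sampled rate is highest.
        arm = max(range(n_arms), key=lambda i: samples[i])
        # Observe a Bernoulli reward and update that arm's posterior counts.
        if rng.random() < true_probs[arm]:
            successes[arm] += 1
        else:
            failures[arm] += 1

    return [s + f for s, f in zip(successes, failures)]


if __name__ == "__main__":
    pulls = thompson_sampling([0.2, 0.5, 0.8], n_rounds=1000)
    print("pulls per arm:", pulls)
```

Sampling from the posterior, instead of always picking the arm with the best observed mean, is what lets the procedure keep exploring uncertain arms while concentrating pulls on the promising ones.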

From Chaos to Order: The Atomic Reasoner Framework for Fine-grained Reasoning in Large Language Models

Liu, Jinyi, Zheng, Yan, Cheng, Rong, Wu, Qiyu, Guo, Wei, Ni, Fei, Liang, Hebin, Yuan, Yifu, Mao, Hangyu, Zhang, Fuzheng, Hao, Jianye

arXiv.org Artificial Intelligence | Mar-20-2025

Recent advances in large language models (LLMs) have shown remarkable progress, yet their capacity for logical "slow-thinking" reasoning persists as a critical research frontier. Current inference scaling paradigms suffer from two fundamental constraints: fragmented thought flows compromising logical coherence, and intensive computational complexity that escalates with search space dimensions. To overcome these limitations, we present Atomic Reasoner (AR), a cognitive inference strategy that enables fine-grained reasoning through systematic atomic-level operations. AR decomposes the reasoning process into atomic cognitive units, employing a cognitive routing mechanism to dynamically construct reasoning representations and orchestrate inference pathways. This systematic methodology implements stepwise, structured cognition, which ensures logical coherence while significantly reducing cognitive load, effectively simulating the cognitive patterns observed in human deep thinking processes. Extensive experimental results demonstrate AR's superior reasoning capabilities without the computational burden of exhaustive solution searches, particularly excelling in linguistic logic puzzles. These findings substantiate AR's effectiveness in enhancing LLMs' capacity for robust, long-sequence logical reasoning and deliberation.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2503.15944

Country:

Asia > Thailand > Bangkok > Bangkok (0.04)
Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
Asia > China > Tianjin Province > Tianjin (0.04)
Africa > Rwanda > Kigali > Kigali (0.04)

Genre:

Workflow (1.00)
Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Language Models Largely Exhibit Human-like Constituent Ordering Preferences

Tur, Ada Defne, Kamath, Gaurav, Reddy, Siva

arXiv.org Artificial Intelligence | Feb-14-2025

Though English sentences are typically inflexible vis-à-vis word order, constituents often show far more variability in ordering. One prominent theory presents the notion that constituent ordering is directly correlated with constituent weight: a measure of the constituent's length or complexity. Such theories are interesting in the context of natural language processing (NLP), because while recent advances in NLP have led to significant gains in the performance of large language models (LLMs), much remains unclear about how these models process language, and how this compares to human language processing. In particular, the question remains whether LLMs display the same patterns with constituent movement, and whether they may provide insights into existing theories on when and how the shift occurs in human language. We compare a variety of LLMs with diverse properties to evaluate broad LLM performance on four types of constituent movement: heavy NP shift, particle movement, dative alternation, and multiple PPs. Despite performing unexpectedly around particle movement, LLMs generally align with human preferences around constituent ordering.

large language model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2502.0567

Country:

Europe (1.00)
North America > United States (0.68)
North America > Canada > Quebec (0.14)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.75)
