Goto

Collaborating Authors

 South America


AdaptMI: Adaptive Skill-based In-context Math Instruction for Small Language Models

arXiv.org Artificial Intelligence

In-context learning (ICL) allows a language model to improve its problem-solving capability when provided with suitable information in context. Since the choice of in-context information can be determined based on the problem itself, in-context learning is analogous to human learning from teachers in a classroom. Recent works (Didolkar et al., 2024a; 2024b) show that ICL performance can be improved by leveraging a frontier large language model's (LLM) ability to predict required skills to solve a problem, popularly referred to as an LLM's metacognition, and using the recommended skills to construct necessary in-context examples. While this skill-based strategy boosts ICL performance in larger models, its gains on small language models (SLMs) have been minimal, highlighting a performance gap in ICL capabilities. We investigate this gap and show that skill-based prompting can hurt SLM performance on easy questions by introducing unnecessary information, akin to cognitive overload. To address this, we introduce AdaptMI, an adaptive approach to selecting skill-based in-context Math Instructions for SLMs. Inspired by cognitive load theory from human pedagogy, our method only introduces skill-based examples when the model performs poorly. We further propose AdaptMI+, which adds examples targeted to the specific skills missing from the model's responses. On 5-shot evaluations across popular math benchmarks and five SLMs (1B--7B; Qwen, Llama), AdaptMI+ improves accuracy by up to 6% over naive skill-based strategies.


An Ontology-Driven Graph RAG for Legal Norms: A Structural, Temporal, and Deterministic Approach

arXiv.org Artificial Intelligence

Retrieval-Augmented Generation (RAG) systems in the legal domain face a critical challenge: standard, flat-text retrieval is blind to the hierarchical, diachronic, and causal structure of law, leading to anachronistic and unreliable answers. This paper introduces the Structure-Aware Temporal Graph RAG (SAT-Graph RAG), an ontology-driven framework designed to overcome these limitations by explicitly modeling the formal structure and diachronic nature of legal norms. We ground our knowledge graph in a formal, LRMoo-inspired model that distinguishes abstract legal Works from their versioned Expressions. We model temporal states as efficient aggregations that reuse the versioned expressions (CTVs) of unchanged components, and we reify legislative events as first-class Action nodes to make causality explicit and queryable. This structured backbone enables a unified, planner-guided query strategy that applies explicit policies to deterministically resolve complex requests for (i) point-in-time retrieval, (ii) hierarchical impact analysis, and (iii) auditable provenance reconstruction. Through a case study on the Brazilian Constitution, we demonstrate how this approach provides a verifiable, temporally-correct substrate for LLMs, enabling higher-order analytical capabilities while drastically reducing the risk of factual errors. The result is a practical framework for building more trustworthy and explainable legal AI systems.


BRoverbs -- Measuring how much LLMs understand Portuguese proverbs

arXiv.org Artificial Intelligence

Large Language Models (LLMs) exhibit significant performance variations depending on the linguistic and cultural context in which they are applied. This disparity signals the necessity of mature evaluation frameworks that can assess their capabilities in specific regional settings. In the case of Portuguese, existing evaluations remain limited, often relying on translated datasets that may not fully capture linguistic nuances or cultural references. Meanwhile, native Portuguese-language datasets predominantly focus on structured national exams or sentiment analysis of social media interactions, leaving gaps in evaluating broader linguistic understanding. To address this limitation, we introduce BRoverbs, a dataset specifically designed to assess LLM performance through Brazilian proverbs. Proverbs serve as a rich linguistic resource, encapsulating cultural wisdom, figurative expressions, and complex syntactic structures that challenge the model comprehension of regional expressions. BRoverbs aims to provide a new evaluation tool for Portuguese-language LLMs, contributing to advancing regionally informed benchmarking. The benchmark is available at https://huggingface.co/datasets/Tropic-AI/BRoverbs.


PIPES: A Meta-dataset of Machine Learning Pipelines

arXiv.org Artificial Intelligence

Solutions to the Algorithm Selection Problem (ASP) in machine learning face the challenge of high computational costs associated with evaluating various algorithms' performances on a given dataset. To mitigate this cost, the meta-learning field can leverage previously executed experiments shared in online repositories such as OpenML. OpenML provides an extensive collection of machine learning experiments. However, an analysis of OpenML's records reveals limitations. It lacks diversity in pipelines, specifically when exploring data preprocessing steps/blocks, such as scaling or imputation, resulting in limited representation. Its experiments are often focused on a few popular techniques within each pipeline block, leading to an imbalanced sample. To overcome the observed limitations of OpenML, we propose PIPES, a collection of experiments involving multiple pipelines designed to represent all combinations of the selected sets of techniques, aiming at diversity and completeness. PIPES stores the results of experiments performed applying 9,408 pipelines to 300 datasets. It includes detailed information on the pipeline blocks, training and testing times, predictions, performances, and the eventual error messages. This comprehensive collection of results allows researchers to perform analyses across diverse and representative pipelines and datasets. PIPES also offers potential for expansion, as additional data and experiments can be incorporated to support the meta-learning community further. The data, code, supplementary material, and all experiments can be found at https://github.com/cynthiamaia/PIPES.git.


Learning functions through Diffusion Maps

arXiv.org Artificial Intelligence

We propose a data-driven method for approximating real-valued functions on smooth manifolds, building on the Diffusion Maps framework under the manifold hypothesis. Given pointwise evaluations of a function, the method constructs a smooth extension to the ambient space by exploiting diffusion geometry and its connection to the heat equation and the Laplace-Beltrami operator. To address the computational challenges of high-dimensional data, we introduce a dimensionality reduction strategy based on the low-rank structure of the distance matrix, revealed via singular value decomposition (SVD). In addition, we develop an online updating mechanism that enables efficient incorporation of new data, thereby improving scalability and reducing computational cost. Numerical experiments, including applications to sparse CT reconstruction, demonstrate that the proposed methodology outperforms classical feedforward neural networks and interpolation methods in terms of both accuracy and efficiency.


US energy chief tells BBC nuclear fusion will soon power the world

BBC News

Don't worry too much about planet-warming emissions, the US Energy Secretary has told the BBC, because within five years AI will have enabled the harnessing of nuclear fusion - the energy that powers the sun and stars. Chris Wright told me in an interview that he expected the technology to deliver power to electricity grids around the world within eight to 15 years and that it would rapidly become a big driver of greenhouse gas reductions. His claims will likely surprise even enthusiasts for the technology. Harnessing the energy released when atoms fuse together could produce vast amounts of low carbon energy but most scientists believe commercial fusion power plants are still a long way off. With artificial intelligence and what's going on at the national labs and private companies in the United States, we will have that approach about how to harness fusion energy multiple ways within the next five years, said Mr Wright.


Stone Age women were buried with as many tools as men

Popular Science

Prehistoric graves show women wielded more than early archeologists gave them credit for. Breakthroughs, discoveries, and DIY tips sent every weekday. Some 6,000 years ago in the northern reaches of modern Latvia, a young woman died. Afterwards, a group of early humans buried her in an ancient, sacred place along a lakeshore. They carefully lowered her small teenage frame into the ground, gently placing a stone ax, 28 flint flakes, 15 blades, and a stone scraper beside her.


Belarus frees political prisoners in exchange for easing of US sanctions

BBC News

Dozens of political prisoners have been freed from Belarusian prisons as part of a deal between authoritarian leader Alexander Lukashenko and US President Donald Trump. Fifty-two prisoners have been released, including trade union leaders, journalists and activists, but more than 1,000 political prisoners remain in jail. In exchange, the US has said it will relieve some sanctions on Belarusian airline Belavia, allowing it to buy parts for its airlines. The prisoner release came on the eve of joint military exercises involving Belarus and close ally Russia, and after what neighbouring Poland called an unprecedented Russian drone incursion into its airspace. Poland is closing its borders with Belarus because of the Zapad-2025 drills, which last until Tuesday.


Nato strengthens defences after Russian drones shot down over Poland

BBC News

Nato members are sending troops, artillery, and air defence systems to secure its eastern flank after what Poland called an unprecedented Russian drone incursion into its airspace. In the early hours of Wednesday, three Russian drones were shot down after crossing into Polish airspace. Other drones crashed to the ground and were later found across eastern Poland. Poland has requested a UN Security Council session about the incident, which will take place on Friday at 19:00 GMT. In response to the drone incursion, the Netherlands and the Czech Republic said they would send defences to Poland, while Lithuania would receive a German brigade and greater warning of Russian attacks on Ukraine that could cross over.


See Kathmandu's destroyed and barricaded streets after violence

BBC News

See Kathmandu's destroyed and barricaded streets after violence There's a real sense of tension in Kathmandu, the BBC's Samira Hussain says, after protests against corruption spiralled into arson and violence. Nepal's army deployed patrols on the streets, as the Himalayan nation reeled from its worst unrest in decades. The prime minister quit and politicians' homes were vandalised, and government buildings and parliament were torched. The streets of Nepal's capital have a heavy military presence, with barricades erected outside parliament and the supreme court. The military parade was attended by world leaders including Vladimir Putin and Kim Jong Un and showcased China's new weapons.