South America
Japanese researchers discover 248 Nazca Line geoglyphs in Peru
A team of researchers at Yamagata University announced on Monday the discovery of 248 new Nazca Line geoglyphs in Peru. The geoglyphs, which include drawings of humans, birds and llamas, were drawn along footpaths used by people in ancient times, with each path depicting a different theme, the research team said. In cooperation with IBM, the team identified the geoglyphs through field surveys conducted from 2023 to 2024 on sites selected from aerial photographs using artificial intelligence technology. While one path features continuous images of priests holding human heads, or heads alone, another shows multiple depictions of llamas. The research team, which began work on the World Heritage drawings in 2004, has now identified a total of 893 geoglyphs.
Feature learning is decoupled from generalization in high capacity neural networks
Göring, Niclas Alexander, London, Charles, Erturk, Abdurrahman Hadi, Mingard, Chris, Nam, Yoonsoo, Louis, Ard A.
Neural networks outperform kernel methods, sometimes by orders of magnitude, e.g. on staircase functions. This advantage stems from the ability of neural networks to learn features, adapting their hidden representations to better capture the data. We introduce a concept we call feature quality to measure this performance improvement. We examine existing theories of feature learning and demonstrate empirically that they primarily assess the strength of feature learning, rather than the quality of the learned features themselves. Consequently, current theories of feature learning do not provide a sufficient foundation for developing theories of neural network generalization.
Bayesian symbolic regression: Automated equation discovery from a physicists' perspective
Guimera, Roger, Sales-Pardo, Marta
Symbolic regression automates the process of learning closed-form mathematical models from data. Standard approaches to symbolic regression, as well as newer deep learning approaches, rely on heuristic model selection criteria, heuristic regularization, and heuristic exploration of model space. Here, we discuss the probabilistic approach to symbolic regression, an alternative to such heuristic approaches with direct connections to information theory and statistical physics. We show how the probabilistic approach establishes model plausibility from basic considerations and explicit approximations, and how it provides guarantees of performance that heuristic approaches lack. We also discuss how the probabilistic approach compels us to consider model ensembles, as opposed to single models.
Zero-shot Performance of Generative AI in Brazilian Portuguese Medical Exam
Truyts, Cesar Augusto Madid, Rabelo, Amanda Gomes, de Souza, Gabriel Mesquita, Lages, Daniel Scaldaferri, Pereira, Adriano Jose, Flato, Uri Adrian Prync, Reis, Eduardo Pontes dos, Vieira, Joaquim Edson, Silveira, Paulo Sergio Panse, Junior, Edson Amaro
Artificial intelligence (AI) has shown the potential to revolutionize healthcare by improving diagnostic accuracy, optimizing workflows, and personalizing treatment plans. Large Language Models (LLMs) and Multimodal Large Language Models (MLLMs) have achieved notable advancements in natural language processing and medical applications. However, the evaluation of these models has focused predominantly on the English language, leading to potential biases in their performance across different languages. This study investigates the capability of six LLMs (GPT-4.0 Turbo, LLaMA-3-8B, LLaMA-3-70B, Mixtral 8x7B Instruct, Titan Text G1-Express, and Command R+) and four MLLMs (Claude-3.5-Sonnet, Claude-3-Opus, Claude-3-Sonnet, and Claude-3-Haiku) to answer questions written in Brazilian spoken portuguese from the medical residency entrance exam of the Hospital das Clínicas da Faculdade de Medicina da Universidade de São Paulo (HCFMUSP) - the largest health complex in South America. The performance of the models was benchmarked against human candidates, analyzing accuracy, processing time, and coherence of the generated explanations. The results show that while some models, particularly Claude-3.5-Sonnet and Claude-3-Opus, achieved accuracy levels comparable to human candidates, performance gaps persist, particularly in multimodal questions requiring image interpretation. Furthermore, the study highlights language disparities, emphasizing the need for further fine-tuning and data set augmentation for non-English medical AI applications. Our findings reinforce the importance of evaluating generative AI in various linguistic and clinical settings to ensure a fair and reliable deployment in healthcare. Future research should explore improved training methodologies, improved multimodal reasoning, and real-world clinical integration of AI-driven medical assistance.
LoX: Low-Rank Extrapolation Robustifies LLM Safety Against Fine-tuning
Perin, Gabriel J., Chen, Runjin, Chen, Xuxi, Hirata, Nina S. T., Wang, Zhangyang, Hong, Junyuan
Large Language Models (LLMs) have become indispensable in real-world applications. However, their widespread adoption raises significant safety concerns, particularly in responding to socially harmful questions. Despite substantial efforts to improve model safety through alignment, aligned models can still have their safety protections undermined by subsequent fine-tuning - even when the additional training data appears benign. In this paper, we empirically demonstrate that this vulnerability stems from the sensitivity of safety-critical low-rank subspaces in LLM parameters to fine-tuning. Building on this insight, we propose a novel training-free method, termed Low-Rank Extrapolation (LoX), to enhance safety robustness by extrapolating the safety subspace of an aligned LLM. Our experimental results confirm the effectiveness of LoX, demonstrating significant improvements in robustness against both benign and malicious fine-tuning attacks while preserving the model's adaptability to new tasks. For instance, LoX leads to 11% to 54% absolute reductions in attack success rates (ASR) facing benign or malicious fine-tuning attacks. By investigating the ASR landscape of parameters, we attribute the success of LoX to that the extrapolation moves LLM parameters to a flatter zone, thereby less sensitive to perturbations. The code is available at github.com/VITA-Group/LoX.
Flow Stochastic Segmentation Networks
Ribeiro, Fabio De Sousa, Todd, Omar, Jones, Charles, Kori, Avinash, Mehta, Raghav, Glocker, Ben
We introduce the Flow Stochastic Segmentation Network (Flow-SSN), a generative segmentation model family featuring discrete-time autoregressive and modern continuous-time flow variants. We prove fundamental limitations of the low-rank parameterisation of previous methods and show that Flow-SSNs can estimate arbitrarily high-rank pixel-wise covariances without assuming the rank or storing the distributional parameters. Flow-SSNs are also more efficient to sample from than standard diffusion-based segmentation models, thanks to most of the model capacity being allocated to learning the base distribution of the flow, constituting an expressive prior. We apply Flow-SSNs to challenging medical imaging benchmarks and achieve state-of-the-art results. Code available: https://github.com/biomedia-mira/flow-ssn.
Efficient Lines Detection for Robot Soccer
Melo, João G., Mafaldo, João P., Barros, Edna
Self-localization is essential in robot soccer, where accurate detection of visual field features, such as lines and boundaries, is critical for reliable pose estimation. This paper presents a lightweight and efficient method for detecting soccer field lines using the ELSED algorithm, extended with a classification step that analyzes RGB color transitions to identify lines belonging to the field. We introduce a pipeline based on Particle Swarm Optimization (PSO) for threshold calibration to optimize detection performance, requiring only a small number of annotated samples. Our approach achieves accuracy comparable to a state-of-the-art deep learning model while offering higher processing speed, making it well-suited for real-time applications on low-power robotic platforms.
Performance Evaluation and Threat Mitigation in Large-scale 5G Core Deployment
Moreira, Rodrigo, Moreira, Larissa F. Rodrigues, Silva, Flávio de Oliveira
The deployment of large-scale software-based 5G core functions presents significant challenges due to their reliance on optimized and intelligent resource provisioning for their services. Many studies have focused on analyzing the impact of resource allocation for complex deployments using mathematical models, queue theories, or even Artificial Intelligence (AI). This paper elucidates the effects of chaotic workloads, generated by Distributed Denial of Service (DDoS) on different Network Functions (NFs) on User Equipment registration performance. Our findings highlight the necessity of diverse resource profiles to ensure Service-Level Agreement (SLA) compliance in large-scale 5G core deployments. Additionally, our analysis of packet capture approaches demonstrates the potential of kernel-based monitoring for scalable security threat defense. Finally, our empirical evaluation provides insights into the effective deployment of 5G NFs in complex scenarios.
#RoboCup2025: social media round-up part 2
RoboCup2025 took place from 15-21 July in Salvador, Brazil. The event saw around 3000 participants competing in the various leagues. In our first social media round-up post we saw what the teams got up to during the first couple of days of the event. In this second post, we take a look at the action from the final days when the competitions reached their climax. In the #RoboCup2025 @Home OPL Final, our robot performed very well.
#RoboCup2025: social media round-up part 2
RoboCup2025 took place from 15-21 July in Salvador, Brazil. The event saw around 3000 participants competing in the various leagues. In our first social media round-up post we saw what the teams got up to during the first couple of days of the event. In this second post, we take a look at the action from the final days when the competitions reached their climax. In the #RoboCup2025 @Home OPL Final, our robot performed very well.