AITopics | South America

Collaborating Authors

South America

Exploring a Large Language Model for Transforming Taxonomic Data into OWL: Lessons Learned and Implications for Ontology Development

Soares, Filipi Miranda, Saraiva, Antonio Mauro, Pires, Luís Ferreira, Santos, Luiz Olavo Bonino da Silva, Moreira, Dilvan de Abreu, Corrêa, Fernando Elias, Braghetto, Kelly Rosa, Drucker, Debora Pignatari, Delbem, Alexandre Cláudio Botazzo

arXiv.org Artificial IntelligenceMay-27-2025

Managing scientific names in ontologies that represent species taxonomies is challenging due to the ever-evolving nature of these taxonomies. Manually maintaining these names becomes increasingly difficult when dealing with thousands of scientific names. To address this issue, this paper investigates the use of ChatGPT-4 to automate the development of the :Organism module in the Agricultural Product Types Ontology (APTO) for species classification. Our methodology involved leveraging ChatGPT-4 to extract data from the GBIF Backbone API and generate OWL files for further integration in APTO. Two alternative approaches were explored: (1) issuing a series of prompts for ChatGPT-4 to execute tasks via the BrowserOP plugin and (2) directing ChatGPT-4 to design a Python algorithm to perform analogous tasks. Both approaches rely on a prompting method where we provide instructions, context, input data, and an output indicator. The first approach showed scalability limitations, while the second approach used the Python algorithm to overcome these challenges, but it struggled with typographical errors in data handling. This study highlights the potential of Large language models like ChatGPT-4 to streamline the management of species names in ontologies. Despite certain limitations, these tools offer promising advancements in automating taxonomy-related tasks and improving the efficiency of ontology development.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

doi: 10.3724/2096-7004.di.2025.0020

2504.18651

Country:

South America > Brazil (0.29)
Europe > Netherlands (0.28)

Genre: Research Report > New Finding (0.46)

Industry:

Health & Medicine (1.00)
Information Technology (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition

Neural Information Processing SystemsMay-26-2025, 22:38:37 GMT

Large language model systems face significant security risks from maliciously crafted messages that aim to overwrite the system's original instructions or leak private data. To study this problem, we organized a capture-the-flag competition at IEEE SaTML 2024, where the flag is a secret string in the LLM system prompt. The competition was organized in two phases. In the first phase, teams developed defenses to prevent the model from leaking the secret. During the second phase, teams were challenged to extract the secrets hidden for defenses proposed by the other teams.

artificial intelligence, large language model, natural language, (2 more...)

Neural Information Processing Systems

Country: South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.09)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

Not so griddy: Internal representations of RNNs path integrating more than one agent

Neural Information Processing SystemsMay-26-2025, 19:22:45 GMT

Success in collaborative and competitive environments, where agents must work with or against each other, requires individuals to encode the position and trajectory of themselves and others. Decades of neurophysiological experiments have shed light on how brain regions [e.g., medial entorhinal cortex (MEC), hippocampus] encode the self's position and trajectory. However, it has only recently been discovered that MEC and hippocampus are modulated by the positions and trajectories of others. To understand how encoding spatial information of multiple agents shapes neural representations, we train a recurrent neural network (RNN) model that captures properties of MEC to path integrate trajectories of two agents simultaneously navigating the same environment. We find significant differences between these RNNs and those trained to path integrate only a single agent.

artificial intelligence, machine learning, representation, (6 more...)

Neural Information Processing Systems

Country: South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.09)

Genre: Research Report > New Finding (0.43)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.63)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.63)

Add feedback

CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark

Neural Information Processing SystemsMay-26-2025, 17:02:34 GMT

Visual Question Answering (VQA) is an important task in multimodal AI, which requires models to understand and reason on knowledge present in visual and textual data. However, most of the current VQA datasets and models are primarily focused on English and a few major world languages, with images that are Western-centric. While recent efforts have tried to increase the number of languages covered on VQA datasets, they still lack diversity in low-resource languages. More importantly, some datasets extend the text to other languages, either via translation or some other approaches, but usually keep the same images, resulting in narrow cultural representation. To address these limitations, we create CVQA, a new Culturally-diverse Multilingual Visual Question Answering benchmark dataset, designed to cover a rich set of languages and regions, where we engage native speakers and cultural experts in the data collection process.

artificial intelligence, cvqa, question answering, (4 more...)

Neural Information Processing Systems

Country:

South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.06)
Asia > Middle East > Israel (0.06)

Technology: Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.85)

Add feedback

SolarCube: An Integrative Benchmark Dataset Harnessing Satellite and In-situ Observations for Large-scale Solar Energy Forecasting

Neural Information Processing SystemsMay-26-2025, 15:22:44 GMT

Solar power is a critical source of renewable energy, offering significant potential to lower greenhouse gas emissions and mitigate climate change. However, the cloud induced-variability of solar radiation reaching the earth's surface presents a challenge for integrating solar power into the grid (e.g., storage and backup management). The new generation of geostationary satellites such as GOES-16 has become an important data source for large-scale and high temporal frequency solar radiation forecasting. However, no machine-learning-ready dataset has integrated geostationary satellite data with fine-grained solar radiation information to support forecasting model development and benchmarking with consistent metrics. SolarCube covers 19 study areas distributed over multiple continents: North America, South America, Asia, and Oceania.

artificial intelligence, machine learning, modeling & simulation, (7 more...)

Neural Information Processing Systems

Country:

South America (0.27)
Oceania (0.27)
North America (0.27)
Asia (0.27)

Industry: Energy > Renewable > Solar (1.00)

Technology:

Information Technology > Modeling & Simulation (0.76)
Information Technology > Artificial Intelligence > Machine Learning (0.40)

Add feedback

Scientist delivers ominous message to humanity after UFO covered in strange writing is found

Daily Mail - Science & techMay-26-2025, 15:20:57 GMT

A UFO researcher has an ominous message for humanity as governments around the world begin releasing more information about alleged contact with extraterrestrials. Dr Julia Mossbridge is a cognitive neuroscientist and a researcher of unidentified aerial phenomena (UAP) - the new term for UFOs and alien sightings. After scientists in Colombia recovered a mysterious, sphere-shaped object that many now believe is a piece of UFO technology, Mossbridge said the world is moving into an era which may soon have to deal with the knowledge that aliens exist. 'We are entering a time when we are starting to recognize as humans we don't have the control that we thought we had over everything,' Dr Mossbridge told Fox News. However, Mossbridge, who studies how humans think and also attended the May 1 congressional hearing on UAPs, said the impending disclosure of alien life could throw the worldview of many people into chaos.

mossbridge, scientist deliver ominous message, velez el potro, (11 more...)

Daily Mail - Science & tech

Country:

South America > Colombia (0.28)
South America > Peru (0.05)

Industry:

Government (0.70)
Media > News (0.35)

Technology: Information Technology > Artificial Intelligence (0.51)

Add feedback

Towards robust vision by multi-task learning on monkey visual cortex

Neural Information Processing SystemsMay-26-2025, 15:01:38 GMT

Deep neural networks set the state-of-the-art across many tasks in computer vision, but their generalization ability to simple image distortions is surprisingly fragile. In contrast, the mammalian visual system is robust to a wide range of perturbations. Recent work suggests that this generalization ability can be explained by useful inductive biases encoded in the representations of visual stimuli throughout the visual cortex. Here, we successfully leveraged these inductive biases with a multi-task learning approach: we jointly trained a deep network to perform image classification and to predict neural activity in macaque primary visual cortex (V1) in response to the same natural stimuli. We measured the out-of-distribution generalization abilities of our resulting network by testing its robustness to common image distortions.

artificial intelligence, machine learning, visual cortex, (11 more...)

Neural Information Processing Systems

Country: South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.07)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.86)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

DataRater: Meta-Learned Dataset Curation

Calian, Dan A., Farquhar, Gregory, Kemaev, Iurii, Zintgraf, Luisa M., Hessel, Matteo, Shar, Jeremy, Oh, Junhyuk, György, András, Schaul, Tom, Dean, Jeffrey, van Hasselt, Hado, Silver, David

arXiv.org Machine LearningMay-26-2025

The quality of foundation models depends heavily on their training data. Consequently, great efforts have been put into dataset curation. Yet most approaches rely on manual tuning of coarse-grained mixtures of large buckets of data, or filtering by hand-crafted heuristics. An approach that is ultimately more scalable (let alone more satisfying) is to \emph{learn} which data is actually valuable for training. This type of meta-learning could allow more sophisticated, fine-grained, and effective curation. Our proposed \emph{DataRater} is an instance of this idea. It estimates the value of training on any particular data point. This is done by meta-learning using `meta-gradients', with the objective of improving training efficiency on held out data. In extensive experiments across a range of model scales and datasets, we find that using our DataRater to filter data is highly effective, resulting in significantly improved compute efficiency.

large language model, machine learning, natural language, (19 more...)

arXiv.org Machine Learning

2505.17895

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)
Asia > Middle East > Jordan (0.04)
(12 more...)

Genre: Research Report (0.65)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Leveraging Stochastic Depth Training for Adaptive Inference

Korol, Guilherme, Beck, Antonio Carlos Schneider, Castrillon, Jeronimo

arXiv.org Artificial IntelligenceMay-26-2025

Dynamic DNN optimization techniques such as layer-skipping offer increased adaptability and efficiency gains but can lead to i) a larger memory footprint as in decision gates, ii) increased training complexity (e.g., with non-differentiable operations), and iii) less control over performance-quality trade-offs due to its inherent input-dependent execution. To approach these issues, we propose a simpler yet effective alternative for adaptive inference with a zero-overhead, single-model, and time-predictable inference. Central to our approach is the observation that models trained with Stochastic Depth -- a method for faster training of residual networks -- become more resilient to arbitrary layer-skipping at inference time. We propose a method to first select near Pareto-optimal skipping configurations from a stochastically-trained model to adapt the inference at runtime later. Compared to original ResNets, our method shows improvements of up to 2X in power efficiency at accuracy drops as low as 0.71%.

artificial intelligence, configuration, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2505.17626

Country:

Europe > Germany (0.28)
South America > Brazil (0.28)

Genre: Research Report (0.65)

Industry: Information Technology (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Language Matters: How Do Multilingual Input and Reasoning Paths Affect Large Reasoning Models?

Tam, Zhi Rui, Wu, Cheng-Kuang, Chiu, Yu Ying, Lin, Chieh-Yen, Chen, Yun-Nung, Lee, Hung-yi

arXiv.org Artificial IntelligenceMay-26-2025

Large reasoning models (LRMs) have demonstrated impressive performance across a range of reasoning tasks, yet little is known about their internal reasoning processes in multilingual settings. We begin with a critical question: {\it In which language do these models reason when solving problems presented in different languages?} Our findings reveal that, despite multilingual training, LRMs tend to default to reasoning in high-resource languages (e.g., English) at test time, regardless of the input language. When constrained to reason in the same language as the input, model performance declines, especially for low-resource languages. In contrast, reasoning in high-resource languages generally preserves performance. We conduct extensive evaluations across reasoning-intensive tasks (MMMLU, MATH-500) and non-reasoning benchmarks (CulturalBench, LMSYS-toxic), showing that the effect of language choice varies by task type: input-language reasoning degrades performance on reasoning tasks but benefits cultural tasks, while safety evaluations exhibit language-specific behavior. By exposing these linguistic biases in LRMs, our work highlights a critical step toward developing more equitable models that serve users across diverse linguistic backgrounds.

large language model, machine learning, natural language, (22 more...)

arXiv.org Artificial Intelligence

2505.17407

Country:

Europe (1.00)
Asia (1.00)
South America (0.68)

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.91)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.74)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.33)

Add feedback