AITopics | South America

Collaborating Authors

South America

Beyond Black-Box Predictions: Identifying Marginal Feature Effects in Tabular Transformer Networks

Thielmann, Anton, Reuter, Arik, Saefken, Benjamin

arXiv.org Machine LearningApr-11-2025

In recent years, deep neural networks have showcased their predictive power across a variety of tasks. Beyond natural language processing, the transformer architecture has proven efficient in addressing tabular data problems and challenges the previously dominant gradient-based decision trees in these areas. However, this predictive power comes at the cost of intelligibility: Marginal feature effects are almost completely lost in the black-box nature of deep tabular transformer networks. Alternative architectures that use the additivity constraints of classical statistical regression models can maintain intelligible marginal feature effects, but often fall short in predictive power compared to their more complex counterparts. To bridge the gap between intelligibility and performance, we propose an adaptation of tabular transformer networks designed to identify marginal feature effects. We provide theoretical justifications that marginal feature effects can be accurately identified, and our ablation study demonstrates that the proposed model efficiently detects these effects, even amidst complex feature interactions. To demonstrate the model's predictive capabilities, we compare it to several interpretable as well as black-box models and find that it can match black-box performances while maintaining intelligibility. The source code is available at https://github.com/OpenTabular/NAMpy.

artificial intelligence, machine learning, natural language, (12 more...)

arXiv.org Machine Learning

2504.08712

Country:

North America > United States > California (0.05)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
South America > Brazil (0.04)
Europe > United Kingdom > England > Lancashire > Lancaster (0.04)

Genre: Research Report (0.64)

Industry: Transportation > Air (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Black Mirror's pessimism porn won't lead us to a better future Louis Anslow

The GuardianApr-10-2025, 07:00:24 GMT

Black Mirror is more than science fiction – its stories about modernity have become akin to science folklore, shaping our collective view of technology and the future. Each new innovation gets an allegory: smartphones as tools for a new age caste system, robot dogs as overzealous human hunters, drones as a murderous swarm, artificial intelligence as new age necromancy, virtual reality and brain chips as seizure-inducing nightmares, to name a few. It is a must-watch, but must we take it so seriously? Black Mirror fails to consistently explore the duality of technology and our reactions to it. It is a critical deficit.

black mirror, future louis anslow, technological change, (15 more...)

The Guardian

Country:

Europe > United Kingdom (0.30)
South America > Brazil (0.05)
North America > United States > New York (0.05)
(6 more...)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (0.74)
Energy > Power Industry > Utilities > Nuclear (0.49)

Technology: Information Technology > Artificial Intelligence > Science Fiction (0.52)

Add feedback

Unifying and extending Diffusion Models through PDEs for solving Inverse Problems

Dasgupta, Agnimitra, da Cunha, Alexsander Marciano, Fardisi, Ali, Aminy, Mehrnegar, Binder, Brianna, Shaddy, Bryan, Oberai, Assad A

arXiv.org Machine LearningApr-10-2025

Diffusion models have emerged as powerful generative tools with applications in computer vision and scientific machine learning (SciML), where they have been used to solve large-scale probabilistic inverse problems. Traditionally, these models have been derived using principles of variational inference, denoising, statistical signal processing, and stochastic differential equations. In contrast to the conventional presentation, in this study we derive diffusion models using ideas from linear partial differential equations and demonstrate that this approach has several benefits that include a constructive derivation of the forward and reverse processes, a unified derivation of multiple formulations and sampling strategies, and the discovery of a new class of models. We also apply the conditional version of these models to solving canonical conditional density estimation problems and challenging inverse problems. These problems help establish benchmarks for systematically quantifying the performance of different formulations and sampling strategies in this study, and for future studies. Finally, we identify and implement a mechanism through which a single diffusion model can be applied to measurements obtained from multiple measurement operators. Taken together, the contents of this manuscript provide a new understanding and several new directions in the application of diffusion models to solving physics-based inverse problems.

artificial intelligence, diffusion model, machine learning, (18 more...)

arXiv.org Machine Learning

2504.07437

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
South America > Brazil > Rio de Janeiro > Rio de Janeiro (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Add feedback

Revealed: Big tech's new datacentres will take water from the world's driest areas

The GuardianApr-9-2025, 11:30:01 GMT

Amazon, Microsoft and Google are operating datacentres that use vast amounts of water in some of the world's driest areas and are building many more, an investigation by SourceMaterial and the Guardian has found. With Donald Trump pledging to support them, the three technology giants are planning hundreds of datacentres in the US and across the globe, with a potentially huge impact on populations already living with water scarcity. "The question of water is going to become crucial," said Lorena Jaume-Palasí, founder of the Ethical Tech Society. "Resilience from a resource perspective is going to be very difficult for those communities." Efforts by Amazon, the world's largest online retailer, to mitigate its water use have sparked opposition from inside the company, SourceMaterial's investigation found, with one of its own sustainability experts warning that its plans are "not ethical".

amazon, datacentre, new datacentre, (15 more...)

The Guardian

Country:

Europe > Spain > Aragón (0.07)
South America (0.05)
Oceania > Australia (0.05)
(5 more...)

Industry:

Government > Regional Government > North America Government > United States Government (0.90)
Information Technology > Services (0.70)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.31)
Information Technology > Communications > Social Media (0.30)

Add feedback

Can A.I. Writing Be More Than a Gimmick?

The New YorkerApr-9-2025, 10:00:00 GMT

The new essay collection "Searches: Selfhood in the Digital Age," by Vauhini Vara, opens with a transcript. "If I paste some writing here, can we talk about it?" Her interlocutor, the large language model ChatGPT, responds, "Of course!" The chatbot asks what specific themes it should focus on. "Nothing in particular," Vara replies.

gpt-3, powerpoint, vara, (16 more...)

The New Yorker

Country:

South America > Bolivia (0.14)
North America > United States > New York (0.04)
North America > United States > Florida > Orange County (0.04)
(2 more...)

Genre: Summary/Review (0.40)

Industry: Media (0.70)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.95)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.73)

Add feedback

Assumption-free fidelity bounds for hardware noise characterization

Colombo, Nicolo

arXiv.org Machine LearningApr-9-2025

In the Quantum Supremacy regime, quantum computers may overcome classical machines on several tasks if we can estimate, mitigate, or correct unavoidable hardware noise. Estimating the error requires classical simulations, which become unfeasible in the Quantum Supremacy regime. We leverage Machine Learning data-driven approaches and Conformal Prediction, a Machine Learning uncertainty quantification tool known for its mild assumptions and finite-sample validity, to find theoretically valid upper bounds of the fidelity between noiseless and noisy outputs of quantum devices. Under reasonable extrapolation assumptions, the proposed scheme applies to any Quantum Computing hardware, does not require modeling the device's noise sources, and can be used when classical simulations are unavailable, e.g. in the Quantum Supremacy regime.

artificial intelligence, machine learning, natural language, (15 more...)

arXiv.org Machine Learning

2504.0701

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Ukraine > Kyiv Oblast > Kyiv (0.04)
South America > Peru > Cusco Department > Cusco Province > Cusco (0.04)
(5 more...)

Genre: Research Report (0.64)

Technology:

Information Technology > Hardware (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.46)

Add feedback

Improving Mixed-Criticality Scheduling with Reinforcement Learning

El-Mahdy, Muhammad, Sakr, Nourhan, Carrasco, Rodrigo

arXiv.org Artificial IntelligenceApr-9-2025

This paper introduces a novel reinforcement learning (RL) approach to scheduling mixed-criticality (MC) systems on processors with varying speeds. Building upon the foundation laid by [1], we extend their work to address the non-preemptive scheduling problem, which is known to be NP-hard. By modeling this scheduling challenge as a Markov Decision Process (MDP), we develop an RL agent capable of generating near-optimal schedules for real-time MC systems. Our RL-based scheduler prioritizes high-critical tasks while maintaining overall system performance. Through extensive experiments, we demonstrate the scalability and effectiveness of our approach. The RL scheduler significantly improves task completion rates, achieving around 80% overall and 85% for high-criticality tasks across 100,000 instances of synthetic data and real data under varying system conditions. Moreover, under stable conditions without degradation, the scheduler achieves 94% overall task completion and 93% for high-criticality tasks. These results highlight the potential of RL-based schedulers in real-time and safety-critical applications, offering substantial improvements in handling complex and dynamic scheduling scenarios.

criticality level, machine learning, reinforcement learning, (15 more...)

arXiv.org Artificial Intelligence

2504.03994

Country:

Africa > Middle East > Egypt (0.15)
South America > Chile (0.15)
North America > United States (0.15)
(2 more...)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.71)

Add feedback

Impact of Price Inflation on Algorithmic Collusion Through Reinforcement Learning Agents

Tinoco, Sebastián, Abeliuk, Andrés, del Solar, Javier Ruiz

arXiv.org Artificial IntelligenceApr-9-2025

Algorithmic pricing is increasingly shaping market competition, raising concerns about its potential to compromise competitive dynamics. While prior work has shown that reinforcement learning (RL)-based pricing algorithms can lead to tacit collusion, less attention has been given to the role of macroeconomic factors in shaping these dynamics. This study examines the role of inflation in influencing algorithmic collusion within competitive markets. By incorporating inflation shocks into a RL-based pricing model, we analyze whether agents adapt their strategies to sustain supra-competitive profits. Our findings indicate that inflation reduces market competitiveness by fostering implicit coordination among agents, even without direct collusion. However, despite achieving sustained higher profitability, agents fail to develop robust punishment mechanisms to deter deviations from equilibrium strategies. The results suggest that inflation amplifies non-competitive dynamics in algorithmic pricing, emphasizing the need for regulatory oversight in markets where AI-driven pricing is prevalent.

artificial intelligence, machine learning, reinforcement learning, (14 more...)

arXiv.org Artificial Intelligence

2504.05335

Country:

Europe (0.68)
South America > Chile (0.15)

Genre: Research Report > New Finding (1.00)

Industry:

Banking & Finance > Economy (0.68)
Banking & Finance > Trading (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Constructing the Truth: Text Mining and Linguistic Networks in Public Hearings of Case 03 of the Special Jurisdiction for Peace (JEP)

Sosa, Juan, Urrego-López, Alejandro, Prieto, Cesar, Camargo-Díaz, Emma J.

arXiv.org Artificial IntelligenceApr-9-2025

Case 03 of the Special Jurisdiction for Peace (JEP), focused on the so-called false positives in Colombia, represents one of the most harrowing episodes of the Colombian armed conflict. This article proposes an innovative methodology based on natural language analysis and semantic co-occurrence models to explore, systematize, and visualize narrative patterns present in the public hearings of victims and appearing parties. By constructing skipgram networks and analyzing their modularity, the study identifies thematic clusters that reveal regional and procedural status differences, providing empirical evidence on dynamics of victimization, responsibility, and acknowledgment in this case. This computational approach contributes to the collective construction of both judicial and extrajudicial truth, offering replicable tools for other transitional justice cases. The work is grounded in the pillars of truth, justice, reparation, and non-repetition, proposing a critical and in-depth reading of contested memories.

artificial intelligence, data mining, natural language, (18 more...)

arXiv.org Artificial Intelligence

2504.04325

Country: South America > Colombia (0.68)

Genre: Research Report (0.64)

Industry:

Law (1.00)
Government > Military (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Data Science > Data Mining > Text Mining (0.40)

Add feedback

Bounds in Wasserstein Distance for Locally Stationary Functional Time Series

Tinio, Jan Nino G., Alaya, Mokhtar Z., Bouzebda, Salim

arXiv.org Machine LearningApr-8-2025

Functional time series (FTS) extend traditional methodologies to accommodate data observed as functions/curves. A significant challenge in FTS consists of accurately capturing the time-dependence structure, especially with the presence of time-varying covariates. When analyzing time series with time-varying statistical properties, locally stationary time series (LSTS) provide a robust framework that allows smooth changes in mean and variance over time. This work investigates Nadaraya-Watson (NW) estimation procedure for the conditional distribution of locally stationary functional time series (LSFTS), where the covariates reside in a semi-metric space endowed with a semi-metric. Under small ball probability and mixing condition, we establish convergence rates of NW estimator for LSFTS with respect to Wasserstein distance. The finite-sample performances of the model and the estimation method are illustrated through extensive numerical experiments both on functional simulated and real data.

artificial intelligence, machine learning, wasserstein distance, (14 more...)

arXiv.org Machine Learning

2504.06453

Country:

Europe > Netherlands > South Holland > Delft (0.04)
South America (0.04)
North America > United States > Rhode Island > Providence County > Providence (0.04)
(4 more...)

Genre: Research Report (1.00)

Industry: Banking & Finance (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Data Science (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.92)

Add feedback