Collaborating Authors

 Cotta, Leonardo


End-To-End Causal Effect Estimation from Unstructured Natural Language Data

arXiv.org Artificial Intelligence

Knowing the effect of an intervention is critical for human decision-making, but current approaches for causal effect estimation rely on manual data collection and structuring, regardless of the causal assumptions. This increases both the cost and time-to-completion for studies. We show how large, diverse observational text data can be mined with large language models (LLMs) to produce inexpensive causal effect estimates under appropriate causal assumptions. We introduce NATURAL, a novel family of causal effect estimators built with LLMs that operate over datasets of unstructured text. Our estimators use LLM conditional distributions (over variables of interest, given the text data) to assist in the computation of classical estimators of causal effect. We overcome a number of technical challenges to realize this idea, such as automating data curation and using LLMs to impute missing information. We prepare six (two synthetic and four real) observational datasets, paired with corresponding ground truth in the form of randomized trials, which we use to systematically evaluate each step of our pipeline. NATURAL estimators demonstrate remarkable performance, yielding causal effect estimates that fall within 3 percentage points of their ground truth counterparts, including on real-world Phase 3/4 clinical trials. Our results suggest that unstructured text data is a rich source of causal effect information, and NATURAL is a first step towards an automated pipeline to tap this resource.
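
As a rough illustration of the idea described in the abstract (not the authors' actual NATURAL pipeline), the sketch below plugs LLM-imputed quantities into a classical inverse-propensity-weighted estimator of the average treatment effect. The helper `llm_estimate` is hypothetical: it stands in for whatever procedure extracts the treatment indicator, outcome indicator, and propensity score from one free-text report.

```python
# Minimal sketch: inverse-propensity-weighted (IPW) average treatment effect
# computed from quantities an LLM imputes from raw text reports.
# `llm_estimate` is a hypothetical helper returning (t, y, e) for one report:
# the imputed treatment indicator, outcome indicator, and propensity P(T=1 | text).
from typing import Callable, List, Tuple

def ipw_ate_from_text(
    reports: List[str],
    llm_estimate: Callable[[str], Tuple[int, int, float]],
) -> float:
    treated_term, control_term = 0.0, 0.0
    n = len(reports)
    for text in reports:
        t, y, e = llm_estimate(text)
        e = min(max(e, 0.05), 0.95)             # clip propensities for stability
        treated_term += t * y / e               # weighted treated outcomes
        control_term += (1 - t) * y / (1 - e)   # weighted control outcomes
    return treated_term / n - control_term / n  # plug-in ATE estimate
```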


Out-Of-Context Prompting Boosts Fairness and Robustness in Large Language Model Predictions

arXiv.org Artificial Intelligence

Frontier Large Language Models (LLMs) are increasingly being deployed for high-stakes decision-making. At the same time, these models still consistently make predictions that contradict users' or society's expectations, e.g., hallucinating or discriminating. It is therefore important to develop test-time strategies that improve their trustworthiness. Inspired by prior work, we leverage causality as a tool to formally encode two aspects of trustworthiness in LLMs: fairness and robustness. Under this perspective, existing test-time solutions that explicitly instruct the model to be fair or robust implicitly depend on the LLM's causal reasoning capabilities. In this work, we explore the opposite approach. Instead of explicitly asking the LLM for trustworthiness, we design prompts that encode the underlying causal inference algorithm and will, by construction, result in more trustworthy predictions. Concretely, we propose out-of-context prompting as a test-time solution to encourage fairness and robustness in LLMs. Out-of-context prompting leverages the user's prior knowledge of the task's causal model to apply (random) counterfactual transformations and improve the model's trustworthiness. Empirically, we show that out-of-context prompting consistently improves the fairness and robustness of frontier LLMs across five different benchmark datasets without requiring additional data, finetuning or pre-training.
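
The snippet below is a minimal sketch of the counterfactual-transformation idea as stated in the abstract, not the paper's exact procedure. It assumes the simplest causal knowledge ("the protected attribute should not affect the label"), resamples that attribute at random in the prompt, and aggregates the answers; `ask_llm`, the prompt template, and the attribute values are illustrative placeholders.

```python
# Minimal sketch of prompting with random counterfactual transformations.
# `ask_llm` is a hypothetical function that sends a prompt to an LLM and
# returns a discrete decision; the attribute values are illustrative only.
import random
from collections import Counter
from typing import Callable, List

def out_of_context_predict(
    template: str,                 # e.g. "Applicant gender: {attr}. Profile: ... Approve?"
    attribute_values: List[str],   # values the protected/spurious attribute may take
    ask_llm: Callable[[str], str],
    n_samples: int = 8,
    seed: int = 0,
) -> str:
    rng = random.Random(seed)
    votes = Counter()
    for _ in range(n_samples):
        # Counterfactual transformation: replace the attribute with a random value,
        # so the prediction cannot systematically depend on its observed value.
        prompt = template.format(attr=rng.choice(attribute_values))
        votes[ask_llm(prompt)] += 1
    return votes.most_common(1)[0][0]   # majority vote over counterfactual prompts
```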


Probabilistic Invariant Learning with Randomized Linear Classifiers

arXiv.org Artificial Intelligence

Designing models that are both expressive and preserve known invariances of tasks is an increasingly hard problem. Existing solutions trade off invariance for computational or memory resources. In this work, we show how to leverage randomness to design models that are both expressive and invariant while using fewer resources. Inspired by randomized algorithms, our key insight is that accepting probabilistic notions of universal approximation and invariance can reduce our resource requirements. More specifically, we propose a class of binary classification models called Randomized Linear Classifiers (RLCs). We give parameter and sample-size conditions under which RLCs can, with high probability, approximate any (smooth) function while preserving invariance to compact group transformations. Leveraging this result, we design three RLCs that are provably probabilistically invariant for classification tasks over sets, graphs, and spherical data. We show how these models can achieve probabilistic invariance and universality using fewer resources than (deterministic) neural networks and their invariant counterparts. Finally, we empirically demonstrate the benefits of this new class of models on invariant tasks where deterministic invariant neural networks are known to struggle.
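
To make "randomized linear classifier" concrete, here is a toy instantiation for set-valued inputs, hedged as one plausible reading of the abstract rather than the paper's construction: the model samples a random linear classifier over a permutation-invariant (sum-pooled) random feature map, so each prediction is itself a random variable while remaining invariant to the ordering of set elements. The function name and feature choice are mine, not the paper's.

```python
# Toy sketch of a randomized linear classifier (RLC) over sets.
# Sum pooling makes the feature map permutation invariant, so every sampled
# linear classifier, and hence the random prediction, is invariant as well.
# Illustrative only: in the paper the weight distribution is designed/learned.
import numpy as np

def rlc_predict(x_set: np.ndarray, dim_features: int = 64, seed: int | None = None) -> int:
    """x_set: array of shape (set_size, d). Returns a random label in {0, 1}."""
    rng = np.random.default_rng(seed)
    d = x_set.shape[1]
    pooled = x_set.sum(axis=0)                    # invariant to element order
    proj = rng.normal(size=(d, dim_features))
    feats = np.cos(pooled @ proj)                 # random Fourier-style features
    # Sample one linear classifier; the output is a random variable over calls.
    w = rng.normal(size=dim_features)
    b = rng.normal()
    return int(feats @ w + b > 0)
```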


Causal Lifting and Link Prediction

arXiv.org Artificial Intelligence

Spearman's work started a revolution that gave us, among other things, matrix and tensor factorizations, Principal Component Analysis (PCA), and Independent Component Analysis (ICA). At the same time, The Abilities of Man also warned us about interpreting the factors of subject $i$ as innate rather than acquired abilities. For instance, regarding Woolley and Fischer's observation that "boys are enormously superior [to girls] at [...] spatial relations" (i.e., in how objects relate in space) [76], Spearman warns that "evidence of this difference being really innate [rather than acquired] is still dubious". Today, we can describe Spearman's warning as a contest between two competing causal hypotheses that describe link formation between young children and their abilities: a path-dependent hypothesis, where past links influence future links [46], and an innate-factors hypothesis, where link formation is just a manifestation of latent innate factors [75]. In Woolley and Fischer's experiments, both hypotheses are able to describe the data: either boys are innately better than girls at spatial reasoning (innate-factors hypothesis), or boys in 1914 just happened to have had more playtime with spatial tasks than girls, with each task further improving their spatial abilities (path-dependent hypothesis).


Reconstruction for Powerful Graph Representations

arXiv.org Artificial Intelligence

Graph neural networks (GNNs) have limited expressive power, failing to represent many graph classes correctly. While more expressive graph representation learning (GRL) alternatives can distinguish some of these classes, they are significantly harder to implement, may not scale well, and have not been shown to outperform well-tuned GNNs in real-world tasks. Thus, devising simple, scalable, and expressive GRL architectures that also achieve real-world improvements remains an open challenge. In this work, we show the extent to which graph reconstruction -- reconstructing a graph from its subgraphs -- can mitigate the theoretical and practical problems currently faced by GRL architectures. First, we leverage graph reconstruction to build two new classes of expressive graph representations. Second, we show how graph reconstruction boosts the expressive power of any GNN architecture while being a (provably) powerful inductive bias for invariances to vertex removals. Empirically, we show how reconstruction can boost a GNN's expressive power -- while maintaining its invariance to permutations of the vertices -- by solving seven graph property tasks not solvable by the original GNN. Further, we demonstrate how it boosts state-of-the-art GNNs' performance across nine real-world benchmark datasets.
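
A minimal sketch of the reconstruction idea as described above (not the paper's exact architecture): embed every vertex-deleted subgraph of a graph with some GNN encoder and pool the resulting multiset, which stays invariant to vertex permutations of the original graph. Here `gnn_embed` is a placeholder for any graph encoder returning a fixed-size vector.

```python
# Minimal sketch of a reconstruction-based graph representation:
# pool a GNN's embeddings of all vertex-deleted subgraphs (the graph's "deck").
# `gnn_embed` is a hypothetical encoder mapping a networkx graph to a vector.
from typing import Callable
import networkx as nx
import numpy as np

def reconstruction_embedding(
    g: nx.Graph,
    gnn_embed: Callable[[nx.Graph], np.ndarray],
) -> np.ndarray:
    # One embedding per vertex-deleted subgraph; summing the multiset keeps the
    # representation invariant to relabelings of the original graph's vertices.
    card_embeddings = [
        gnn_embed(nx.Graph(g.subgraph(set(g.nodes) - {v})))
        for v in g.nodes
    ]
    return np.sum(card_embeddings, axis=0)
```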


Unsupervised Joint $k$-node Graph Representations with Compositional Energy-Based Models

arXiv.org Artificial Intelligence

Existing Graph Neural Network (GNN) methods that learn inductive unsupervised graph representations focus on learning node and edge representations by predicting observed edges in the graph. Although such approaches have shown advances in downstream node classification tasks, they are ineffective in jointly representing larger $k$-node sets, $k{>}2$. We propose MHM-GNN, an inductive unsupervised graph representation approach that combines joint $k$-node representations with energy-based models (hypergraph Markov networks) and GNNs. To address the intractability of the loss that arises from this combination, we endow our optimization with a loss upper bound using a finite-sample unbiased Markov Chain Monte Carlo estimator. Our experiments show that the unsupervised representations learned by MHM-GNN outperform those of existing approaches from the literature.
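
To give a flavor of what an energy over a joint $k$-node set might look like, the schematic below scores a node set by pooling per-node GNN embeddings and passing them through a small MLP. This is an illustrative sketch under my own assumptions, not the published MHM-GNN architecture, and the MCMC-based loss bound from the abstract is omitted.

```python
# Schematic energy function for a k-node set, in the spirit of an
# energy-based model over GNN node embeddings. Illustrative sketch only;
# not the published MHM-GNN architecture or loss.
import torch
import torch.nn as nn

class KNodeEnergy(nn.Module):
    def __init__(self, node_dim: int, hidden: int = 64):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(node_dim, hidden), nn.ReLU(), nn.Linear(hidden, 1)
        )

    def forward(self, node_embeddings: torch.Tensor, node_set: torch.Tensor) -> torch.Tensor:
        # node_embeddings: (num_nodes, node_dim) output of any GNN encoder.
        # node_set: (k,) long tensor of indices of the k-node set being scored.
        pooled = node_embeddings[node_set].sum(dim=0)   # permutation-invariant pooling
        return self.mlp(pooled).squeeze(-1)             # scalar energy of the set
```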


Graph Pattern Mining and Learning through User-defined Relations (Extended Version)

arXiv.org Machine Learning

In this work we propose R-GPM, a parallel computing framework for graph pattern mining (GPM) through a user-defined subgraph relation. More specifically, we enable the computation of statistics of patterns through their subgraph classes, generalizing traditional GPM methods. R-GPM provides efficient estimators for these statistics by employing an MCMC sampling algorithm combined with several optimizations. We provide both theoretical guarantees and empirical evaluations of our estimators in application scenarios such as stochastic optimization of deep high-order graph neural network models and pattern (motif) counting. We also propose and evaluate optimizations that improve our estimators' accuracy while reducing their computational cost by up to three orders of magnitude. Finally, we show that R-GPM is scalable, providing near-linear speedups on 44 cores in all of our tests.
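
As a hedged illustration of the MCMC flavor mentioned above (not R-GPM's actual sampler, weighting scheme, or user-defined relations), the toy walk below moves between connected $k$-node induced subgraphs by swapping one vertex at a time and tallies the pattern of each visited state.

```python
# Toy MCMC-style random walk over connected k-node induced subgraphs, tallying
# how often each pattern (up to isomorphism, via a WL hash) is visited.
# Illustrative only; assumes g is connected with at least k nodes.
import random
from collections import Counter
import networkx as nx

def pattern_walk(g: nx.Graph, k: int = 3, steps: int = 10_000, seed: int = 0) -> Counter:
    rng = random.Random(seed)
    # Grow an arbitrary connected k-node starting state greedily.
    state = [next(iter(g.nodes))]
    while len(state) < k:
        frontier = {u for v in state for u in g.neighbors(v)} - set(state)
        state.append(rng.choice(list(frontier)))
    counts = Counter()
    for _ in range(steps):
        # Propose swapping one vertex for a neighbor of the remaining vertices.
        out_v = rng.choice(state)
        kept = [v for v in state if v != out_v]
        frontier = {u for v in kept for u in g.neighbors(v)} - set(kept)
        if frontier:
            candidate = kept + [rng.choice(list(frontier))]
            if nx.is_connected(g.subgraph(candidate)):
                state = candidate
        counts[nx.weisfeiler_lehman_graph_hash(g.subgraph(state))] += 1
    return counts
```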