AITopics | South America

Collaborating Authors

South America

Online Machine Learning Techniques for Coq: A Comparison

Zhang, Liao, Blaauwbroek, Lasse, Piotrowski, Bartosz, Černý, Prokop, Kaliszyk, Cezary, Urban, Josef

arXiv.org Artificial IntelligenceApr-12-2021

We present a comparison of several online machine learning techniques for tactical learning and proving in the Coq proof assistant. This work builds on top of Tactician, a plugin for Coq that learns from proofs written by the user to synthesize new proofs. This learning happens in an online manner -- meaning that Tactician's machine learning model is updated immediately every time the user performs a step in an interactive proof. This has important advantages compared to the more studied offline learning systems: (1) it provides the user with a seamless, interactive experience with Tactician and, (2) it takes advantage of locality of proof similarity, which means that proofs similar to the current proof are likely to be found close by. We implement two online methods, namely approximate $k$-nearest neighbors based on locality sensitive hashing forests and random decision forests. Additionally, we conduct experiments with gradient boosted trees in an offline setting using XGBoost. We compare the relative performance of Tactician using these three learning methods on Coq's standard library.

proof state, random forest, tactician, (14 more...)

arXiv.org Artificial Intelligence

2104.05207

Country:

Europe > Italy (0.04)
South America > Brazil > Rio Grande do Norte > Natal (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)
(8 more...)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)

Add feedback

The World as a Graph: Improving El Ni\~no Forecasts with Graph Neural Networks

Cachay, Salva Rühling, Erickson, Emma, Bucker, Arthur Fender C., Pokropek, Ernest, Potosnak, Willa, Bire, Suyash, Osei, Salomey, Lütjens, Björn

arXiv.org Machine LearningApr-11-2021

Deep learning-based models have recently outperformed state-of-the-art seasonal forecasting models, such as for predicting El Ni\~no-Southern Oscillation (ENSO). However, current deep learning models are based on convolutional neural networks which are difficult to interpret and can fail to model large-scale atmospheric patterns. In comparison, graph neural networks (GNNs) are capable of modeling large-scale spatial dependencies and are more interpretable due to the explicit modeling of information flow through edge connections. We propose the first application of graph neural networks to seasonal forecasting. We design a novel graph connectivity learning module that enables our GNN model to learn large-scale spatial interactions jointly with the actual ENSO forecasting task. Our model, \graphino, outperforms state-of-the-art deep learning-based models for forecasts up to six months ahead. Additionally, we show that our model is more interpretable as it learns sensible connectivity structures that correlate with the ENSO anomaly pattern.

forecasting, neural network, node, (14 more...)

arXiv.org Machine Learning

2104.05089

Country:

Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
South America > Brazil > São Paulo (0.04)
North America > United States > Illinois (0.04)
(12 more...)

Genre: Research Report (0.64)

Industry:

Health & Medicine (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Education > Educational Setting (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Individual Explanations in Machine Learning Models: A Case Study on Poverty Estimation

Carrillo, Alfredo, Cantú, Luis F., Tejerina, Luis, Noriega, Alejandro

arXiv.org Artificial IntelligenceApr-11-2021

A. Relevance of Model Explanations in Real-World Contexts Complex estimation and decision-making tasks have traditionally been analyzed and judged by human experts. Hence, decisions have typically been able to be complemented with human-interpretable justifications, when needed, as experts can normally explain the line-of-thought that led to their own decision-making. However, in the past two decades, algorithmic decision-making has spread increasingly to many relevant societal contexts. Despite the notable enthusiasm for the potential benefit that this type of technology can bring, the underlying methods used are typically not inherently transparent, in the sense that they do not readily provide human-interpretable justifications for their decisions [1]. Moreover, in recent years there is a trend where the most successful algorithms, particularly in complex tasks like machine vision and natural language processing, tend to rely on highly complex models, which has led to a further increase in tension between accuracy and interpretability [2]. Relevant societal contexts where algorithmic decision systems have gained substantial traction include medical diagnosis and treatment [3], counter-terrorism [4], criminal justice [5], and risk assessments for credits and insurance [6]. In such impactful contexts, there is a legitimate need for providing human-interpretable explanations along with the estimations and decisions made. Indeed, lack of interpretability has become a barrier to the adoption of machine learning-based systems in many institutions and companies. Hence the value of complementing ML models with human-interpretable accounts of the statistical rationals behind their estimations, in a way that human decision-makers can more easily understand machine estimations, and even integrate their statistical rationals with qualitative information and human expert judgements.

explanation, household, interpretability, (16 more...)

arXiv.org Artificial Intelligence

2104.04148

Country:

South America (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Costa Rica (0.04)
North America > Central America (0.04)

Genre: Research Report (0.50)

Industry:

Information Technology > Security & Privacy (0.48)
Banking & Finance (0.46)
Law Enforcement & Public Safety (0.34)
Law (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.46)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (0.46)

Add feedback

The Proper Use of Google Trends in Forecasting Models

Medeiros, Marcelo C., Pires, Henrique F.

arXiv.org Machine LearningApr-10-2021

It is widely known that Google Trends have become one of the most popular free tools used by forecasters both in academics and in the private and public sectors. There are many papers, from several different fields, concluding that Google Trends improve forecasts' accuracy. However, what seems to be widely unknown, is that each sample of Google search data is different from the other, even if you set the same search term, data and location. This means that it is possible to find arbitrary conclusions merely by chance. This paper aims to show why and when it can become a problem and how to overcome this obstacle.

different sample, gdp growth, google trend, (12 more...)

arXiv.org Machine Learning

2104.03065

Country:

North America > United States (0.30)
South America > Brazil > Rio de Janeiro > Rio de Janeiro (0.04)
Oceania > New Zealand (0.04)
(2 more...)

Genre: Research Report (0.64)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.96)
Banking & Finance (0.72)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Information Management > Search (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.48)

Add feedback

Auto-weighted Multi-view Feature Selection with Graph Optimization

Wang, Qi, Jiang, Xu, Chen, Mulin, Li, Xuelong

arXiv.org Artificial IntelligenceApr-10-2021

In this paper, we focus on the unsupervised multi-view feature selection which tries to handle high dimensional data in the field of multi-view learning. Although some graph-based methods have achieved satisfactory performance, they ignore the underlying data structure across different views. Besides, their pre-defined laplacian graphs are sensitive to the noises in the original data space, and fail to get the optimal neighbor assignment. To address the above problems, we propose a novel unsupervised multi-view feature selection model based on graph learning, and the contributions are threefold: (1) during the feature selection procedure, the consensus similarity graph shared by different views is learned. Therefore, the proposed model can reveal the data relationship from the feature subset. (2) a reasonable rank constraint is added to optimize the similarity matrix to obtain more accurate information; (3) an auto-weighted framework is presented to assign view weights adaptively, and an effective alternative iterative algorithm is proposed to optimize the problem. Experiments on various datasets demonstrate the superiority of the proposed method compared with the state-of-the-art methods.

dataset, feature selection, selection, (14 more...)

arXiv.org Artificial Intelligence

2104.04906

Country:

North America > United States (0.14)
Asia > China > Shaanxi Province > Xi'an (0.05)
South America > Paraguay > Asunción > Asunción (0.04)
(3 more...)

Genre: Research Report > Promising Solution (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)

Add feedback

Assessment of the influence of features on a classification problem: an application to COVID-19 patients

Davila-Pena, L., García-Jurado, Ignacio, Casas-Méndez, B.

arXiv.org Machine LearningApr-9-2021

This paper deals with an important subject in classification problems addressed by machine learning techniques: the evaluation of the influence of each of the features on the classification of individuals. Specifically, a measure of that influence is introduced using the Shapley value of cooperative games. In addition, an axiomatic characterisation of the proposed measure is provided based on properties of efficiency and balanced contributions. Furthermore, some experiments have been designed in order to validate the appropriate performance of such measure. Finally, the methodology introduced is applied to a sample of COVID-19 patients to study the influence of certain demographic or risk factors on various events of interest related to the evolution of the disease.

classification, classification and influence, different level, (17 more...)

arXiv.org Machine Learning

2104.14958

Country:

South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Europe > Spain > Galicia > A Coruña Province > Santiago de Compostela (0.04)
Oceania > New Zealand > North Island > Waikato (0.04)
Europe > Spain > Galicia > A Coruña Province > A Coruña (0.04)

Genre: Research Report (0.64)

Industry: Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Probabilistic Box Embeddings for Uncertain Knowledge Graph Reasoning

Chen, Xuelu, Boratko, Michael, Chen, Muhao, Dasgupta, Shib Sankar, Li, Xiang Lorraine, McCallum, Andrew

arXiv.org Artificial IntelligenceApr-9-2021

Knowledge bases often consist of facts which are harvested from a variety of sources, many of which are noisy and some of which conflict, resulting in a level of uncertainty for each triple. Knowledge bases are also often incomplete, prompting the use of embedding methods to generalize from known facts, however, existing embedding methods only model triple-level uncertainty, and reasoning results lack global consistency. To address these shortcomings, we propose BEUrRE, a novel uncertain knowledge graph embedding method with calibrated probabilistic semantics. BEUrRE models each entity as a box (i.e. axis-aligned hyperrectangle) and relations between two entities as affine transforms on the head and tail entity boxes. The geometry of the boxes allows for efficient calculation of intersections and volumes, endowing the model with calibrated probabilistic semantics and facilitating the incorporation of relational constraints. Extensive experiments on two benchmark datasets show that BEUrRE consistently outperforms baselines on confidence prediction and fact ranking due to its probabilistic calibration and ability to capture high-order dependencies among facts.

beurre, constraint, relation, (16 more...)

arXiv.org Artificial Intelligence

2104.04597

Country:

North America > United States > California (0.14)
South America > Argentina > Pampas > Buenos Aires F.D. > Buenos Aires (0.04)
Oceania > Australia > Victoria > Melbourne (0.04)
(9 more...)

Genre: Research Report (0.64)

Industry: Automobiles & Trucks > Manufacturer (0.94)

Technology:

Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.88)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (0.62)

Add feedback

Incorporating External Knowledge to Enhance Tabular Reasoning

Neeraja, J., Gupta, Vivek, Srikumar, Vivek

arXiv.org Artificial IntelligenceApr-9-2021

Reasoning about tabular information presents unique challenges to modern NLP approaches which largely rely on pre-trained contextualized embeddings of text. In this paper, we study these challenges through the problem of tabular natural language inference. We propose easy and effective modifications to how information is presented to a model for this task. We show via systematic experiments that these strategies substantially improve tabular inference performance.

computational linguistic, fluorine, modification, (13 more...)

arXiv.org Artificial Intelligence

2104.04243

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > Utah (0.05)
South America > Peru > Cusco Department > Cusco Province > Cusco (0.04)
(9 more...)

Genre: Research Report (0.64)

Industry:

Media > Film (0.68)
Leisure & Entertainment (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.46)

Add feedback

XFORMAL: A Benchmark for Multilingual Formality Style Transfer

Briakou, Eleftheria, Lu, Di, Zhang, Ke, Tetreault, Joel

arXiv.org Artificial IntelligenceApr-8-2021

We take the first step towards multilingual style transfer by creating and releasing XFORMAL, a benchmark of multiple formal reformulations of informal text in Brazilian Portuguese, French, and Italian. Results on XFORMAL suggest that state-of-the-art style transfer approaches perform close to simple baselines, indicating that style transfer is even more challenging when moving multilingual.

computational linguistic, proceedings, rewrite, (14 more...)

arXiv.org Artificial Intelligence

2104.04108

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Asia > China > Hong Kong (0.05)
Europe > Belgium > Brussels-Capital Region > Brussels (0.04)
(23 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
(2 more...)

Add feedback

On tuning consistent annealed sampling for denoising score matching

Serrà, Joan, Pascual, Santiago, Pons, Jordi

arXiv.org Artificial IntelligenceApr-8-2021

Score-based generative models provide state-of-the-art quality for image and audio synthesis. Sampling from these models is performed iteratively, typically employing a discretized series of noise levels and a predefined scheme. In this note, we first overview three common sampling schemes for models trained with denoising score matching. Next, we focus on one of them, consistent annealed sampling, and study its hyper-parameter boundaries. We then highlight a possible formulation of such hyper-parameter that explicitly considers those boundaries and facilitates tuning when using few or a variable number of steps. Finally, we highlight some connections of the formulation with other sampling schemes.

formulation, score matching, synthesis, (13 more...)

arXiv.org Artificial Intelligence

2104.03725

Country:

South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback