AITopics | South America

Collaborating Authors

South America

Solving the Class Imbalance Problem Using a Counterfactual Method for Data Augmentation

arXiv.org Artificial IntelligenceNov-5-2021

Learning from class imbalanced datasets poses challenges for many machine learning algorithms. Many real-world domains are, by definition, class imbalanced by virtue of having a majority class that naturally has many more instances than its minority class (e.g. genuine bank transactions occur much more often than fraudulent ones). Many methods have been proposed to solve the class imbalance problem, among the most popular being oversampling techniques (such as SMOTE). These methods generate synthetic instances in the minority class, to balance the dataset, performing data augmentations that improve the performance of predictive machine learning (ML) models. In this paper we advance a novel data augmentation method (adapted from eXplainable AI), that generates synthetic, counterfactual instances in the minority class. Unlike other oversampling techniques, this method adaptively combines exist-ing instances from the dataset, using actual feature-values rather than interpolating values between instances. Several experiments using four different classifiers and 25 datasets are reported, which show that this Counterfactual Augmentation method (CFA) generates useful synthetic data points in the minority class. The experiments also show that CFA is competitive with many other oversampling methods many of which are variants of SMOTE. The basis for CFAs performance is discussed, along with the conditions under which it is likely to perform better or worse in future tests.

counterfactual, dataset, minority class, (13 more...)

arXiv.org Artificial Intelligence

2111.03516

Country:

Europe > Ireland (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
Europe > Germany (0.04)
(4 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine (1.00)
Food & Agriculture > Agriculture (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Case-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
(2 more...)

Add feedback

Tackle racism in AI, BLM co-founder tells tech bosses

#artificialintelligenceNov-4-2021, 13:04:34 GMT

US activist raises concerns over racial bias in AI Urges software developers to listen more to Black people Wrongful arrest in US caused by faulty facial recognition Lisbon playing host to Europe's largest tech event LISBON, Nov 3 (Reuters) - (This Nov. 3 story corrects to change BLM co-founder first name to Ayo from Opal at her request) As concerns grow over racial bias in artificial intelligence, Black Lives Matter (BLM) co-founder Ayo Tometi urged the tech sector to act fast against perpetuating racism in systems such as facial recognition. Artificial intelligence is transforming the world and can be applied in diverse sectors, from improving the early detection of diseases to sorting out data and solving complex problems, but there are also concerns around it. "A lot of the algorithms, a lot of the data is racist," U.S. activist Tometi, who co-founded BLM in 2013, told Reuters on the sidelines of Lisbon's Web Summit. "We need tech to truly understand every way it (racism) shows up in the technologies they are developing," she said. The tech industry has faced a reckoning over the past few years over the ethics of AI technologies, with critics saying such systems could compromise privacy, target marginalised groups and normalise intrusive surveillance.

blm co-founder tell tech boss, racism, tometi, (10 more...)

#artificialintelligence

Country:

Europe > Portugal > Lisbon > Lisbon (0.70)
North America > United States (0.18)
South America > Brazil (0.05)
Europe > United Kingdom (0.05)

Industry: Law > Civil Rights & Constitutional Law (1.00)

Technology: Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)

Add feedback

Causal inference with imperfect instrumental variables

Miklin, Nikolai, Gachechiladze, Mariami, Moreno, George, Chaves, Rafael

arXiv.org Machine LearningNov-4-2021

Instrumental variables allow for quantification of cause and effect relationships even in the absence of interventions. To achieve this, a number of causal assumptions must be met, the most important of which is the independence assumption, which states that the instrument and any confounding factor must be independent. However, if this independence condition is not met, can we still work with imperfect instrumental variables? Imperfect instruments can manifest themselves by violations of the instrumental inequalities that constrain the set of correlations in the scenario. In this paper, we establish a quantitative relationship between such violations of instrumental inequalities and the minimal amount of measurement dependence required to explain them. As a result, we provide adapted inequalities that are valid in the presence of a relaxed measurement dependence assumption in the instrumental scenario. This allows for the adaptation of existing and new lower bounds on the average causal effect for instrumental scenarios with binary outcomes. Finally, we discuss our findings in the context of quantum mechanics.

inequality, instrumental scenario, violation, (15 more...)

arXiv.org Machine Learning

2111.03029

Country:

South America > Brazil > Rio Grande do Norte > Natal (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Poland > Pomerania Province > Gdańsk (0.04)
(4 more...)

Genre: Research Report > New Finding (0.48)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)

Add feedback

Benchmarking Multimodal AutoML for Tabular Data with Text Fields

Shi, Xingjian, Mueller, Jonas, Erickson, Nick, Li, Mu, Smola, Alexander J.

arXiv.org Machine LearningNov-4-2021

We consider the use of automated supervised learning systems for data tables that not only contain numeric/categorical columns, but one or more text fields as well. Here we assemble 18 multimodal data tables that each contain some text fields and stem from a real business application. Our publicly-available benchmark enables researchers to comprehensively evaluate their own methods for supervised learning with numeric, categorical, and text features. To ensure that any single modeling strategy which performs well over all 18 datasets will serve as a practical foundation for multimodal text/tabular AutoML, the diverse datasets in our benchmark vary greatly in: sample size, problem types (a mix of classification and regression tasks), number of features (with the number of text columns ranging from 1 to 28 between datasets), as well as how the predictive signal is decomposed between text vs. numeric/categorical features (and predictive interactions thereof). Over this benchmark, we evaluate various straightforward pipelines to model such data, including standard two-stage approaches where NLP is used to featurize the text such that AutoML for tabular data can then be applied. Compared with human data science teams, the fully automated methodology that performed best on our benchmark (stack ensembling a multimodal Transformer with various tree models) also manages to rank 1st place when fit to the raw text/tabular data in two MachineHack prediction competitions and 2nd place (out of 2380 teams) in Kaggle's Mercari Price Suggestion Challenge.

benchmark, dataset, text field, (15 more...)

arXiv.org Machine Learning

2111.02705

Country:

North America > United States > California (0.04)
Asia > India (0.04)
Asia > China (0.04)
(2 more...)

Genre: Research Report > New Finding (0.92)

Industry: Information Technology > Software (0.34)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Adversarial Attacks on Knowledge Graph Embeddings via Instance Attribution Methods

Bhardwaj, Peru, Kelleher, John, Costabello, Luca, O'Sullivan, Declan

arXiv.org Artificial IntelligenceNov-4-2021

Despite the widespread use of Knowledge Graph Embeddings (KGE), little is known about the security vulnerabilities that might disrupt their intended behaviour. We study data poisoning attacks against KGE models for link prediction. These attacks craft adversarial additions or deletions at training time to cause model failure at test time. To select adversarial deletions, we propose to use the model-agnostic instance attribution methods from Interpretable Machine Learning, which identify the training instances that are most influential to a neural model's predictions on test instances. We use these influential triples as adversarial deletions. We further propose a heuristic method to replace one of the two entities in each influential triple to generate adversarial additions. Our experiments show that the proposed strategies outperform the state-of-art data poisoning attacks on KGE models and improve the MRR degradation due to the attacks by up to 62% over the baselines.

kge model, prediction, target triple, (15 more...)

arXiv.org Artificial Intelligence

2111.0312

Country:

Europe > Ireland > Leinster > County Dublin > Dublin (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
South America > Peru (0.04)
(6 more...)

Genre: Research Report (1.00)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (0.62)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.54)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

A new step for computing

Vasques, Xavier

arXiv.org Artificial IntelligenceNov-4-2021

The data center of tomorrow is a data center made up of heterogeneous systems, which will run heterogeneous workloads. The systems will be located as close as possible to the data. Heterogeneous systems will be equipped with binary, biological inspired and quantum accelerators. These architectures will be the foundations to address challenges. Like an orchestra conductor, the hybrid cloud will make it possible to set these systems to music thanks to a layer of security and intelligent automation.

aujourd, ordinateur quantique, quantique, (15 more...)

arXiv.org Artificial Intelligence

2108.03997

Country:

Europe > France (0.04)
North America > United States > New York (0.04)
North America > United States > Massachusetts (0.04)
(6 more...)

Genre: Research Report (0.40)

Industry:

Information Technology > Services (0.73)
Media > Music (0.53)
Information Technology > Security & Privacy (0.48)

Technology:

Information Technology > Hardware (1.00)
Information Technology > Cloud Computing (0.73)
Information Technology > Information Management (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Learning suction graspability considering grasp quality and robot reachability for bin-picking

Jiang, Ping, Oaki, Junji, Ishihara, Yoshiyuki, Ooga, Junichiro, Han, Haifeng, Sugahara, Atsushi, Tokura, Seiji, Eto, Haruna, Komoda, Kazuma, Ogawa, Akihito

arXiv.org Artificial IntelligenceNov-3-2021

Deep learning has been widely used for inferring robust grasps. Although human-labeled RGB-D datasets were initially used to learn grasp configurations, preparation of this kind of large dataset is expensive. To address this problem, images were generated by a physical simulator, and a physically inspired model (e.g., a contact model between a suction vacuum cup and object) was used as a grasp quality evaluation metric to annotate the synthesized images. However, this kind of contact model is complicated and requires parameter identification by experiments to ensure real world performance. In addition, previous studies have not considered manipulator reachability such as when a grasp configuration with high grasp quality is unable to reach the target due to collisions or the physical limitations of the robot. In this study, we propose an intuitive geometric analytic-based grasp quality evaluation metric. We further incorporate a reachability evaluation metric. We annotate the pixel-wise grasp quality and reachability by the proposed evaluation metric on synthesized images in a simulator to train an auto-encoder--decoder called suction graspability U-Net++ (SG-U-Net++). Experiment results show that our intuitive grasp quality evaluation metric is competitive with a physically-inspired metric. Learning the reachability helps to reduce motion planning computation time by removing obviously unreachable candidates. The system achieves an overall picking speed of 560 PPH (pieces per hour).

artificial intelligence, grasp quality, machine learning, (14 more...)

arXiv.org Artificial Intelligence

doi: 10.3389/fnbot.2022.806898

2111.02571

Country:

South America > Uruguay > Artigas > Artigas (0.04)
Asia > Japan (0.04)
Asia > China > Guangxi Province > Nanning (0.04)

Genre: Research Report > New Finding (0.86)

Industry: Information Technology (0.86)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.89)

Add feedback

Multi-task Learning of Order-Consistent Causal Graphs

Chen, Xinshi, Sun, Haoran, Ellington, Caleb, Xing, Eric, Song, Le

arXiv.org Machine LearningNov-3-2021

We consider the problem of discovering $K$ related Gaussian directed acyclic graphs (DAGs), where the involved graph structures share a consistent causal order and sparse unions of supports. Under the multi-task learning setting, we propose a $l_1/l_2$-regularized maximum likelihood estimator (MLE) for learning $K$ linear structural equation models. We theoretically show that the joint estimator, by leveraging data across related tasks, can achieve a better sample complexity for recovering the causal order (or topological order) than separate estimations. Moreover, the joint estimator is able to recover non-identifiable DAGs, by estimating them together with some identifiable DAGs. Lastly, our analysis also shows the consistency of union support recovery of the structures. To allow practical implementation, we design a continuous optimization problem whose optimizer is the same as the joint estimator and can be approximated efficiently by an iterative algorithm. We validate the theoretical analysis and the effectiveness of the joint estimator in experiments.

estimation, probability, theorem 3, (16 more...)

arXiv.org Machine Learning

2111.02545

Country:

South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
Asia > Middle East > Jordan (0.04)
Asia > Japan > Kyūshū & Okinawa > Kyūshū > Nagasaki Prefecture > Nagasaki (0.04)

Genre: Research Report (0.81)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.92)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.88)

Add feedback

Improving Peer Assessment with Graph Convolutional Networks

Namanloo, Alireza A., Thorpe, Julie, Salehi-Abari, Amirali

arXiv.org Artificial IntelligenceNov-3-2021

Peer assessment systems are emerging in many social and multi-agent settings, such as peer grading in large (online) classes, peer review in conferences, peer art evaluation, etc. However, peer assessments might not be as accurate as expert evaluations, thus rendering these systems unreliable. The reliability of peer assessment systems is influenced by various factors such as assessment ability of peers, their strategic assessment behaviors, and the peer assessment setup (e.g., peer evaluating group work or individual work of others). In this work, we first model peer assessment as multi-relational weighted networks that can express a variety of peer assessment setups, plus capture conflicts of interest and strategic behaviors. Leveraging our peer assessment network model, we introduce a graph convolutional network which can learn assessment patterns and user behaviors to more accurately predict expert evaluations. Our extensive experiments on real and synthetic datasets demonstrate the efficacy of our proposed approach, which outperforms existing peer assessment methods.

assessment, dataset, gcn-soan, (15 more...)

arXiv.org Artificial Intelligence

2111.04466

Country:

North America > Canada > Ontario > Durham Region > Oshawa (0.14)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.14)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(21 more...)

Genre:

Instructional Material (0.87)
Research Report > New Finding (0.68)

Industry:

Education > Educational Setting > Online (1.00)
Education > Educational Technology > Educational Software > Computer Based Training (0.34)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Proximal Policy Optimization with Continuous Bounded Action Space via the Beta Distribution

Petrazzini, Irving G. B., Antonelo, Eric A.

arXiv.org Artificial IntelligenceNov-3-2021

Reinforcement learning methods for continuous control tasks have evolved in recent years generating a family of policy gradient methods that rely primarily on a Gaussian distribution for modeling a stochastic policy. However, the Gaussian distribution has an infinite support, whereas real world applications usually have a bounded action space. This dissonance causes an estimation bias that can be eliminated if the Beta distribution is used for the policy instead, as it presents a finite support. In this work, we investigate how this Beta policy performs when it is trained by the Proximal Policy Optimization (PPO) algorithm on two continuous control tasks from OpenAI gym. For both tasks, the Beta policy is superior to the Gaussian policy in terms of agent's final expected reward, also showing more stability and faster convergence of the training process. For the CarRacing environment with high-dimensional image input, the agent's success rate was improved by 63% over the Gaussian policy.

action space, agent, beta distribution, (14 more...)

arXiv.org Artificial Intelligence

2111.02202

Country:

South America > Brazil > Santa Catarina > Florianópolis (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.82)

Industry: Leisure & Entertainment > Games (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.90)

Add feedback