AITopics | South America

Collaborating Authors

South America

Quota-based debiasing can decrease representation of already underrepresented groups

Smirnov, Ivan, Lemmerich, Florian, Strohmaier, Markus

arXiv.org Artificial IntelligenceJun-13-2020

Many important decisions in societies such as school admissions, hiring, or elections are based on the selection of top-ranking individuals from a larger pool of candidates. This process is often subject to biases, which typically manifest as an under-representation of certain groups among the selected or accepted individuals. The most common approach to this issue is debiasing, for example via the introduction of quotas that ensure proportional representation of groups with respect to a certain, often binary attribute. Cases include quotas for women on corporate boards or ethnic quotas in elections. This, however, has the potential to induce changes in representation with respect to other attributes. For the case of two correlated binary attributes we show that quota-based debiasing based on a single attribute can worsen the representation of already underrepresented groups and decrease overall fairness of selection. We use several data sets from a broad range of domains from recidivism risk assessments to scientific citations to assess this effect in real-world settings. Our results demonstrate the importance of including all relevant attributes in debiasing procedures and that more efforts need to be put into eliminating the root causes of inequalities as purely numerical solutions such as quota-based debiasing might lead to unintended consequences.

artificial intelligence, representation, selection, (18 more...)

arXiv.org Artificial Intelligence

2006.07647

Country:

North America > United States > Illinois > Cook County > Chicago (0.05)
South America > Brazil (0.04)
North America > United States > New Jersey > Mercer County > Princeton (0.04)
(6 more...)

Genre: Research Report > New Finding (0.54)

Industry:

Law (1.00)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.93)
Education > Educational Setting > Higher Education (0.46)
Education > Operations > Student Enrollment (0.34)

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

High-contrast "gaudy" images improve the training of deep neural network models of visual cortex

Cowley, Benjamin R., Pillow, Jonathan W.

arXiv.org Machine LearningJun-13-2020

A key challenge in understanding the sensory transformations of the visual system is to obtain a highly predictive model of responses from visual cortical neurons. Deep neural networks (DNNs) provide a promising candidate for such a model. However, DNNs require orders of magnitude more training data than neuroscientists can collect from real neurons because experimental recording time is severely limited. This motivates us to find images that train highly-predictive DNNs with as little training data as possible. We propose gaudy images---high-contrast binarized versions of natural images---to efficiently train DNNs. In extensive simulation experiments, we find that training DNNs with gaudy images substantially reduces the number of training images needed to accurately predict the simulated responses of visual cortical neurons. We also find that gaudy images, chosen before training, outperform images chosen during training by active learning algorithms. Thus, gaudy images overemphasize features of natural images, especially edges, that are the most important for efficiently training DNNs. We believe gaudy images will aid in the modeling of visual cortical neurons, potentially opening new scientific questions about visual processing, as well as aid general practitioners that seek ways to improve the training of DNNs.

artificial intelligence, gaudy image, machine learning, (15 more...)

arXiv.org Machine Learning

2006.11412

Country:

South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Europe > Poland (0.04)
Asia > Middle East > Jordan (0.04)
Asia > Middle East > Israel > Southern District > Eilat (0.04)

Genre: Research Report (1.00)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Disney's Splash Mountain Ride Inspired By Studio's 'Most Racist Movie': Here's Why

International Business TimesJun-12-2020, 23:32:55 GMT

Disney Parks have reportedly been urged to overhaul the theme of its famous Splash Mountain ride. The park's popular log flume ride has recently been getting calls for Disney to alter its whole attraction motif. In the wake of the protests against police brutality and racial injustices and amid the ongoing Black Lives Matter movement, fans and parkgoers have claimed that Disney's Splash Mountain is based on the 1946 Disney film "Song of the South" -- which has been dubbed one of the "most racist movies" for its stereotypes of black people. According to a report from CNN, the controversial live-action animated musical film has long been criticized for romanticizing the post-Civil War period in South America. "Song of the South" is set in a plantation in Georgia in the 1800s during the Reconstruction era.

artificial intelligence, disney, science fiction, (11 more...)

International Business Times

Country:

South America (0.25)
North America > United States > California (0.07)
North America > United States > Florida > Orange County (0.05)

Industry:

Media > Film (1.00)
Leisure & Entertainment (1.00)
Law > Civil Rights & Constitutional Law (1.00)

Technology: Information Technology > Artificial Intelligence > Science Fiction (0.40)

Add feedback

A Formal Language Approach to Explaining RNNs

Ghosh, Bishwamittra, Neider, Daniel

arXiv.org Artificial IntelligenceJun-12-2020

This paper presents LEXR, a framework for explaining the decision making of recurrent neural networks (RNNs) using a formal description language called Linear Temporal Logic (LTL). LTL is the de facto standard for the specification of temporal properties in the context of formal verification and features many desirable properties that make the generated explanations easy for humans to interpret: it is a descriptive language, it has a variable-free syntax, and it can easily be translated into plain English. To generate explanations, LEXR follows the principle of counterexample-guided inductive synthesis and combines Valiant's probably approximately correct learning (PAC) with constraint solving. We prove that LEXR's explanations satisfy the PAC guarantee (provided the RNN can be described by LTL) and show empirically that these explanations are more accurate and easier-to-understand than the ones generated by recent algorithms that extract deterministic finite automata from RNNs.

explanation, logic & formal reasoning, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2006.07292

Country:

Europe > Germany > Rhineland-Palatinate > Kaiserslautern (0.04)
Asia > Singapore (0.04)
South America > Argentina > Pampas > Buenos Aires F.D. > Buenos Aires (0.04)
(3 more...)

Genre: Research Report (0.82)

Industry:

Health & Medicine > Therapeutic Area (0.46)
Government > Regional Government (0.30)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.89)

Add feedback

Analyzing the Impact of Foursquare and Streetlight Data with Human Demographics on Future Crime Prediction

Bappee, Fateha Khanam, Petry, Lucas May, Soares, Amilcar, Matwin, Stan

arXiv.org Machine LearningJun-12-2020

Finding the factors contributing to criminal activities and their consequences is essential to improve quantitative crime research. To respond to this concern, we examine an extensive set of features from different perspectives and explanations. Our study aims to build data-driven models for predicting future crime occurrences. In this paper, we propose the use of streetlight infrastructure and Foursquare data along with demographic characteristics for improving future crime incident prediction. We evaluate the classification performance based on various feature combinations as well as with the baseline model. Our proposed model was tested on each smallest geographic region in Halifax, Canada. Our findings demonstrate the effectiveness of integrating diverse sources of data to gain satisfactory classification performance.

data mining, machine learning, prediction, (20 more...)

arXiv.org Machine Learning

2006.07516

Country:

North America > Canada > Nova Scotia > Halifax Regional Municipality > Halifax (0.34)
North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > New York > New York County > New York City (0.04)
(5 more...)

Genre: Research Report > New Finding (1.00)

Industry: Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis (0.46)
Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (0.46)

Add feedback

Heterogeneity-Aware Federated Learning

Yang, Chengxu, Wang, QiPeng, Xu, Mengwei, Wang, Shangguang, Bian, Kaigui, Liu, Xuanzhe

arXiv.org Machine LearningJun-12-2020

Federated learning (FL) is an emerging distributed machine learning paradigm that stands out with its inherent privacy-preserving advantages. Heterogeneity is one of the core challenges in FL, which resides in the diverse user behaviors and hardware capacity across devices who participate in the training. Heterogeneity inherently exerts a huge influence on the FL training process, e.g., causing device unavailability. However, existing FL literature usually ignores the impacts of heterogeneity. To fill in the knowledge gap, we build FLASH, the first heterogeneity-aware FL platform. Based on FLASH and a large-scale user trace from 136k real-world users, we demonstrate the usefulness of FLASH in anatomizing the impacts of heterogeneity in FL by exploring three previously unaddressed research questions: whether and how can heterogeneity affect FL performance; how to configure a heterogeneity-aware FL system; and what are heterogeneity's impacts on existing FL optimizations. It shows that heterogeneity causes nontrivial performance degradation in FL from various aspects, and even invalidates some typical FL optimizations.

artificial intelligence, heterogeneity, machine learning, (13 more...)

arXiv.org Machine Learning

2006.06983

Country:

North America > United States > Virginia (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
South America > Brazil (0.04)
(8 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Optimal Allocation of Real-Time-Bidding and Direct Campaigns

Jauvion, Grégoire, Grislain, Nicolas

arXiv.org Artificial IntelligenceJun-12-2020

In this paper, we consider the problem of optimizing the revenue a web publisher gets through real-time bidding (i.e. from ads sold in real-time auctions) and direct (i.e. from ads sold through contracts agreed in advance). We consider a setting where the publisher is able to bid in the real-time bidding auction for each impression. If it wins the auction, it chooses a direct campaign to deliver and displays the corresponding ad. This paper presents an algorithm to build an optimal strategy for the publisher to deliver its direct campaigns while maximizing its real-time bidding revenue. The optimal strategy gives a formula to determine the publisher bid as well as a way to choose the direct campaign being delivered if the publisher bidder wins the auction, depending on the impression characteristics. The optimal strategy can be estimated on past auctions data. The algorithm scales with the number of campaigns and the size of the dataset. This is a very important feature, as in practice a publisher may have thousands of active direct campaigns at the same time and would like to estimate an optimal strategy on billions of auctions. The algorithm is a key component of a system which is being developed, and which will be deployed on thousands of web publishers worldwide, helping them to serve efficiently billions of ads a day to hundreds of millions of visitors.

artificial intelligence, auction, real time system, (20 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3219819.3219877

2006.0707

Country:

North America > United States > New York > New York County > New York City (0.05)
Europe > United Kingdom > England > Greater London > London (0.05)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
North America > United States > New Jersey > Middlesex County > Piscataway (0.04)

Genre: Research Report (0.40)

Industry:

Information Technology > Services (0.47)
Marketing (0.47)

Technology:

Information Technology > Architecture > Real Time Systems (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.47)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.41)

Add feedback

Seeing Results From AI, Even During Covid-19 Recession

#artificialintelligenceJun-11-2020, 01:46:37 GMT

New York has always been known as the city that never sleeps. In this crisis time, this definition has never been clearer. Citizens of NYC have been subjected to the stress of Coronavirus pressures, which have hit Americans with fatigue and feelings of hopelessness as a result of our grim economic situation. NYC continues to fight back and be resilient, combating the crisis situation with innovative companies like IPsoft that provide unique AI solutions for clients. By leveraging AI solutions, we can bring America back to its former glory, and instead transform the cause of sleepless nights from recession anxiety to the exhaustion of a memorable night out in the city that never sleeps.

artificial intelligence, customer, digital employee, (15 more...)

#artificialintelligence

Country:

North America > United States > New York (0.26)
North America > United States > California (0.15)
South America > Peru (0.06)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.86)
Health & Medicine > Therapeutic Area > Immunology (0.86)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology > Mental Health (0.35)

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

Asymptotics of Ridge(less) Regression under General Source Condition

Richards, Dominic, Mourtada, Jaouad, Rosasco, Lorenzo

arXiv.org Machine LearningJun-11-2020

Understanding the generalisation properties of Artificial Deep Neural Networks (ANN) has recently motivated a number of statistical questions. These models perform well in practice despite perfectly fitting (interpolating) the data, a property that seems at odds with classical statistical theory [49]. This has motivated the investigation of the generalisation performance of methods that achieve zero training error (interpolators) [32, 9, 11, 10, 8] and, in the context of linear least squares, the unique least norm solution to which gradient descent converges [22, 5, 37, 8, 21, 38, 20, 39]. Overparameterized linear models, where the number of variables exceed the number of points, are arguably the simplest and most natural setting where interpolation can be studied. Moreover, in certain regimes ANN can be approximated by suitable linear models [24, 17, 18, 2, 13]. The learning curve (test error versus model capacity) for interpolators has been shown to exhibit a characteristic "Double Descent" [1, 7] shape, where the test error decreases after peaking at the "interpolating" threshold, that is, the model capacity required to interpolate the data. The regime beyond this threshold naturally captures the settings of ANN [49], and thus, has motivated its investigation [36, 44, 39].

artificial intelligence, machine learning, regression, (15 more...)

arXiv.org Machine Learning

2006.06386

Country:

Europe > Italy (0.04)
South America > Brazil > Rio de Janeiro > Rio de Janeiro (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(4 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

Add feedback

Stability of Stochastic Gradient Descent on Nonsmooth Convex Losses

Bassily, Raef, Feldman, Vitaly, Guzmán, Cristóbal, Talwar, Kunal

arXiv.org Machine LearningJun-11-2020

Uniform stability is a notion of algorithmic stability that bounds the worst case change in the model output by the algorithm when a single data point in the dataset is replaced. An influential work of Hardt et al. (2016) provides strong upper bounds on the uniform stability of the stochastic gradient descent (SGD) algorithm on sufficiently smooth convex losses. These results led to important progress in understanding of the generalization properties of SGD and several applications to differentially private convex optimization for smooth losses. Our work is the first to address uniform stability of SGD on {\em nonsmooth} convex losses. Specifically, we provide sharp upper and lower bounds for several forms of SGD and full-batch GD on arbitrary Lipschitz nonsmooth convex losses. Our lower bounds show that, in the nonsmooth case, (S)GD can be inherently less stable than in the smooth case. On the other hand, our upper bounds show that (S)GD is sufficiently stable for deriving new and useful bounds on generalization error. Most notably, we obtain the first dimension-independent generalization bounds for multi-pass SGD in the nonsmooth case. In addition, our bounds allow us to derive a new algorithm for differentially private nonsmooth stochastic convex optimization with optimal excess population risk. Our algorithm is simpler and more efficient than the best known algorithm for the nonsmooth case Feldman et al. (2020).

algorithm, artificial intelligence, machine learning, (14 more...)

arXiv.org Machine Learning

2006.06914

Country:

South America > Chile (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > Ohio (0.04)
(5 more...)

Genre: Research Report (0.82)

Industry: Information Technology > Security & Privacy (0.92)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (1.00)

Add feedback