AITopics | South America

Collaborating Authors

South America

PoNet: Pooling Network for Efficient Token Mixing in Long Sequences

Tan, Chao-Hong, Chen, Qian, Wang, Wen, Zhang, Qinglin, Zheng, Siqi, Ling, Zhen-Hua

arXiv.org Artificial IntelligenceOct-7-2021

Transformer-based models have achieved great success in various NLP, vision, and speech tasks. However, the core of Transformer, the self-attention mechanism, has a quadratic time and memory complexity with respect to the sequence length, which hinders applications of Transformer-based models to long sequences. Many approaches have been proposed to mitigate this problem, such as sparse attention mechanisms, low-rank matrix approximations and scalable kernels, and token mixing alternatives to self-attention. We propose a novel Pooling Network (PoNet) for token mixing in long sequences with linear complexity. We design multi-granularity pooling and pooling fusion to capture different levels of contextual information and combine their interactions with tokens. On the Long Range Arena benchmark, PoNet significantly outperforms Transformer and achieves competitive accuracy, while being only slightly slower than the fastest model, FNet, across all sequence lengths measured on GPUs. We also conduct systematic studies on the transfer learning capability of PoNet and observe that PoNet achieves 96.0% of the accuracy of BERT on the GLUE benchmark, outperforming FNet by 4.5% relative. Comprehensive ablation analysis demonstrates effectiveness of the designed multi-granularity pooling and pooling fusion for token mixing in long sequences and efficacy of the designed pre-training tasks for PoNet to learn transferable contextualized language representations.

information, ponet, transformer, (16 more...)

arXiv.org Artificial Intelligence

2110.02442

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Austria (0.04)
Africa > Ethiopia > Addis Ababa > Addis Ababa (0.04)
(6 more...)

Genre: Research Report (0.41)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.55)

Add feedback

Detecting adversaries in Crowdsourcing

Traganitis, Panagiotis A., Giannakis, Georgios B.

arXiv.org Machine LearningOct-7-2021

Despite its successes in various machine learning and data science tasks, crowdsourcing can be susceptible to attacks from dedicated adversaries. This work investigates the effects of adversaries on crowdsourced classification, under the popular Dawid and Skene model. The adversaries are allowed to deviate arbitrarily from the considered crowdsourcing model, and may potentially cooperate. To address this scenario, we develop an approach that leverages the structure of second-order moments of annotator responses, to identify large numbers of adversaries, and mitigate their impact on the crowdsourcing task. The potential of the proposed approach is empirically demonstrated on synthetic and real crowdsourcing datasets.

adversary, algorithm, annotator, (15 more...)

arXiv.org Machine Learning

2110.04117

Country:

South America > Brazil > Rio de Janeiro > Rio de Janeiro (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Nevada > Clark County > Las Vegas (0.04)
(9 more...)

Genre: Research Report (0.82)

Technology:

Information Technology > Communications > Social Media > Crowdsourcing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.93)

Add feedback

Multifile Partitioning for Record Linkage and Duplicate Detection

Aleshin-Guendel, Serge, Sadinle, Mauricio

arXiv.org Machine LearningOct-7-2021

Merging datafiles containing information on overlapping sets of entities is a challenging task in the absence of unique identifiers, and is further complicated when some entities are duplicated in the datafiles. Most approaches to this problem have focused on linking two files assumed to be free of duplicates, or on detecting which records in a single file are duplicates. However, it is common in practice to encounter scenarios that fit somewhere in between or beyond these two settings. We propose a Bayesian approach for the general setting of multifile record linkage and duplicate detection. We use a novel partition representation to propose a structured prior for partitions that can incorporate prior information about the data collection processes of the datafiles in a flexible manner, and extend previous models for comparison data to accommodate the multifile setting. We also introduce a family of loss functions to derive Bayes estimates of partitions that allow uncertain portions of the partitions to be left unresolved. The performance of our proposed methodology is explored through extensive simulations. Code implementing the methodology is available at https://github.com/aleshing/multilink .

partition, simulation, specification, (15 more...)

arXiv.org Machine Learning

2110.03839

Country:

North America > United States > Washington > King County > Seattle (0.28)
South America > Colombia (0.04)
North America > United States > Massachusetts (0.04)
(2 more...)

Genre: Research Report (0.81)

Industry: Law (0.69)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Add feedback

Creating Training Sets via Weak Indirect Supervision

Zhang, Jieyu, Wang, Bohan, Song, Xiangchen, Wang, Yujing, Yang, Yaming, Bai, Jing, Ratner, Alexander

arXiv.org Machine LearningOct-7-2021

Creating labeled training sets has become one of the major roadblocks in machine learning. To address this, recent Weak Supervision (WS) frameworks synthesize training labels from multiple potentially noisy supervision sources. However, existing frameworks are restricted to supervision sources that share the same output space as the target task. To extend the scope of usable sources, we formulate Weak Indirect Supervision (WIS), a new research problem for automatically synthesizing training labels based on indirect supervision sources that have different output label spaces. To overcome the challenge of mismatched output spaces, we develop a probabilistic modeling approach, PLRM, which uses user-provided label relations to model and leverage indirect supervision sources. Moreover, we provide a theoretically-principled test of the distinguishability of PLRM for unseen labels, along with an generalization bound. On both image and text classification tasks as well as an industrial advertising application, we demonstrate the advantages of PLRM by outperforming baselines by a margin of 2%-9%.

label relation, relation, unseen label, (13 more...)

arXiv.org Machine Learning

2110.03484

Country:

North America > United States > New York > New York County > New York City (0.04)
South America > Brazil > Rio de Janeiro > Rio de Janeiro (0.04)
Oceania > Australia > Victoria > Melbourne (0.04)
(6 more...)

Genre: Research Report (0.64)

Industry: Health & Medicine (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.87)

Add feedback

Detecting and Quantifying Malicious Activity with Simulation-based Inference

Gambardella, Andrew, State, Bogdan, Khan, Naeemullah, Tsourides, Leo, Torr, Philip H. S., Baydin, Atılım Güneş

arXiv.org Machine LearningOct-7-2021

Probabilistic programming provides numerous advantages Ideally speaking, a good recommendations system should be over other techniques, including but not able to identify and remove malicious users before they can limited to providing a disentangled representation disrupt the ranking system by a significant margin. However, of how malicious users acted under a structured to eliminate the risk of false positives a resilient ranking model, as well as allowing for the quantification system can use as much data as possible. So we have to of damage caused by malicious users. We show adjust the tradeoff between false positives and the damage a experiments in malicious user identification using set of malicious users can cause to a ranking system.

algorithm, detecting and quantifying malicious activity, malicious user, (13 more...)

arXiv.org Machine Learning

2110.02483

Country:

South America > Brazil (0.04)
North America > United States > New York (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)

Genre: Research Report (0.64)

Industry:

Media (1.00)
Government (0.93)
Information Technology > Security & Privacy (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Communications > Social Media (0.95)
(2 more...)

Add feedback

Google puts AI to work to make Maps, Search more environment friendly

#artificialintelligenceOct-6-2021, 13:45:36 GMT

Tech giant Google is going to try and make some of its software more environment friendly. In a slew of updates announced today, the company said software like Maps and Search will start showing results that are meant to steer people towards environment friendly options. For instance, users in the US will not only see the fastest route to a destination on Google Maps, but the app will also show the most fuel-efficient route. The company said it is using artificial intelligence (AI) and insights from the US Department of Energy's National Renewable Energy Laboratory (NREL) to provide eco-friendly routes on Android and iOS versions of Maps. The feature is available only in the US right now and will come to Europe next year.

emission, flight, google, (3 more...)

#artificialintelligence

Country:

North America > United States (1.00)
Europe (0.26)
South America > Brazil > Rio de Janeiro > Rio de Janeiro (0.06)
Asia > Middle East > Israel (0.06)

Genre: Press Release (0.83)

Industry:

Government > Regional Government > North America Government > United States Government (0.58)
Energy > Renewable (0.58)
Information Technology > Services (0.57)

Technology: Information Technology > Artificial Intelligence > Applied AI (0.54)

Add feedback

Google turns its AI on traffic lights to reduce pollution

EngadgetOct-6-2021, 07:01:32 GMT

Poorly timed traffic lights don't just waste precious minutes. Like Google's chief sustainability officer Kate Brandt pointed out at a media event yesterday, they're also bad for the environment and public health. The company unveiled a slew of sustainability-centric products and updates today that aim to help users make more informed, environmentally friendly decisions. But it's also been working on a project that could use AI to make traffic lights more efficient and, as a result, decrease pollution in general. When your vehicle stops at an intersection, that idling time leads to wasted fuel and "more street-level air pollution," Brandt said.

intersection, pollution, traffic light, (5 more...)

Engadget

Country:

South America > Brazil > Rio de Janeiro > Rio de Janeiro (0.07)
Asia > Middle East > Israel > Southern District > Beer-Sheva (0.07)
Asia > Middle East > Israel > Haifa District > Haifa (0.07)

Industry:

Transportation > Infrastructure & Services (0.88)
Transportation > Ground > Road (0.88)

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

DoubleStar: Long-Range Attack Towards Depth Estimation based Obstacle Avoidance in Autonomous Systems

Zhou, Ce, Yan, Qiben, Shi, Yan, Sun, Lichao

arXiv.org Artificial IntelligenceOct-6-2021

Depth estimation-based obstacle avoidance has been widely adopted by autonomous systems (drones and vehicles) for safety purpose. It normally relies on a stereo camera to automatically detect obstacles and make flying/driving decisions, e.g., stopping several meters ahead of the obstacle in the path or moving away from the detected obstacle. In this paper, we explore new security risks associated with the stereo vision-based depth estimation algorithms used for obstacle avoidance. By exploiting the weaknesses of the stereo matching in depth estimation algorithms and the lens flare effect in optical imaging, we propose DoubleStar, a long-range attack that injects fake obstacle depth by projecting pure light from two complementary light sources. DoubleStar includes two distinctive attack formats: beams attack and orbs attack, which leverage projected light beams and lens flare orbs respectively to cause false depth perception. We successfully attack two commercial stereo cameras designed for autonomous systems (ZED and Intel RealSense). The visualization of fake depth perceived by the stereo cameras illustrates the false stereo matching induced by DoubleStar. We further use Ardupilot to simulate the attack and demonstrate its impact on drones. To validate the attack on real systems, we perform a real-world attack towards a commercial drone equipped with state-of-the-art obstacle avoidance algorithms. Our attack can continuously bring a flying drone to a sudden stop or drift it away across a long distance under various lighting conditions, even bypassing sensor fusion mechanisms. Specifically, our experimental results show that DoubleStar creates fake depth up to 15 meters in distance at night and up to 8 meters during the daytime. To mitigate this newly discovered threat, we provide discussions on potential countermeasures to defend against DoubleStar.

doublestar, drone, fake depth, (16 more...)

arXiv.org Artificial Intelligence

2110.03154

Country:

North America > United States > Michigan (0.04)
South America > Chile (0.04)
North America > United States > Pennsylvania (0.04)
(2 more...)

Genre: Research Report > New Finding (0.88)

Industry:

Information Technology > Security & Privacy (1.00)
Automobiles & Trucks (1.00)
Information Technology > Robotics & Automation (0.88)
Transportation > Ground > Road (0.67)

Technology:

Information Technology > Artificial Intelligence > Vision > Image Understanding (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

On The Vulnerability of Recurrent Neural Networks to Membership Inference Attacks

Yang, Yunhao, Gohari, Parham, Topcu, Ufuk

arXiv.org Artificial IntelligenceOct-6-2021

We study the privacy implications of deploying recurrent neural networks in machine learning. We consider membership inference attacks (MIAs) in which an attacker aims to infer whether a given data record has been used in the training of a learning agent. Using existing MIAs that target feed-forward neural networks, we empirically demonstrate that the attack accuracy wanes for data records used earlier in the training history. Alternatively, recurrent networks are specifically designed to better remember their past experience; hence, they are likely to be more vulnerable to MIAs than their feed-forward counterparts. We develop a pair of MIA layouts for two primary applications of recurrent networks, namely, deep reinforcement learning and sequence-to-sequence tasks. We use the first attack to provide empirical evidence that recurrent networks are indeed more vulnerable to MIAs than feed-forward networks with the same performance level. We use the second attack to showcase the differences between the effects of overtraining recurrent and feed-forward networks on the accuracy of their respective MIAs. Finally, we deploy a differential privacy mechanism to resolve the privacy vulnerability that the MIAs exploit. For both attack layouts, the privacy mechanism degrades the attack accuracy from above 80% to 50%, which is equal to guessing the data membership uniformly at random, while trading off less than 10% utility.

dataset, dirichlet mechanism, mia, (14 more...)

arXiv.org Artificial Intelligence

2110.03054

Country:

North America > United States > Texas > Travis County > Austin (0.14)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.04)
(2 more...)

Genre: Research Report (0.50)

Industry:

Information Technology > Security & Privacy (1.00)
Government (1.00)
Law (0.93)
Health & Medicine (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Solving Multistage Stochastic Linear Programming via Regularized Linear Decision Rules: An Application to Hydrothermal Dispatch Planning

Nazare, Felipe, Street, Alexandre

arXiv.org Machine LearningOct-6-2021

The solution of multistage stochastic linear problems (MSLP) represents a challenge for many applications. Long-term hydrothermal dispatch planning (LHDP) materializes this challenge in a real-world problem that affects electricity markets, economies, and natural resources worldwide. No closed-form solutions are available for MSLP and the definition of non-anticipative policies with high-quality out-of-sample performance is crucial. Linear decision rules (LDR) provide an interesting simulation-based framework for finding high-quality policies to MSLP through two-stage stochastic models. In practical applications, however, the number of parameters to be estimated when using an LDR may be close or higher than the number of scenarios, thereby generating an in-sample overfit and poor performances in out-of-sample simulations. In this paper, we propose a novel regularization scheme for LDR based on the AdaLASSO (adaptive least absolute shrinkage and selection operator). The goal is to use the parsimony principle as largely studied in high-dimensional linear regression models to obtain better out-of-sample performance for an LDR applied to MSLP. Computational experiments show that the overfit threat is non-negligible when using the classical non-regularized LDR to solve MSLP. For the LHDP problem, our analysis highlights the following benefits of the proposed framework in comparison to the non-regularized benchmark: 1) significant reductions in the number of non-zero coefficients (model parsimony), 2) substantial cost reductions in out-of-sample evaluations, and 3) improved spot-price profiles.

coefficient, ldr, scenario, (15 more...)

arXiv.org Machine Learning

2110.03146

Country: South America > Brazil > Rio de Janeiro > Rio de Janeiro (0.04)

Genre: Research Report (0.82)

Industry:

Energy > Power Industry (1.00)
Energy > Renewable (0.95)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.68)

Add feedback