AITopics | Banff

Collaborating Authors

Banff

Multi-step domain adaptation by adversarial attack to $\mathcal{H} \Delta \mathcal{H}$-divergence

Asadulaev, Arip, Panfilov, Alexander, Filchenkov, Andrey

arXiv.org Artificial IntelligenceJul-18-2022

Adversarial examples are transferable between different models. In our paper, we propose to use this property for multi-step domain adaptation. In unsupervised domain adaptation settings, we demonstrate that replacing the source domain with adversarial examples to $\mathcal{H} \Delta \mathcal{H}$-divergence can improve source classifier accuracy on the target domain. Our method can be connected to most domain adaptation techniques. We conducted a range of experiments and achieved improvement in accuracy on Digits and Office-Home datasets.

adaptation, domain adaptation, proceedings, (13 more...)

arXiv.org Artificial Intelligence

2207.08948

Country:

Asia > Middle East > Jordan (0.05)
North America > United States > California > San Diego County > San Diego (0.05)
Europe > France > Hauts-de-France > Nord > Lille (0.05)
(14 more...)

Genre: Research Report (0.66)

Industry:

Information Technology > Security & Privacy (0.43)
Government > Military (0.43)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)

Add feedback

Repairing Systematic Outliers by Learning Clean Subspaces in VAEs

Eduardo, Simao, Xu, Kai, Nazabal, Alfredo, Sutton, Charles

arXiv.org Artificial IntelligenceJul-16-2022

Data cleaning often comprises outlier detection and data repair. Systematic errors result from nearly deterministic transformations that occur repeatedly in the data, e.g. specific image pixels being set to default values or watermarks. Consequently, models with enough capacity easily overfit to these errors, making detection and repair difficult. Seeing as a systematic outlier is a combination of patterns of a clean instance and systematic error patterns, our main insight is that inliers can be modelled by a smaller representation (subspace) in a model than outliers. By exploiting this, we propose Clean Subspace Variational Autoencoder (CLSVAE), a novel semi-supervised model for detection and automated repair of systematic errors. The main idea is to partition the latent space and model inlier and outlier patterns separately. CLSVAE is effective with much less labelled data compared to previous related models, often with less than 2% of the data. We provide experiments using three image datasets in scenarios with different levels of corruption and labelled set sizes, comparing to relevant baselines. CLSVAE provides superior repairs without human intervention, e.g. with just 0.25% of labelled data we see a relative error decrease of 58% compared to the closest baseline.

dataset, outlier, systematic error, (15 more...)

arXiv.org Artificial Intelligence

2207.0805

Country:

North America > United States > California > Santa Clara County > Mountain View (0.04)
North America > Canada > Alberta > Census Division No. 15 > Improvement District No. 9 > Banff (0.04)
Europe > United Kingdom > Scotland (0.04)
Europe > Hungary > Budapest > Budapest (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Interpretable Deep Learning: Interpretation, Interpretability, Trustworthiness, and Beyond

Li, Xuhong, Xiong, Haoyi, Li, Xingjian, Wu, Xuanyu, Zhang, Xiao, Liu, Ji, Bian, Jiang, Dou, Dejing

arXiv.org Artificial IntelligenceJul-15-2022

Deep neural networks have been well-known for their superb handling of various machine learning and artificial intelligence tasks. However, due to their over-parameterized black-box nature, it is often difficult to understand the prediction results of deep models. In recent years, many interpretation tools have been proposed to explain or reveal how deep models make decisions. In this paper, we review this line of research and try to make a comprehensive survey. Specifically, we first introduce and clarify two basic concepts -- interpretations and interpretability -- that people usually get confused about. To address the research efforts in interpretations, we elaborate the designs of a number of interpretation algorithms, from different perspectives, by proposing a new taxonomy. Then, to understand the interpretation results, we also survey the performance metrics for evaluating interpretation algorithms. Further, we summarize the current works in evaluating models' interpretability using "trustworthy" interpretation algorithms. Finally, we review and discuss the connections between deep models' interpretations and other factors, such as adversarial robustness and learning from interpretations, and we introduce several open-source libraries for interpretation algorithms and evaluation approaches.

artificial intelligence, machine learning, survey article, (15 more...)

arXiv.org Artificial Intelligence

2103.10689

Country:

North America > United States > California > Los Angeles County > Long Beach (0.14)
North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.14)
North America > United States > Oregon > Lane County > Eugene (0.14)
(42 more...)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Information Technology (0.92)
Health & Medicine > Diagnostic Medicine > Imaging (0.67)
Health & Medicine > Therapeutic Area > Neurology (0.45)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Anomalous behaviour in loss-gradient based interpretability methods

Subramanian, Vinod, Gururani, Siddharth, Benetos, Emmanouil, Sandler, Mark

arXiv.org Artificial IntelligenceJul-15-2022

Loss-gradients are used to interpret the decision making process of deep learning models. In this work, we evaluate loss-gradient based attribution methods by occluding parts of the input and comparing the performance of the occluded input to the original input. We observe that the occluded input has better performance than the original across the test dataset under certain conditions. Similar behaviour is observed in sound and image recognition tasks. We explore different loss-gradient attribution methods, occlusion levels and replacement values to explain the phenomenon of performance improvement under occlusion.

attribution method, gradient, input feature, (14 more...)

arXiv.org Artificial Intelligence

2207.07769

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.05)
Europe > Italy > Marche > Ancona Province > Ancona (0.05)
North America > Canada > Alberta > Census Division No. 15 > Improvement District No. 9 > Banff (0.04)
(2 more...)

Genre: Research Report (0.82)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Comparing the latent space of generative models

Asperti, Andrea, Tonelli, Valerio

arXiv.org Artificial IntelligenceJul-14-2022

Different encodings of datapoints in the latent space of latent-vector generative models may result in more or less effective and disentangled characterizations of the different explanatory factors of variation behind the data. Many works have been recently devoted to the explorationof the latent space of specific models, mostly focused on the study of how features are disentangled and of how trajectories producing desired alterations of data in the visible space can be found. In this work we address the more general problem of comparing the latent spaces of different models, looking for transformations between them. We confined the investigation to the familiar and largely investigated case of generative models for the data manifold of human faces. The surprising, preliminary result reported in this article is that (provided models have not been taught or explicitly conceived to act differently) a simple linear mapping is enough to pass from a latent space to another while preserving most of the information.

latent space, springer nature 2021, stylegan, (12 more...)

arXiv.org Artificial Intelligence

2207.06812

Country:

North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
North America > Puerto Rico > San Juan > San Juan (0.04)
North America > Canada > Quebec > Montreal (0.04)
(11 more...)

Genre: Research Report (0.84)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Natural Language > Generation (0.82)
Information Technology > Artificial Intelligence > Vision (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.67)

Add feedback

Closing the Loop: A Framework for Trustworthy Machine Learning in Power Systems

Stiasny, Jochen, Chevalier, Samuel, Nellikkath, Rahul, Sævarsson, Brynjar, Chatzivasileiadis, Spyros

arXiv.org Artificial IntelligenceJul-14-2022

Deep decarbonization of the energy sector will require massive penetration of stochastic renewable energy resources and an enormous amount of grid asset coordination; this represents a challenging paradigm for the power system operators who are tasked with maintaining grid stability and security in the face of such changes. With its ability to learn from complex datasets and provide predictive solutions on fast timescales, machine learning (ML) is well-posed to help overcome these challenges as power systems transform in the coming decades. In this work, we outline five key challenges (dataset generation, data pre-processing, model training, model assessment, and model embedding) associated with building trustworthy ML models which learn from physics-based simulation data. We then demonstrate how linking together individual modules, each of which overcomes a respective challenge, at sequential stages in the machine learning pipeline can help enhance the overall performance of the training process. In particular, we implement methods that connect different elements of the learning pipeline through feedback, thus "closing the loop" between model training, performance assessments, and re-training. We demonstrate the effectiveness of this framework, its constituent modules, and its feedback connections by learning the N-1 small-signal stability margin associated with a detailed model of a proposed North Sea Wind Power Hub system.

assessment, dataset, system dynamic and control symposium, (10 more...)

arXiv.org Artificial Intelligence

2203.07505

Country:

Europe > North Sea (0.25)
Atlantic Ocean > North Atlantic Ocean > North Sea (0.25)
North America > Canada > Alberta > Census Division No. 15 > Improvement District No. 9 > Banff (0.05)
(4 more...)

Genre:

Workflow (1.00)
Overview (0.93)
Research Report (0.64)

Industry:

Energy > Power Industry (1.00)
Energy > Renewable > Wind (0.66)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)

Add feedback

Neural Networks for Encoding Dynamic Security-Constrained Optimal Power Flow

Murzakhanov, Ilgiz, Venzke, Andreas, Misyris, George S., Chatzivasileiadis, Spyros

arXiv.org Artificial IntelligenceJul-14-2022

This paper introduces a framework to capture previously intractable optimization constraints and transform them to a mixed-integer linear program, through the use of neural networks. We encode the feasible space of optimization problems characterized by both tractable and intractable constraints, e.g. differential equations, to a neural network. Leveraging an exact mixed-integer reformulation of neural networks, we solve mixed-integer linear programs that accurately approximate solutions to the originally intractable non-linear optimization problem. We apply our methods to the AC optimal power flow problem (AC-OPF), where directly including dynamic security constraints renders the AC-OPF intractable. Our proposed approach has the potential to be significantly more scalable than traditional approaches. We demonstrate our approach for power system operation considering N-1 security and small-signal stability, showing how it can efficiently obtain cost-optimal solutions which at the same time satisfy both static and dynamic security constraints.

constraint, neural network, small-signal stability, (11 more...)

arXiv.org Artificial Intelligence

2003.07939

Country:

North America > Canada > Alberta > Census Division No. 15 > Improvement District No. 9 > Banff (0.05)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
Europe > Denmark > Capital Region > Kongens Lyngby (0.04)

Genre: Research Report (0.40)

Industry: Energy > Power Industry (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Improved $\alpha$-GAN architecture for generating 3D connected volumes with an application to radiosurgery treatment planning

Mohammadjafari, Sanaz, Cevik, Mucahit, Basar, Ayse

arXiv.org Artificial IntelligenceJul-13-2022

Generative Adversarial Networks (GANs) have gained significant attention in several computer vision tasks for generating high-quality synthetic data. Various medical applications including diagnostic imaging and radiation therapy can benefit greatly from synthetic data generation due to data scarcity in the domain. However, medical image data is typically kept in 3D space, and generative models suffer from the curse of dimensionality issues in generating such synthetic data. In this paper, we investigate the potential of GANs for generating connected 3D volumes. We propose an improved version of 3D $\alpha$-GAN by incorporating various architectural enhancements. On a synthetic dataset of connected 3D spheres and ellipsoids, our model can generate fully connected 3D shapes with similar geometrical characteristics to that of training data. We also show that our 3D GAN model can successfully generate high-quality 3D tumor volumes and associated treatment specifications (e.g., isocenter locations). Similar moment invariants to the training data as well as fully connected 3D shapes confirm that improved 3D $\alpha$-GAN implicitly learns the training data distribution, and generates realistic-looking samples. The capability of improved 3D $\alpha$-GAN makes it a valuable source for generating synthetic medical image data that can help future research in this domain.

architecture, dataset, subsphere, (13 more...)

arXiv.org Artificial Intelligence

2207.11223

Country:

North America > Canada > Ontario > Toronto (0.05)
North America > Canada > Alberta > Census Division No. 15 > Improvement District No. 9 > Banff (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)

Genre: Research Report > New Finding (0.46)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Nuclear Medicine (1.00)
Health & Medicine > Health Care Technology (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Active Distribution System Coordinated Control Method via Artificial Intelligence

Lau, Matthew, Thames, Kayla, Meliopoulos, Sakis

arXiv.org Artificial IntelligenceJul-12-2022

The increasing deployment of end use power resources in distribution systems created active distribution systems. Uncontrolled active distribution systems exhibit wide variations of voltage and loading throughout the day as some of these resources operate under max power tracking control of highly variable wind and solar irradiation while others exhibit random variations and/or dependency on weather conditions. It is necessary to control the system to provide power reliably and securely under normal voltages and frequency. Classical optimization approaches to control the system towards this goal suffer from the dimensionality of the problem and the need for a global optimization approach to coordinate a huge number of small resources. Artificial Intelligence (AI) methods offer an alternative that can provide a practical approach to this problem. We suggest that neural networks with self-attention mechanisms have the potential to aid in the optimization of the system. In this paper, we present this approach and provide promising preliminary results.

architecture, distribution system, efficiency, (9 more...)

arXiv.org Artificial Intelligence

2207.14642

Country:

North America > Canada > Alberta > Census Division No. 15 > Improvement District No. 9 > Banff (0.05)
North America > United States > Maryland (0.04)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
(3 more...)

Genre: Research Report (0.82)

Industry:

Energy > Renewable > Solar (1.00)
Energy > Power Industry (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

A Word is Worth A Thousand Dollars: Adversarial Attack on Tweets Fools Stock Predictions

Xie, Yong, Wang, Dakuo, Chen, Pin-Yu, Xiong, Jinjun, Liu, Sijia, Koyejo, Sanmi

arXiv.org Artificial IntelligenceJul-12-2022

More and more investors and machine learning models rely on social media (e.g., Twitter and Reddit) to gather real-time information and sentiment to predict stock price movements. Although text-based models are known to be vulnerable to adversarial attacks, whether stock prediction models have similar vulnerability is underexplored. In this paper, we experiment with a variety of adversarial attack configurations to fool three stock prediction victim models. We address the task of adversarial generation by solving combinatorial optimization problems with semantics and budget constraints. Our results show that the proposed attack method can achieve consistent success rates and cause significant monetary loss in trading simulation by simply concatenating a Figure 1: An example of word-replacement adversarial perturbed but semantically similar tweet.

computational linguistic, perturbation, tweet, (15 more...)

arXiv.org Artificial Intelligence

2205.01094

Country:

North America > Dominican Republic (0.04)
Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
(13 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Government > Military (1.00)
Banking & Finance > Trading (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
(2 more...)

Add feedback