Goto

Collaborating Authors

 cf-v ae


Reviewer # 1 1 [ Issue ] Limited novelty - the proposed method is very similar to CF-V AE

Neural Information Processing Systems

We provide various samples of SoftPointFlow in the appendix to support the performance improvement. This allows image data to have continuous values and volume components in data space. We mentioned that point in the conclusion section, 'Our


Generating Semantic Adversarial Examples via Feature Manipulation

Wang, Shuo, Chen, Shangyu, Chen, Tianle, Nepal, Surya, Rudolph, Carsten, Grobler, Marthie

arXiv.org Machine Learning

The vulnerability of deep neural networks to adversarial attacks has been widely demonstrated (e.g., adversarial example attacks). Traditional attacks perform unstructured pixel-wise perturbation to fool the classifier. An alternative approach is to have perturbations in the latent space. However, such perturbations are hard to control due to the lack of interpretability and disentanglement. In this paper, we propose a more practical adversarial attack by designing structured perturbation with semantic meanings. Our proposed technique manipulates the semantic attributes of images via the disentangled latent codes. The intuition behind our technique is that images in similar domains have some commonly shared but theme-independent semantic attributes, e.g. thickness of lines in handwritten digits, that can be bidirectionally mapped to disentangled latent codes. We generate adversarial perturbation by manipulating a single or a combination of these latent codes and propose two unsupervised semantic manipulation approaches: vector-based disentangled representation and feature map-based disentangled representation, in terms of the complexity of the latent codes and smoothness of the reconstructed images. We conduct extensive experimental evaluations on real-world image data to demonstrate the power of our attacks for black-box classifiers. We further demonstrate the existence of a universal, image-agnostic semantic adversarial example.


Conditional Flow Variational Autoencoders for Structured Sequence Prediction

Bhattacharyya, Apratim, Hanselmann, Michael, Fritz, Mario, Schiele, Bernt, Straehle, Christoph-Nikolas

arXiv.org Machine Learning

Prediction of future states of the environment and interacting agents is a key competence required for autonomous agents to operate successfully in the real world. Prior work for structured sequence prediction based on latent variable models imposes a uni-modal standard Gaussian prior on the latent variables. This induces a strong model bias which makes it challenging to fully capture the multi-modality of the distribution of the future states. In this work, we introduce Conditional Flow Variational Autoencoders which uses our novel conditional normalizing flow based prior. We show that using our novel complex multi-modal conditional prior we can capture complex multi-modal conditional distributions. Furthermore, we study for the first time latent variable collapse with normalizing flows and propose solutions to prevent such failure cases. Our experiments on three multi-modal structured sequence prediction datasets -- MNIST Sequences, Stanford Drone and HighD -- show that the proposed method obtains state of art results across different evaluation metrics.