Goto

Collaborating Authors

 trillo


An Optimal Transport Approach for Computing Adversarial Training Lower Bounds in Multiclass Classification

Trillos, Nicolas Garcia, Jacobs, Matt, Kim, Jakwang, Werenski, Matthew

arXiv.org Artificial Intelligence

Despite the success of deep learning-based algorithms, it is widely known that neural networks may fail to be robust. A popular paradigm to enforce robustness is adversarial training (AT), however, this introduces many computational and theoretical difficulties. Recent works have developed a connection between AT in the multiclass classification setting and multimarginal optimal transport (MOT), unlocking a new set of tools to study this problem. In this paper, we leverage the MOT connection to propose computationally tractable numerical algorithms for computing universal lower bounds on the optimal adversarial risk and identifying optimal classifiers. We propose two main algorithms based on linear programming (LP) and entropic regularization (Sinkhorn). Our key insight is that one can harmlessly truncate the higher order interactions between classes, preventing the combinatorial run times typically encountered in MOT problems. We validate these results with experiments on MNIST and CIFAR-$10$, which demonstrate the tractability of our approach.


Spectral Neural Networks: Approximation Theory and Optimization Landscape

Li, Chenghui, Sonthalia, Rishi, Trillos, Nicolas Garcia

arXiv.org Machine Learning

There is a large variety of machine learning methodologies that are based on the extraction of spectral geometric information from data. However, the implementations of many of these methods often depend on traditional eigensolvers, which present limitations when applied in practical online big data scenarios. To address some of these challenges, researchers have proposed different strategies for training neural networks as alternatives to traditional eigensolvers, with one such approach known as Spectral Neural Network (SNN). In this paper, we investigate key theoretical aspects of SNN. First, we present quantitative insights into the tradeoff between the number of neurons and the amount of spectral geometric information a neural network learns. Second, we initiate a theoretical exploration of the optimization landscape of SNN's objective to shed light on the training dynamics of SNN. Unlike typical studies of convergence to global solutions of NN training dynamics, SNN presents an additional complexity due to its non-convex ambient loss function.


An A.I.-Generated Film Depicts Human Loneliness, in "Thank You for Not Answering"

The New Yorker

In the first thirty seconds of the director and artist Paul Trillo's short film "Thank You for Not Answering," a woman gazes out the window of a subway car that appears to have sunk underwater. A man appears in the window swimming toward the car, his body materializing from the darkness and swirling water. It's a frightening, claustrophobic, violent scene--one that could have taken hundreds of thousands of dollars of props and special effects to shoot, but Trillo generated it in a matter of minutes using an experimental tool kit made by an artificial-intelligence company called Runway. The figures in the film appear real, played by humans who may actually be underwater. But another glance reveals the uncanniness in their blank eyes, distended limbs, mushy features.


The Multimarginal Optimal Transport Formulation of Adversarial Multiclass Classification

Trillos, Nicolas Garcia, Jacobs, Matt, Kim, Jakwang

arXiv.org Artificial Intelligence

We study a family of adversarial multiclass classification problems and provide equivalent reformulations in terms of: 1) a family of generalized barycenter problems introduced in the paper and 2) a family of multimarginal optimal transport problems where the number of marginals is equal to the number of classes in the original classification problem. These new theoretical results reveal a rich geometric structure of adversarial learning problems in multiclass classification and extend recent results restricted to the binary classification setting. A direct computational implication of our results is that by solving either the barycenter problem and its dual, or the MOT problem and its dual, we can recover the optimal robust classification rule and the optimal adversarial strategy for the original adversarial problem. Examples with synthetic and real data illustrate our results.


Clustering dynamics on graphs: from spectral clustering to mean shift through Fokker-Planck interpolation

Craig, Katy, Trillos, Nicolás García, Slepčev, Dejan

arXiv.org Machine Learning

In this work we build a unifying framework to interpolate between density-driven and geometry-based algorithms for data clustering, and specifically, to connect the mean shift algorithm with spectral clustering at discrete and continuum levels. We seek this connection through the introduction of Fokker-Planck equations on data graphs. Besides introducing new forms of mean shift algorithms on graphs, we provide new theoretical insights on the behavior of the family of diffusion maps in the large sample limit as well as provide new connections between diffusion maps and mean shift dynamics on a fixed graph. Several numerical examples illustrate our theoretical findings and highlight the benefits of interpolating density-driven and geometry-based clustering algorithms.