Goto

Collaborating Authors

 Country


2019 in Review: 10 Essential AI YouTube Channels

#artificialintelligence

We're experiencing a profound shift in how educational content is created and delivered. Online education platforms are flourishing, and a recent Pew Research poll shows the world's leading video platform YouTube has finally become more than a repository of cat videos -- about half of YouTube users say it's very important for helping them figure out how to do things they've never done before. The year 2019 saw unprecedented growth in YouTube educational content on artificial intelligence. Synced has selected 10 AI-oriented YouTube channels we hope might provide our readers a cozy little holiday binge-watching session. Preserve Knowledge focuses on advances in mathematics, computer science, and artificial intelligence.


What Does Blockchain & The 4th Industrial Revolution Mean for Us?

#artificialintelligence

The magic wand of blockchain technology has touched our lives in multiple ways over the last decade. It has made cryptocurrency traders out of ordinary investors who would even shy away from the traditional stock markets. It has provided us with an easy way of transferring money across borders, without the interference of banks. On a larger scale, blockchain has provided enterprises with a trustless and tamper-proof apparatus for tracking assets, verifying identity, settling contracts, and so much more. New industries based on asset tokenization and decentralized finance (DeFi) have been created on the strength of blockchain technology.


What Does Blockchain & The 4th Industrial Revolution Mean for Us?

#artificialintelligence

The magic wand of blockchain technology has touched our lives in multiple ways over the last decade. It has made cryptocurrency traders out of ordinary investors who would even shy away from the traditional stock markets. It has provided us with an easy way of transferring money across borders, without the interference of banks. On a larger scale, blockchain has provided enterprises with a trustless and tamper-proof apparatus for tracking assets, verifying identity, settling contracts, and so much more. New industries based on asset tokenization and decentralized finance (DeFi) have been created on the strength of blockchain technology.


Learning to Prove Theorems by Learning to Generate Theorems

arXiv.org Artificial Intelligence

We consider the task of automated theorem proving, a key AI task. Deep learning has shown promise for training theorem provers, but there are limited human-written theorems and proofs available for supervised learning. To address this limitation, we propose to learn a neural generator that automatically synthesizes theorems and proofs for the purpose of training a theorem prover. Experiments on real-world tasks demonstrate that synthetic data from our approach improves the theorem prover and advances the state of the art of automated theorem proving in Metamath.


Kalman meets Bellman: Improving Policy Evaluation through Value Tracking

arXiv.org Machine Learning

Policy evaluation is a key process in Reinforcement Learning (RL). It assesses a given policy by estimating the corresponding value function. When using parameterized value functions, common approaches minimize the sum of squared Bellman temporal-difference errors and receive a point-estimate for the parameters. Kalman-based and Gaussian-processes based frameworks were suggested to evaluate the policy by treating the value as a random variable. These frameworks can learn uncertainties over the value parameters and exploit them for policy exploration. When adopting these frameworks to solve deep RL tasks, several limitations are revealed: excessive computations in each optimization step, difficulty with handling batches of samples which slows training and the effect of memory in stochastic environments which prevents off-policy learning. In this work, we discuss these limitations and propose to overcome them by an alternative general framework, based on the extended Kalman filter. We devise an optimization method, called Kalman Optimization for Value Approximation (KOVA) that can be incorporated as a policy evaluation component in policy optimization algorithms. KOVA minimizes a regularized objective function that concerns both parameter and noisy return uncertainties. We analyze the properties of KOVA and present its performance on deep RL control tasks.


4D Semantic Cardiac Magnetic Resonance Image Synthesis on XCAT Anatomical Model

arXiv.org Machine Learning

We propose a hybrid controllable image generation method to synthesize anatomically meaningful 3D t labeled Cardiac Magnetic Resonance (CMR) images. Our hybrid method takes the mechanistic 4D eXtended CArdiac Torso (XCAT) heart model as the anatomical ground truth and synthesizes CMR images via a data-driven Generative Adversarial Network (GAN). We employ the state-of-the-art SPatially Adaptive De-normalization (SPADE) technique for conditional image synthesis to preserve the semantic spatial information of ground truth anatomy. Using the parameterized motion model of the XCAT heart, we generate labels for 25 time frames of the heart for one cardiac cycle at 18 locations for the short axis view. Subsequently, realistic images are generated from these labels, with modality-specific features that are learned from real CMR image data. We demonstrate that style transfer from another cardiac image can be accomplished by using a style encoder network. Due to the flexibility of XCAT in creating new heart models, this approach can result in a realistic virtual population to address different challenges the medical image analysis research community is facing such as expensive data collection. Our proposed method has a great potential to synthesize 4D controllable CMR images with annotations and adaptable styles to be used in various supervised multi-site, multi-vendor applications in medical image analysis.


Regularized Training and Tight Certification for Randomized Smoothed Classifier with Provable Robustness

arXiv.org Machine Learning

Recently smoothing deep neural network based classifiers via isotropic Gaussian perturbation is shown to be an effective and scalable way to provide state-of-the-art probabilistic robustness guarantee against $\ell_2$ norm bounded adversarial perturbations. However, how to train a good base classifier that is accurate and robust when smoothed has not been fully investigated. In this work, we derive a new regularized risk, in which the regularizer can adaptively encourage the accuracy and robustness of the smoothed counterpart when training the base classifier. It is computationally efficient and can be implemented in parallel with other empirical defense methods. We discuss how to implement it under both standard (non-adversarial) and adversarial training scheme. At the same time, we also design a new certification algorithm, which can leverage the regularization effect to provide tighter robustness lower bound that holds with high probability. Our extensive experimentation demonstrates the effectiveness of the proposed training and certification approaches on CIFAR-10 and ImageNet datasets.


Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium

arXiv.org Machine Learning

We develop provably efficient reinforcement learning algorithms for two-player zero-sum Markov games in which the two players simultaneously take actions. To incorporate function approximation, we consider a family of Markov games where the reward function and transition kernel possess a linear structure. Both the offline and online settings of the problems are considered. In the offline setting, we control both players and the goal is to find the Nash Equilibrium efficiently by minimizing the worst-case duality gap. In the online setting, we control a single player to play against an arbitrary opponent and the goal is to minimize the regret. For both settings, we propose an optimistic variant of the least-squares minimax value iteration algorithm. We show that our algorithm is computationally efficient and provably achieves an $\tilde O(\sqrt{d^3 H^3 T})$ upper bound on the duality gap and regret, without requiring additional assumptions on the sampling model. We highlight that our setting requires overcoming several new challenges that are absent in Markov decision processes or turn-based Markov games. In particular, to achieve optimism in simultaneous-move Marko games, we construct both upper and lower confidence bounds of the value function, and then compute the optimistic policy by solving a general-sum matrix game with these bounds as the payoff matrices. As finding the Nash Equilibrium of such a general-sum game is computationally hard, our algorithm instead solves for a Coarse Correlated Equilibrium (CCE), which can be obtained efficiently via linear programming. To our best knowledge, such a CCE-based scheme for implementing optimism has not appeared in the literature and might be of interest in its own right.


Augmented Normalizing Flows: Bridging the Gap Between Generative Flows and Latent Variable Models

arXiv.org Machine Learning

In this work, we propose a new family of generative flows on an augmented data space, with an aim to improve expressivity without drastically increasing the computational cost of sampling and evaluation of a lower bound on the likelihood. Theoretically, we prove the proposed flow can approximate a Hamiltonian ODE as a universal transport map. Empirically, we demonstrate state-of-the-art performance on standard benchmarks of flow-based generative modeling.


Investigating the Compositional Structure Of Deep Neural Networks

arXiv.org Machine Learning

The current understanding of deep neural networks can only partially explain how input structure, network parameters and optimization algorithms jointly contribute to achieve the strong generalization power that is typically observed in many real-world applications. In order to improve the comprehension and interpretability of deep neural networks, we here introduce a novel theoretical framework based on the compositional structure of piecewise linear activation functions. By defining a direct acyclic graph representing the composition of activation patterns through the network layers, it is possible to characterize the instances of the input data with respect to both the predicted label and the specific (linear) transformation used to perform predictions. Preliminary tests on the MNIST dataset show that our method can group input instances with regard to their similarity in the internal representation of the neural network, providing an intuitive measure of input complexity.