AITopics

Industry: Banking & Finance (0.56)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (0.77)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.53)

arXiv.org Machine Learning Nov-14-2016

Benchmarking Quantum Hardware for Training of Fully Visible Boltzmann Machines

Korenkevych, Dmytro, Xue, Yanbo, Bian, Zhengbing, Chudak, Fabian, Macready, William G., Rolfe, Jason, Andriyash, Evgeny

Quantum annealing (QA) is a hardware-based heuristic optimization and sampling method applicable to discrete undirected graphical models. While similar to simulated annealing, QA relies on quantum, rather than thermal, effects to explore complex search spaces. For many classes of problems, QA is known to offer computational advantages over simulated annealing. Here we report on the ability of recent QA hardware to accelerate training of fully visible Boltzmann machines. We characterize the sampling distribution of QA hardware, and show that in many cases, the quantum distributions differ significantly from classical Boltzmann distributions. In spite of this difference, training (which seeks to match data and model statistics) using standard classical gradient updates is still effective. We investigate the use of QA for seeding Markov chains as an alternative to contrastive divergence (CD) and persistent contrastive divergence (PCD). Using $k=50$ Gibbs steps, we show that for problems with high-energy barriers between modes, QA-based seeds can improve upon chains with CD and PCD initializations. For these hard problems, QA gradient estimates are more accurate, and allow for faster learning. Furthermore, and interestingly, even the case of raw QA samples (that is, $k=0$) achieved similar improvements. We argue that this relates to the fact that we are training a quantum rather than classical Boltzmann distribution in this case. The learned parameters give rise to hardware QA distributions closely approximating classical Boltzmann distributions that are hard to train with CD/PCD.
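The seeding idea in this abstract can be sketched in a few lines. The following is a minimal illustration, not the authors' code: it assumes spins in {-1, +1}, an energy of the form E(s) = -sum_i h_i s_i - sum_{i<j} J_ij s_i s_j with a symmetric zero-diagonal J, and hypothetical helper names (`gibbs_sweep`, `coupling_gradient`). The seeds passed in could be data points (CD), a persistent chain (PCD), or samples from annealing hardware; `k=0` uses the seeds as-is.

```python
import math
import random


def sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def gibbs_sweep(spins, h, J, rng):
    """One in-place Gibbs sweep over spins in {-1, +1} (assumed convention)."""
    n = len(spins)
    for i in range(n):
        field = h[i] + sum(J[i][j] * spins[j] for j in range(n) if j != i)
        # P(s_i = +1 | rest) = sigmoid(2 * local field) for the assumed energy.
        spins[i] = 1 if rng.random() < sigmoid(2.0 * field) else -1
    return spins


def mean_pair_products(samples):
    """Empirical second moments <s_i s_j> over a list of spin vectors."""
    n = len(samples[0])
    count = len(samples)
    return [[sum(s[i] * s[j] for s in samples) / count for j in range(n)]
            for i in range(n)]


def coupling_gradient(data, seeds, h, J, k, rng):
    """Estimate d(log-likelihood)/dJ as data statistics minus model statistics.

    Model statistics come from chains started at `seeds` and run for k sweeps.
    """
    chains = [list(seed) for seed in seeds]
    for chain in chains:
        for _ in range(k):
            gibbs_sweep(chain, h, J, rng)
    data_stats = mean_pair_products(data)
    model_stats = mean_pair_products(chains)
    n = len(h)
    return [[data_stats[i][j] - model_stats[i][j] for j in range(n)]
            for i in range(n)]
```

Only the source of `seeds` changes between CD, PCD, and hardware-seeded training in this sketch; the gradient update itself stays the same.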

artificial intelligence, deep learning, machine learning, (15 more...)

1611.04528

Country:

North America > Canada (0.46)
North America > United States (0.28)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

arXiv.org Machine Learning Nov-14-2016

Practical Secure Aggregation for Federated Learning on User-Held Data

Bonawitz, Keith, Ivanov, Vladimir, Kreuter, Ben, Marcedone, Antonio, McMahan, H. Brendan, Patel, Sarvar, Ramage, Daniel, Segal, Aaron, Seth, Karn

Secure Aggregation protocols allow a collection of mutually distrustful parties, each holding a private value, to collaboratively compute the sum of those values without revealing the values themselves. We consider training a deep neural network in the Federated Learning model, using distributed stochastic gradient descent across user-held training data on mobile devices, wherein Secure Aggregation protects each user's model gradient. We design a novel, communication-efficient Secure Aggregation protocol for high-dimensional data that tolerates up to 1/3 of users failing to complete the protocol. For 16-bit input values, our protocol offers 1.73x communication expansion for $2^{10}$ users and $2^{20}$-dimensional vectors, and 1.98x expansion for $2^{14}$ users and $2^{24}$-dimensional vectors.
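The core goal, summing private vectors without exposing any single one, can be illustrated with generic pairwise additive masking. This is a toy sketch, not the protocol from the paper: it does not handle users dropping out, and the masks are drawn from one local random generator purely for illustration, whereas a real protocol would derive each pairwise mask from a secret shared only by that pair. The names `pairwise_masks`, `mask_inputs`, and `aggregate` are hypothetical, and arithmetic modulo 2^16 is an assumption chosen to match the 16-bit inputs mentioned above.

```python
import random

MODULUS = 2 ** 16  # assumption: 16-bit values, all arithmetic mod 2^16


def pairwise_masks(num_users, dim, seed=0):
    """Build one mask per user so that all masks sum to zero mod MODULUS.

    For each pair (u, v) with u < v, user u adds a random vector and
    user v subtracts the same vector, so the pair's contribution cancels.
    """
    rng = random.Random(seed)
    masks = [[0] * dim for _ in range(num_users)]
    for u in range(num_users):
        for v in range(u + 1, num_users):
            shared = [rng.randrange(MODULUS) for _ in range(dim)]
            for i in range(dim):
                masks[u][i] = (masks[u][i] + shared[i]) % MODULUS
                masks[v][i] = (masks[v][i] - shared[i]) % MODULUS
    return masks


def mask_inputs(inputs, masks):
    """Each user uploads input + mask instead of the raw input."""
    return [[(x + m) % MODULUS for x, m in zip(vec, mask)]
            for vec, mask in zip(inputs, masks)]


def aggregate(masked):
    """Server-side sum; the pairwise masks cancel in the total."""
    dim = len(masked[0])
    return [sum(vec[i] for vec in masked) % MODULUS for i in range(dim)]


inputs = [[1, 2, 3], [10, 20, 30], [100, 200, 300]]
masked = mask_inputs(inputs, pairwise_masks(len(inputs), 3))
total = aggregate(masked)  # equals the element-wise sum of inputs mod 2^16
```

The server only ever sees the masked vectors, yet the aggregate matches the true sum because every mask is added once and subtracted once.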

artificial intelligence, deep learning, machine learning, (17 more...)

1611.04482

Country: North America > United States (0.68)

Genre: Research Report (0.40)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.55)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)

Xiang, Xiang, Tran, Trac D.

Pose-Selective Max Pooling for Measuring Similarity

arXiv.org Artificial Intelligence Nov-13-2016

In this paper, we deal with two challenges for measuring the similarity of subject identities in practical video-based face recognition: the variation of head pose in uncontrolled environments and the computational expense of processing videos. Since the frame-wise feature mean is unable to characterize the pose diversity among frames, we define and preserve the overall pose diversity and closeness in a video. Then, identity will be the only source of variation across videos since the pose varies even within a single video. Instead of simply using all the frames, we select those faces whose pose point is closest to the centroid of the K-means cluster containing that pose point. Then, we represent a video as a bag of frame-wise deep face features, reducing the number of features from hundreds to K. Since this video representation captures the identity well, we measure the subject similarity between two videos as the max correlation among all possible pairs in the two bags of features. On the official 5,000 video pairs of the YouTube Face dataset for face verification, our algorithm achieves performance comparable to VGG-face, which averages over the deep features of all frames. Other vision tasks can also benefit from the generic idea of employing geometric cues to improve the descriptiveness of deep features.
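The two steps described here, keeping one frame per pose cluster and then scoring two videos by their best-matching pair of features, can be sketched as follows. This is an illustration under stated assumptions rather than the authors' implementation: cluster labels are taken as given (from any K-means routine), cosine similarity stands in for the "correlation" in the abstract, and the function names are hypothetical.

```python
import math


def select_key_frames(poses, labels):
    """Return, per cluster, the index of the frame nearest the cluster centroid.

    poses:  list of pose points (tuples of floats), one per frame
    labels: cluster label per frame, assumed to come from K-means
    """
    clusters = {}
    for idx, label in enumerate(labels):
        clusters.setdefault(label, []).append(idx)

    selected = []
    for label in sorted(clusters):
        members = clusters[label]
        dim = len(poses[members[0]])
        centroid = [sum(poses[i][d] for i in members) / len(members)
                    for d in range(dim)]
        nearest = min(members, key=lambda i: math.dist(poses[i], centroid))
        selected.append(nearest)
    return selected


def cosine(a, b):
    """Cosine similarity, used here as an assumed stand-in for correlation."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm


def video_similarity(bag_a, bag_b):
    """Max pooling over all cross-video feature pairs."""
    return max(cosine(a, b) for a in bag_a for b in bag_b)
```

In use, `select_key_frames` would pick K frame indices per video, the deep features of those frames form the bag, and `video_similarity` compares two bags with K x K pairwise scores instead of hundreds squared.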

artificial intelligence, machine learning, video, (16 more...)

1609.07042

Country: North America > United States (0.28)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Vision > Face Recognition (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Sensing and Signal Processing > Image Processing (0.94)
(2 more...)

Fraccaro, Marco, Sønderby, Søren Kaae, Paquet, Ulrich, Winther, Ole

Sequential Neural Models with Stochastic Layers

arXiv.org Machine Learning Nov-13-2016

How can we efficiently propagate uncertainty in a latent state representation with recurrent neural networks? This paper introduces stochastic recurrent neural networks, which glue a deterministic recurrent neural network and a state space model together to form a stochastic and sequential neural generative model. The clear separation of deterministic and stochastic layers allows a structured variational inference network to track the factorization of the model's posterior distribution. By retaining both the nonlinear recursive structure of a recurrent neural network and averaging over the uncertainty in a latent path, like a state space model, we improve the state-of-the-art results on the Blizzard and TIMIT speech modeling data sets by a large margin, while achieving performance comparable to competing methods on polyphonic music modeling.

artificial intelligence, machine learning, neural network, (17 more...)

1605.07571

Genre: Research Report (0.64)

Industry:

Media > Music (0.88)
Leisure & Entertainment (0.88)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Kusner, Matt J., Hernández-Lobato, José Miguel

GANS for Sequences of Discrete Elements with the Gumbel-softmax Distribution

arXiv.org Machine Learning Nov-12-2016

Generative Adversarial Networks (GANs) have limitations when the goal is to generate sequences of discrete elements. The reason for this is that samples from a distribution on discrete objects such as the multinomial are not differentiable with respect to the distribution parameters. This problem can be avoided by using the Gumbel-softmax distribution, which is a continuous approximation to a multinomial distribution parameterized in terms of the softmax function. In this work, we evaluate the performance of GANs based on recurrent neural networks with Gumbel-softmax output distributions in the task of generating sequences of discrete elements.
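For readers unfamiliar with the Gumbel-softmax distribution, a single sample can be drawn as below. This is a minimal standalone sketch, not code from the paper: the function name, the `temperature` parameter name, and the clamp that keeps the uniform draw away from zero are illustrative choices. In a real model the same computation would run inside an automatic-differentiation framework so that gradients flow back to the logits.

```python
import math
import random


def gumbel_softmax_sample(logits, temperature, rng):
    """Draw one relaxed (continuous) sample over len(logits) categories.

    Adds independent Gumbel(0, 1) noise to each logit, then applies a
    softmax with the given temperature. Lower temperatures push the
    result toward a one-hot vector; higher ones toward uniform.
    """
    noisy = []
    for logit in logits:
        u = max(rng.random(), 1e-12)        # avoid log(0)
        gumbel = -math.log(-math.log(u))    # Gumbel(0, 1) noise
        noisy.append((logit + gumbel) / temperature)

    # Softmax with the maximum subtracted for numerical stability.
    top = max(noisy)
    exps = [math.exp(v - top) for v in noisy]
    total = sum(exps)
    return [e / total for e in exps]


rng = random.Random(0)
sample = gumbel_softmax_sample([1.0, 0.5, -0.5], temperature=1.0, rng=rng)
```

The output is a point on the probability simplex rather than a hard category, which is what makes it differentiable with respect to the logits.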

artificial intelligence, machine learning, sequence, (16 more...)

1611.04051

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.59)

#artificialintelligence Nov-11-2016, 02:55:14 GMT

Oxford Scientists Have an AI That Can Read Your Lips

Lip reading is a way of understanding speech by interpreting a person's lip movements. However, human speech is highly complex and nuanced, and one lip movement can correspond to several different phonemes, or basic units of sound. The practice is therefore prone to errors, which can sometimes lead to humorous results. Scientists from Oxford University have described an artificial intelligence system, called LipNet, which can accurately read lips. The system employs deep learning to train itself using 29,000 three-second-long videos labeled with captions.

artificial intelligence, deep learning, machine learning, (8 more...)

Country: Europe > United Kingdom > England > Oxfordshire > Oxford (0.27)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.40)

#artificialintelligence Nov-11-2016, 02:55:10 GMT

AI for UAVs - Association for Unmanned Vehicle Systems International

Artificial Intelligence is affecting almost every industry and is transforming the way businesses operate. The combination of new algorithms, big data, and GPUs has made it possible to address problems that were not practically solvable until now. During this webinar we'll provide an overview of the different AI and deep learning applications for UAVs, including warehouse management, aerial inspection, search and rescue, and agriculture, and explain how these applications can be easily deployed via Jetson.

artificial intelligence, deep learning, machine learning, (3 more...)

Industry: Food & Agriculture > Agriculture (0.35)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.75)

#artificialintelligence Nov-11-2016, 01:31:41 GMT

7 Key Factors Driving the Artificial Intelligence Revolution

Under, behind and inside many of the apps we use every day, a revolution is underway. It's a revolution that started decades ago but today is empowering companies to deliver better, smarter services with greater ease and on broader scales than ever before. At Singularity University's inaugural Global Summit, Neil Jacobstein, chair of Artificial Intelligence and Robotics, provided a primer showing how artificial intelligence literally transforms everything it touches. First of all, it's critical to define the scope of artificial intelligence (AI), which can be categorized into four areas: techniques in pattern recognition, software agency (that is, software that acts like real users), an exponential technology that is accelerating other exponential technologies, and a vision of a future superhuman intelligence (that fortunately hasn't happened yet). Anyone who has seen a science fiction film is likely familiar with this last area, but it's the other three areas where AI is making huge strides at a revolutionary pace.

machine learning, pattern recognition, revolution, (16 more...)

Country:

North America > United States > Virginia (0.05)
Asia > China (0.05)

Industry: Information Technology (0.30)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.49)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.36)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.32)

Li, Chun-Liang, Ravanbakhsh, Siamak, Poczos, Barnabas

Annealing Gaussian into ReLU: a New Sampling Strategy for Leaky-ReLU RBM

arXiv.org Machine Learning Nov-11-2016

Abstract: Restricted Boltzmann Machine (RBM) is a bipartite graphical model that is used as the building block in energy-based deep generative models. Due to numerical stability and quantifiability of the likelihood, RBM is commonly used with Bernoulli units. Here, we consider an alternative member of the exponential family RBM with leaky rectified linear units, called leaky RBM. We first study the joint and marginal distributions of leaky RBM under different leakiness, which provides important insights by connecting the leaky RBM model and truncated Gaussian distributions. The connection leads us to a simple yet efficient method for sampling from this model, where the basic idea is to anneal the leakiness rather than the energy, i.e., start from a fully Gaussian/Linear unit and gradually decrease the leakiness over iterations. This serves as an alternative to the annealing of the temperature parameter and enables numerical estimation of the likelihood that is more efficient and more accurate than the commonly used annealed importance sampling (AIS). We further demonstrate that the proposed sampling algorithm enjoys a faster mixing property than the contrastive divergence algorithm, which benefits the training without any additional computational cost. 1 Introduction: In this paper, we are interested in deep generative models. There is a family of directed deep generative models which can be trained by back-propagation (e.g., Kingma & Welling, 2013; Goodfellow et al., 2014). The other family is the deep energy-based models, including deep belief network (Hinton et al., 2006) and deep Boltzmann machine (Salakhutdinov & Hinton, 2009).

algorithm, artificial intelligence, machine learning, (17 more...)

1611.03879

Country: North America (0.28)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.65)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.50)