Goto

Collaborating Authors

 Pattern Recognition


Oakland-based art and tech studio takes critical look at A.I.

#artificialintelligence

Artificial intelligence is not as advanced as you may think. In the last couple of months, Microsoft has had a couple of failed attempts with artificial intelligence. The first involved an image recognition app called Fetch!, which looks at photos of dogs to identify its breed. People, of course, started to use the app to determine what breed of dog people resemble. In doing so, people began to notice that the app identified Asian people either as Pekingese or Chinese Crested dogs.


Google Cloud Machine Learning is sailing into mainstream

#artificialintelligence

Google had an announcement that means strictly business for its push to be known as a leader in cloud services. "Today [Wednesday], "we've taken a major stride forward with the announcement of a new product family: Cloud Machine Learning." The move is all about taking Cloud Machine Learning mainstream, "giving data scientists and developers a way to build a new class of intelligent applications," according to a post from Fausto Ibarra, director, product management, Google Cloud Platform. Blair Hanley Frank of IDG News Service said in so doing, that "Google is making it easier for businesses to take advantage of the machine learning revolution with a new product for building models that predict the future." Cade Metz in Wired said "the company unveiled a new family of cloud computing services that allow any developer or business to use the machine learning technologies that power some of Google's most powerful services." Basically, as Robert Hof in SiliconANGLE put it, "Now that Google has infused the brand of artificial intelligence known as machine learning into everything from search to speech recognition, it's tossing the technology it views as tech's next big wave into the public domain." The announcement was part of events at a two day conference in San Francisco, namely the Google Cloud Platform (GCP) Next 2016. The company said the Cloud Machine Learning provides machine learning services, with pre-trained models and platform so that a person can generate his or her own tailored models. Major Google applications use Cloud Machine Learning, including Photos (image search), the Google app (voice search), Translate, and Inbox (Smart Reply)--but now their platform is available as a cloud service for business applications. Google is not shy about blowing its own horn when it comes to machine learning technology: Compared to other large scale deep learning systems, said Google, "Our neural net-based ML platform has better training performance and increased accuracy." This announcement may be filed under strategy. Robert Hof made the observation: "Google made it clear that it intends to pitch the Cloud Machine Learning services announcement as a key differentiator.


Google Working On Gesture-Based Keyboard For iPhone With GIF, Image And Search Functions: Report

International Business Times

Google may be looking to ensure its control of the world's search market by building a gesture-based smartphone and tablet keyboard designed to work with Apple's iPhone and iPad, which has integrated search functionality, according to sources speaking to the Verge. According to the report, Google employees have been testing the new keyboard for several months already though the search giant has yet to decide when or if it will release the keyboard to Apple's App Store. With the introduction of iOS 8 in 2014, Apple finally opened up its software to allow users install third-party keyboards to replace the stock iOS version. Google's keyboard, like the stock Android keyboard, allows for gesture typing, a feature that allows users to swipe their finger from one letter to the next allowing Google to then guess what word you want based on the shape of the gesture you have made. As well as this gesture feature, Google is said to be integrating GIFs and images directly into the keyboard which would be powered by Google's own image search.


Mapping Temporal Variables into the NeuCube for Improved Pattern Recognition, Predictive Modelling and Understanding of Stream Data

arXiv.org Machine Learning

This paper proposes a new method for an optimized mapping of temporal variables, describing a temporal stream data, into the recently proposed NeuCube spiking neural network architecture. This optimized mapping extends the use of the NeuCube, which was initially designed for spatiotemporal brain data, to work on arbitrary stream data and to achieve a better accuracy of temporal pattern recognition, a better and earlier event prediction and a better understanding of complex temporal stream data through visualization of the NeuCube connectivity. The effect of the new mapping is demonstrated on three bench mark problems. The first one is early prediction of patient sleep stage event from temporal physiological data. The second one is pattern recognition of dynamic temporal patterns of traffic in the Bay Area of California and the last one is the Challenge 2012 contest data set. In all cases the use of the proposed mapping leads to an improved accuracy of pattern recognition and event prediction and a better understanding of the data when compared to traditional machine learning techniques or spiking neural network reservoirs with arbitrary mapping of the variables.


Selective Inference Approach for Statistically Sound Predictive Pattern Mining

arXiv.org Machine Learning

Discovering statistically significant patterns from databases is an important challenging problem. The main obstacle of this problem is in the difficulty of taking into account the selection bias, i.e., the bias arising from the fact that patterns are selected from extremely large number of candidates in databases. In this paper, we introduce a new approach for predictive pattern mining problems that can address the selection bias issue. Our approach is built on a recently popularized statistical inference framework called selective inference. In selective inference, statistical inferences (such as statistical hypothesis testing) are conducted based on sampling distributions conditional on a selection event. If the selection event is characterized in a tractable way, statistical inferences can be made without minding selection bias issue. However, in pattern mining problems, it is difficult to characterize the entire selection process of mining algorithms. Our main contribution in this paper is to solve this challenging problem for a class of predictive pattern mining problems by introducing a novel algorithmic framework. We demonstrate that our approach is useful for finding statistically significant patterns from databases.


Better Computer Go Player with Neural Network and Long-term Prediction

arXiv.org Artificial Intelligence

A BSTRACT Competing with top human players in the ancient game of Go has been a long-term goal of artificial intelligence. Go's high branching factor makes traditional search techniques ineffective, even on leading-edge hardware, and Go's evaluation function could change drastically with one stone change. Recent works [Maddi-son et al. (2015); Clark & Storkey (2015)] show that search is not strictly necessary for machine Go players. A pure pattern-matching approach, based on a Deep Convolutional Neural Network (DCNN) that predicts the next move, can perform as well as Monte Carlo Tree Search (MCTS)-based open source Go engines such as Pachi [Baudis & Gailly (2012)] if its search budget is limited. We extend this idea in our bot nameddarkforest, which relies on a DCNN designed for long-term predictions. Darkforest substantially improves the win rate for pattern-matching approaches against MCTS-based approaches, even with looser search budgets. Against human players, the newest versions, darkfores2, achieve a stable 3d level on KGS Go Server as a ranked bot, a substantial improvement upon the estimated 4k-5k ranks for DCNN reported in Clark & Storkey (2015) based on games against other machine players. Adding MCTS to darkfores2 creates a much stronger player named darkfmcts3: with 5000 rollouts, it beats Pachi with 10k rollouts in all 250 games; with 75k rollouts it achieves a stable 5d level in KGS server, on par with state-of-the-art Go AIs (e.g., Zen, DolBaram, CrazyStone) except for AlphaGo [Silver et al. (2016)]; with 110k rollouts, it won the 3rd place in January KGS Go Tournament. 1 I NTRODUCTION For a long time, computer Go is considered to be a grand challenge in artificial intelligence. Figure 1 shows a simple illustration of the game of Go. Black plays first on an empty board.


The IBM 2016 Speaker Recognition System

arXiv.org Machine Learning

In this paper we describe the recent advancements made in the IBM i-vector speaker recognition system for conversational speech. In particular, we identify key techniques that contribute to significant improvements in performance of our system, and quantify their contributions. The techniques include: 1) a nearest-neighbor discriminant analysis (NDA) approach that is formulated to alleviate some of the limitations associated with the conventional linear discriminant analysis (LDA) that assumes Gaussian class-conditional distributions, 2) the application of speaker- and channel-adapted features, which are derived from an automatic speech recognition (ASR) system, for speaker recognition, and 3) the use of a deep neural network (DNN) acoustic model with a large number of output units (~10k senones) to compute the frame-level soft alignments required in the i-vector estimation process. We evaluate these techniques on the NIST 2010 speaker recognition evaluation (SRE) extended core conditions involving telephone and microphone trials. Experimental results indicate that: 1) the NDA is more effective (up to 35% relative improvement in terms of EER) than the traditional parametric LDA for speaker recognition, 2) when compared to raw acoustic features (e.g., MFCCs), the ASR speaker-adapted features provide gains in speaker recognition performance, and 3) increasing the number of output units in the DNN acoustic model (i.e., increasing the senone set size from 2k to 10k) provides consistent improvements in performance (for example from 37% to 57% relative EER gains over our baseline GMM i-vector system). To our knowledge, results reported in this paper represent the best performances published to date on the NIST SRE 2010 extended core tasks.


Safe Pattern Pruning: An Efficient Approach for Predictive Pattern Mining

arXiv.org Machine Learning

In this paper we study predictive pattern mining problems where the goal is to construct a predictive model based on a subset of predictive patterns in the database. Our main contribution is to introduce a novel method called safe pattern pruning (SPP) for a class of predictive pattern mining problems. The SPP method allows us to efficiently find a superset of all the predictive patterns in the database that are needed for the optimal predictive model. The advantage of the SPP method over existing boosting-type method is that the former can find the superset by a single search over the database, while the latter requires multiple searches. The SPP method is inspired by recent development of safe feature screening. In order to extend the idea of safe feature screening into predictive pattern mining, we derive a novel pruning rule called safe pattern pruning (SPP) rule that can be used for searching over the tree defined among patterns in the database. The SPP rule has a property that, if a node corresponding to a pattern in the database is pruned out by the SPP rule, then it is guaranteed that all the patterns corresponding to its descendant nodes are never needed for the optimal predictive model. We apply the SPP method to graph mining and item-set mining problems, and demonstrate its computational advantage.


Keeping it Short and Simple: Summarising Complex Event Sequences with Multivariate Patterns

arXiv.org Artificial Intelligence

We study how to obtain concise descriptions of discrete multivariate sequential data. In particular, how to do so in terms of rich multivariate sequential patterns that can capture potentially highly interesting (cor)relations between sequences. To this end we allow our pattern language to span over the domains (alphabets) of all sequences, allow patterns to overlap temporally, as well as allow for gaps in their occurrences. We formalise our goal by the Minimum Description Length principle, by which our objective is to discover the set of patterns that provides the most succinct description of the data. To discover high-quality pattern sets directly from data, we introduce DITTO, a highly efficient algorithm that approximates the ideal result very well. Experiments show that DITTO correctly discovers the patterns planted in synthetic data. Moreover, it scales favourably with the length of the data, the number of attributes, the alphabet sizes. On real data, ranging from sensor networks to annotated text, DITTO discovers easily interpretable summaries that provide clear insight in both the univariate and multivariate structure.


Font Identification in Historical Documents Using Active Learning

arXiv.org Machine Learning

Identifying the type of font (e.g., Roman, Blackletter) used in historical documents can help optical character recognition (OCR) systems produce more accurate text transcriptions. Towards this end, we present an active-learning strategy that can significantly reduce the number of labeled samples needed to train a font classifier. Our approach extracts image-based features that exploit geometric differences between fonts at the word level, and combines them into a bag-of-word representation for each page in a document. We evaluate six sampling strategies based on uncertainty, dissimilarity and diversity criteria, and test them on a database containing over 3,000 historical documents with Blackletter, Roman and Mixed fonts. Our results show that a combination of uncertainty and diversity achieves the highest predictive accuracy (89% of test cases correctly classified) while requiring only a small fraction of the data (17%) to be labeled. We discuss the implications of this result for mass digitization projects of historical documents.