Goto

Collaborating Authors

 Pattern Recognition


On deep speaker embeddings for text-independent speaker recognition

arXiv.org Machine Learning

We investigate deep neural network performance in the textindependent speaker recognition task. We demonstrate that using angular softmax activation at the last classification layer of a classification neural network instead of a simple softmax activation allows to train a more generalized discriminative speaker embedding extractor. Cosine similarity is an effective metric for speaker verification in this embedding space. We also address the problem of choosing an architecture for the extractor. We found that deep networks with residual frame level connections outperform wide but relatively shallow architectures. This paper also proposes several improvements for previous DNN-based extractor systems to increase the speaker recognition accuracy. We show that the discriminatively trained similarity metric learning approach outperforms the standard LDA-PLDA method as an embedding backend. The results obtained on Speakers in the Wild and NIST SRE 2016 evaluation sets demonstrate robustness of the proposed systems when dealing with close to real-life conditions.


How much do AI gurus really get paid? And is NIPS such a great name for a conference?

#artificialintelligence

A public tax form from OpenAI reveals the crazy salaries of top AI researchers. There are more competitions pushing for improved image recognition models on mobiles, as well as training systems as fast and cheap as possible. Image recognition on mobiles Google has launched another computer vision challenge to push image recognition in real time for mobile phones. The On-Device Visual Intelligence Challenge (ODVI) is part of a workshop track at the Computer Vision and Pattern Recognition conference (CVPR) happening in June later this year in Salt Lake City. It's challenging to build fast, accurate models on small mobile chips given the latency limit.


9 key mistakes organizations make when analyzing data

#artificialintelligence

Data analysis is becoming less of a niche skill and more of a common requirement for jobs and roles of all shapes and sizes. Over the past 20 years, data has gone from relatively scarce to so abundant we aren't sure what to do with it. Gathering and analyzing data is a now part of most jobs within most organizations, either to better understand your role, to measure your results or to guide you in what to do next. Unfortunately, the accessibility and ubiquity of data has led to an increased number of amateur mistakes made in analyzing it--so if you want to improve your own analytic abilities and guard against these mistakes, you need to understand them. If your data is bad, even the best data analyst in the world can't save it from leading to bad conclusions.


The Tsetlin Machine - A Game Theoretic Bandit Driven Approach to Optimal Pattern Recognition with Propositional Logic

arXiv.org Artificial Intelligence

Although simple individually, artificial neurons provide state-of-the-art performance when interconnected in deep networks. Unknown to many, there exists an arguably even simpler and more versatile learning mechanism, namely, the Tsetlin Automaton. Merely by means of a single integer as memory, it learns the optimal action in stochastic environments. In this paper, we introduce the Tsetlin Machine, which solves complex pattern recognition problems with easy-to-interpret propositional formulas, composed by a collective of Tsetlin Automata. To eliminate the longstanding problem of vanishing signal-to-noise ratio, the Tsetlin Machine orchestrates the automata using a novel game. Our theoretical analysis establishes that the Nash equilibria of the game are aligned with the propositional formulas that provide optimal pattern recognition accuracy. This translates to learning without local optima, only global ones. We argue that the Tsetlin Machine finds the propositional formula that provides optimal accuracy, with probability arbitrarily close to unity. In four distinct benchmarks, the Tsetlin Machine outperforms both Neural Networks, SVMs, Random Forests, the Naive Bayes Classifier and Logistic Regression. It further turns out that the accuracy advantage of the Tsetlin Machine increases with lack of data. The Tsetlin Machine has a significant computational performance advantage since both inputs, patterns, and outputs are expressed as bits, while recognition of patterns relies on bit manipulation. The combination of accuracy, interpretability, and computational simplicity makes the Tsetlin Machine a promising tool for a wide range of domains, including safety-critical medicine. Being the first of its kind, we believe the Tsetlin Machine will kick-start completely new paths of research, with a potentially significant impact on the AI field and the applications of AI.


Pattern Discovery in Data Mining Coursera

@machinelearnbot

Module 3 consists of two lessons: Lessons 5 and 6. In Lesson 5, we discuss mining sequential patterns. We will learn several popular and efficient sequential pattern mining methods, including an Apriori-based sequential pattern mining method, GSP; a vertical data format-based sequential pattern method, SPADE; and a pattern-growth-based sequential pattern mining method, PrefixSpan. We will also learn how to directly mine closed sequential patterns. In Lesson 6, we will study concepts and methods for mining spatiotemporal and trajectory patterns as one kind of pattern mining applications.


This software startup just made artificial intelligence breakthrough using General-AI for 3D Object Recognition from any direction Technology Startups News Tech News

#artificialintelligence

The recognition of objects is one of the main goals for computer vision research. Some of the applications include: the automation on the assembly line, inspection of integrated circuit chips to detect defects in them, security in face and fingerprint recognition, medical diagnosis and detection of abnormal cells that may indicate cancer, remote sensing for automated recognition of possible hostile terrain to generate maps and aids for the visually impaired of mechanical guide dogs. However, 3D object recognition has been one of the challenging processes facing computer vision systems. One Maryland-based startup may finally have the answer to the problem. The startup, Z Advanced Computing, announced today that it has made technical and scientific breakthrough towards Machine Learning and Artificial Intelligence (AI), where the various attributes and details of 3D (three dimensional) objects can be recognized from any view or angle, using its novel General-AI techniques.


Facebook users could get up to $5,000 compensation for EVERY picture used without their consent

Daily Mail - Science & tech

Facebook will face a class action law suit in the wake of its privacy scandal, a US federal judge has ruled. Allegations of privacy violations emerged when it was revealed the app used a photo-scanning tool on users' images without their explicit consent. The facial recognition tool, launched in 2010, suggests names for people it identifies in photos uploaded by users. Under Illinois state law, the company could be fined $1,000 to $5,000 (£700 - £3,500) each time a person's image was used without consent. The technology was suspended for users in Europe in 2012 over privacy fears but is still live in the US and other regions worldwide.


Reverse image search engines using out of the box machine learning libraries

@machinelearnbot

We propose a simple, robust, and scalable reverse image search engine that leverages convolutional features from Keras' pre-trained neural networks and the distance metric from Scikit-Learn's K-Nearest Neighbors. We show example queries using data scraped from Google images, and dive deeper in how we use the search engine to track the proliferation of memes from the dark web.


Five Most Popular Open Source Frameworks Used in Machine Learning Analytics Insight

#artificialintelligence

Machine language a branch of artificial intelligence which enables system the ability to learn from data without being programmed. Machine learning got evolved from pattern recognition and computational learning theory in artificial intelligence. It has revolutionized the conventional way through developing algorithms that can learn and make predictions on data. There are innumerable factors that have improved the contribution of machine learning. Open source frameworks are one of the major reasons for the boost in machine learning. A framework is a collection of programs, libraries and languages evolved to use in application development.


Solving Bongard Problems with a Visual Language and Pragmatic Reasoning

arXiv.org Artificial Intelligence

More than 50 years ago Bongard introduced 100 visual concept learning problems as a testbed for intelligent vision systems. These problems are now known as Bongard problems. Although they are well known in the cognitive science and AI communities only moderate progress has been made towards building systems that can solve a substantial subset of them. In the system presented here, visual features are extracted through image processing and then translated into a symbolic visual vocabulary. We introduce a formal language that allows representing complex visual concepts based on this vocabulary. Using this language and Bayesian inference, complex visual concepts can be induced from the examples that are provided in each Bongard problem. Contrary to other concept learning problems the examples from which concepts are induced are not random in Bongard problems, instead they are carefully chosen to communicate the concept, hence requiring pragmatic reasoning. Taking pragmatic reasoning into account we find good agreement between the concepts with high posterior probability and the solutions formulated by Bongard himself. While this approach is far from solving all Bongard problems, it solves the biggest fraction yet.