AITopics | Pattern Recognition

Pattern Recognition

"... the research area that studies the operation and design of systems that recognize patterns in data." It includes statistical methods like discriminant analysis, feature extraction, error estimation, cluster analysis.
– Pattern Recognition Laboratory at Delft University of Technology

Bilevel Distance Metric Learning for Robust Image Recognition

Xu, Jie, Luo, Lei, Deng, Cheng, Huang, Heng

Neural Information Processing Systems, Feb-14-2020, 14:12:34 GMT

Metric learning, which aims to learn a discriminative Mahalanobis distance matrix M that effectively reflects the similarity between data samples, has been widely studied in various image recognition problems. Most existing metric learning methods take as input features extracted directly from the original data in the preprocessing phase. Worse, these features usually take no account of the local geometrical structure of the data or of the noise present in it, so they may not be optimal for the subsequent metric learning task. In this paper, we integrate both feature extraction and metric learning into one joint optimization framework and propose a new bilevel distance metric learning model. Specifically, the lower level characterizes the intrinsic data structure using graph regularized sparse coefficients, while the upper level forces the data samples from the same class to be close to each other and pushes those from different classes far away.
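To make the role of the matrix M concrete, here is a minimal sketch of the Mahalanobis distance that such a matrix defines. It does not reproduce the paper's bilevel model or its training procedure; the function name and the toy diagonal matrix are hypothetical and chosen only for illustration.

```python
import math

def mahalanobis_distance(x, y, M):
    """Return sqrt((x - y)^T M (x - y)) for a positive semidefinite matrix M."""
    diff = [xi - yi for xi, yi in zip(x, y)]
    # Matrix-vector product M @ diff, written out with plain lists.
    m_diff = [sum(m_ij * d_j for m_ij, d_j in zip(row, diff)) for row in M]
    quad = sum(d_i * md_i for d_i, md_i in zip(diff, m_diff))
    return math.sqrt(quad)

# Hypothetical toy metric (not learned): weights the first axis heavily
# and the second axis lightly.
M = [[4.0, 0.0],
     [0.0, 0.25]]

a = [0.0, 0.0]
b = [1.0, 0.0]  # differs from a along the heavily weighted axis
c = [0.0, 1.0]  # differs from a along the lightly weighted axis

d_ab = mahalanobis_distance(a, b, M)
d_ac = mahalanobis_distance(a, c, M)
```

With the identity matrix in place of M, the function reduces to the ordinary Euclidean distance. A learned M instead stretches the directions that separate classes and shrinks the ones that do not.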

bilevel distance metric learning, data sample, robust image recognition

Technology: Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition > Image Matching (0.65)

Generative Shape Models: Joint Text Recognition and Segmentation with Very Little Training Data

Lou, Xinghua, Kansky, Ken, Lehrach, Wolfgang, Laan, CC, Marthi, Bhaskara, Phoenix, D., George, Dileep

Neural Information Processing Systems, Feb-14-2020, 12:27:10 GMT

We demonstrate that a generative model for object shapes can achieve state of the art results on challenging scene text recognition tasks, and with orders of magnitude fewer training images than required for competing discriminative methods. In addition to transcribing text from challenging images, our method performs fine-grained instance segmentation of characters. We show that our model is more robust to both affine transformations and non-affine deformations compared to previous approaches. Papers published at the Neural Information Processing Systems Conference.

generative shape model, joint text recognition and segmentation, training data

Technology: Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition > Text Recognition (0.71)

Deep Speaker Embeddings for Far-Field Speaker Recognition on Short Utterances

Gusev, Aleksei, Volokhov, Vladimir, Andzhukaev, Tseren, Novoselov, Sergey, Lavrentyeva, Galina, Volkova, Marina, Gazizullina, Alice, Shulipa, Andrey, Gorlanov, Artem, Avdeeva, Anastasia, Ivanov, Artem, Kozlov, Alexander, Pekhovsky, Timur, Matveev, Yuri

arXiv.org Machine Learning, Feb-14-2020

Speaker recognition systems based on deep speaker embeddings have achieved significant performance in controlled conditions according to the results obtained for early NIST SRE (Speaker Recognition Evaluation) datasets. From a practical point of view, given the increased interest in virtual assistants (such as Amazon Alexa, Google Home, Apple Siri, etc.), speaker verification on short utterances in uncontrolled, noisy environments is one of the most challenging and most in-demand tasks. This paper presents approaches aimed at two goals: a) improving the quality of far-field speaker verification systems in the presence of environmental noise and reverberation, and b) reducing the degradation in system quality for short utterances. For these purposes, we considered deep neural network architectures based on TDNN (Time Delay Neural Network) and ResNet (Residual Neural Network) blocks. We experimented with state-of-the-art embedding extractors and their training procedures. The results confirm that ResNet architectures outperform the standard x-vector approach in terms of speaker verification quality for both long-duration and short-duration utterances. We also investigate the impact of the speech activity detector, different scoring models, adaptation, and score normalization techniques. The experimental results are presented for publicly available data and verification protocols for the VoxCeleb1, VoxCeleb2, and VOiCES datasets.
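As a rough illustration of how embedding-based verification scores a trial, the sketch below compares an enrollment embedding with a test embedding using cosine similarity and a decision threshold. Cosine scoring is one common choice and is an assumption here: the abstract does not say which scoring models the authors used. The three-dimensional vectors and the threshold value are hypothetical stand-ins for real extractor outputs.

```python
import math

def cosine_score(e1, e2):
    """Cosine similarity between two embedding vectors of equal length."""
    dot = sum(a * b for a, b in zip(e1, e2))
    norm1 = math.sqrt(sum(a * a for a in e1))
    norm2 = math.sqrt(sum(b * b for b in e2))
    return dot / (norm1 * norm2)

def verify(enroll, test, threshold=0.5):
    """Accept the trial when the score reaches the (hypothetical) threshold."""
    return cosine_score(enroll, test) >= threshold

# Toy embeddings; real x-vector or ResNet embeddings have hundreds of dimensions.
enroll = [1.0, 0.0, 1.0]
same_speaker = [0.9, 0.1, 1.1]
other_speaker = [-1.0, 0.5, 0.0]

accepted = verify(enroll, same_speaker)
rejected = not verify(enroll, other_speaker)
```

In practice the threshold is tuned on held-out trials, and raw scores are often normalized before the decision is made.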

extractor, protocol, voices, (16 more...)

arXiv:2002.06033

Country:

Asia > Russia (0.14)
Europe > Austria > Styria > Graz (0.05)
Europe > Sweden > Stockholm > Stockholm (0.05)
(14 more...)

Genre: Research Report > New Finding (0.88)

Industry: Information Technology > Security & Privacy (0.66)

Technology:

Information Technology > Artificial Intelligence > Speech (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition > Speech Recognition (0.83)

x-vectors meet emotions: A study on dependencies between emotion and speaker recognition

Pappagari, Raghavendra, Wang, Tianzi, Villalba, Jesus, Chen, Nanxin, Dehak, Najim

arXiv.org Machine Learning, Feb-12-2020

In this work, we explore the dependencies between speaker recognition and emotion recognition. We first show that knowledge learned for speaker recognition can be reused for emotion recognition through transfer learning. Then, we show the effect of emotion on speaker recognition. For emotion recognition, we show that a simple linear model is enough to obtain good performance on features extracted from pre-trained models such as the x-vector model. Then, we improve emotion recognition performance by fine-tuning for emotion classification. We evaluated our experiments on three different types of datasets: IEMOCAP, MSP-Podcast, and Crema-D. By fine-tuning, we obtained 30.40%, 7.99%, and 8.61% absolute improvement on IEMOCAP, MSP-Podcast, and Crema-D, respectively, over the baseline model with no pre-training. Finally, we present results on the effect of emotion on speaker verification. We observed that speaker verification performance is sensitive to changes in the test speaker's emotions. We found that trials with angry utterances performed worst in all three datasets. We hope our analysis will initiate a new line of research in the speaker recognition community.

dataset, emotion recognition, recognition, (15 more...)

arXiv:2002.05039

Country:

North America > United States > Texas (0.04)
North America > United States > Maryland > Baltimore (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
Asia > South Korea > Gyeonggi-do > Suwon (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Emotion (0.99)

AI Online Filters to Real World Image Recognition

Xiao, Hai, Shang, Jin, Huang, Mengyuan

arXiv.org Artificial Intelligence, Feb-11-2020

Deep artificial neural networks trained with labeled data sets are widely used in numerous vision and robotics applications today. In AI terms, these are called reflex models, referring to the fact that they do not self-evolve or actively adapt to environmental changes. As demand for intelligent robot control expands to many high-level tasks, reinforcement learning and state-based models play an increasingly important role. Here, in the computer vision and robotics domain, we study a novel approach that adds reinforcement controls to image recognition reflex models to attain better overall performance, specifically over a wider range of environments than the task reflex models are expected to handle. Following a common infrastructure with environment sensing and AI-based modeling of self-adaptive agents, we implement multiple types of AI control agents. To this end, we provide comparative results of these agents against a baseline, and an insightful analysis of their benefit in improving overall image recognition performance in the real world.

accuracy, agent, de-noise filter, (17 more...)

arXiv:2002.08242

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.04)
North America > United States > Florida > Broward County > Fort Lauderdale (0.04)
North America > United States > California > Santa Clara County > Sunnyvale (0.04)
North America > Canada > Quebec > Montreal (0.04)

Genre: Research Report > New Finding (0.69)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition > Image Matching (0.82)

Why Did Humans Evolve Pattern Recognition Abilities? Cognition Today

#artificialintelligence, Feb-9-2020, 19:34:06 GMT

These mechanisms emerge as a response to patterns in the environment or enable us to refine our ability to spot them. Pattern recognition skills sit at the helm of our basic cognitive architecture. A common problem during hunting is to estimate how many predators there are – based on cues like animal sounds, footprints, etc. Say a pack of 4 hunters is trying to isolate a prey for food. The hunters can only survive if they have the physical capability to defend themselves and successfully kill or escape. If they do not have the ability, they will die.

brain, information, pattern recognition, (13 more...)

Country: Asia > India > Maharashtra > Pune (0.04)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.96)

Technology:

Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.76)

Automatic Speech Transcription And Speaker Recognition Simultaneously Using Apple AI

#artificialintelligence, Feb-9-2020, 13:44:58 GMT

Last year, Apple faced several controversies over its speech recognition technology. To provide quality control for the company's voice assistant Siri, Apple had its contractors regularly listen to confidential voice recordings under the "Siri Grading Program". The company later apologised and published a statement announcing changes to the Siri grading program. This year, the tech giant has put a number of researchers to work on speech recognition technology to upgrade its voice assistant. Recently, researchers at Apple developed an AI model that can perform automatic speech transcription and speaker recognition simultaneously.

automatic speech transcription, bilstm layer, training data, (9 more...)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition > Speech Recognition (0.72)

Improving S&P stock prediction with time series stock similarity

Sidi, Lior

arXiv.org Machine Learning, Feb-8-2020

Stock market prediction with forecasting algorithms is a popular topic these days, yet most forecasting algorithms train only on data collected for a particular stock. In this paper, we enrich the stock data with related stocks, just as a professional trader would, to improve the stock prediction models. We tested five different similarity functions and found that co-integration similarity yields the greatest improvement in the prediction model. We evaluate the models on seven S&P stocks from various industries over a five-year period. The prediction model trained on similar stocks had significantly better results, with a mean accuracy of 0.55 and a profit of 19.782, compared to the state-of-the-art model with an accuracy of 0.52 and a profit of 6.6.

configuration, prediction, similarity, (16 more...)

arXiv:2002.05784

Country:

North America > United States > New York > New York County > New York City (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
Asia > Middle East > Israel (0.04)

Genre: Research Report > New Finding (0.68)

Industry: Banking & Finance > Trading (1.00)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.68)

The Science Of Patterns

#artificialintelligence, Feb-7-2020, 01:35:31 GMT

Humans are natural pattern recognizers. Whether, as in prehistoric times, we were recognizing danger in a telltale rustle of the bushes or skimming a page of letters and numbers today, we use patterns to derive meaning without having to do a more detailed inspection. Futurist and entrepreneur Ray Kurzweil considers pattern recognition so important that in his recent book, How to Create A Mind, he argued that pattern recognition and intelligence are essentially the same thing. Expertise, in essence, is the familiarity of patterns of a specific field. Today, machines are learning to recognize patterns as well.

bible code, pattern recognition, ramsey, (5 more...)

Country: North America > United States > Virginia (0.05)

Industry:

Education > Curriculum > Subject-Specific Education (0.49)
Health & Medicine > Pharmaceuticals & Biotechnology (0.30)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.78)

Natural Language Processing (NLP) Market to Reach USD 80.68 billion by 2026; Increasing Demand for Enhanced Algorithms to Boost Growth, says Fortune Business Insights

#artificialintelligence, Feb-5-2020, 18:02:59 GMT

Key companies covered in the NLP market research report are 3M Company, Adobe Systems Inc., Amazon Web Services Inc., Apple Inc., Google (Alphabet Inc.), Hewlett-Packard Enterprise Company, Intel Corporation, Microsoft Corporation, SAS Institute Inc., and other key market players. The global Natural Language Processing (NLP) market is projected to reach USD 80.68 billion by 2026, exhibiting a CAGR of 32.4% during the forecast period. This information is published by Fortune Business Insights in a report titled "Natural Language Processing (NLP) Market Size, Share & Industry Analysis, By Deployment (On-Premises, Cloud, and Hybrid), By Technology (Interactive Voice Response (IVR), Optical Character Recognition (OCR), Text Analytics, Speech Analytics, Classification and Categorization, Pattern and Image Recognition, and Others), By Industry Vertical (Healthcare, Retail, High Tech and Telecom, BFSI, Automotive & Transportation, Advertising & Media, Manufacturing, and Others) and Regional Forecast, 2019-2026." The report further states that the market was USD 8.61 billion in 2018. It is set to gain momentum from the rising demand for big data, improved algorithms, and powerful computing. What Does the Report Contain?
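The figures quoted above can be checked with the standard compound annual growth rate formula, CAGR = (end / start)^(1 / years) - 1. The sketch below assumes the 2018 value is the baseline and that growth compounds over the eight years to 2026. That period is an assumption, since the excerpt does not state exactly how the report computes its rate.

```python
def cagr(start_value, end_value, years):
    """Compound annual growth rate as a fraction, e.g. 0.10 for 10% per year."""
    return (end_value / start_value) ** (1.0 / years) - 1.0

# Figures quoted in the excerpt, in USD billions.
start_2018 = 8.61
end_2026 = 80.68

# Assumption: eight compounding years, from the 2018 baseline to 2026.
years = 2026 - 2018

implied_rate = cagr(start_2018, end_2026, years)

# Forward check: grow the 2018 value at the reported 32.4% for the same period.
projected_2026 = start_2018 * (1.0 + 0.324) ** years
```

Under this assumption the implied rate comes out slightly above 32% per year, and the forward projection lands near USD 81 billion. Both are close to the reported 32.4% and USD 80.68 billion, with the small gaps plausibly due to rounding or a different baseline year.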

industry analysis, market size, regional forecast, (14 more...)

Country:

Asia > China (0.05)
South America (0.05)
North America > United States (0.05)
(6 more...)

Genre: Research Report (0.69)

Industry:

Information Technology > Services (0.69)
Information Technology > Security & Privacy (0.49)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.55)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.55)
Information Technology > Data Science > Data Mining > Big Data (0.50)
