AITopics | Pattern Recognition

Pattern Recognition

"... the research area that studies the operation and design of systems that recognize patterns in data." It includes statistical methods like discriminant analysis, feature extraction, error estimation, cluster analysis.
– Pattern Recognition Laboratory at Delft University of Technology

Grouping Local Process Models

Peeva, Viki, van der Aalst, Wil M. P.

arXiv.org Artificial Intelligence, Nov-6-2023

In recent years, process mining emerged as a proven technology to analyze and improve operational processes. An expanding range of organizations using process mining in their daily operation brings a broader spectrum of processes to be analyzed. Some of these processes are highly unstructured, making it difficult for traditional process discovery approaches to discover a start-to-end model describing the entire process. Therefore, the subdiscipline of Local Process Model (LPM) discovery tries to build a set of LPMs, i.e., smaller models that explain sub-behaviors of the process. However, like other pattern mining approaches, LPM discovery algorithms also face the problems of model explosion and model repetition, i.e., the algorithms may create hundreds if not thousands of models, and subsets of them are close in structure or behavior. This work proposes a three-step pipeline for grouping similar LPMs using various process model similarity measures. We demonstrate the usefulness of grouping through a real-life case study, and analyze the impact of different measures, the gravity of repetition in the discovered LPMs, and how it improves after grouping on multiple real event logs.
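The abstract does not spell out the three steps or the similarity measures, so the following is only a minimal sketch of the general idea of grouping similar models. It assumes, purely for illustration, that each LPM is reduced to its set of activity labels, that similarity is Jaccard overlap, and that any two models at or above a threshold end up in the same group. The names `jaccard` and `group_models` are hypothetical and not taken from the paper.

```python
from itertools import combinations


def jaccard(a: set, b: set) -> float:
    """Jaccard overlap of two activity sets (1.0 when both are empty)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def group_models(models: dict, threshold: float = 0.5) -> list:
    """Group model names whose pairwise similarity reaches the threshold.

    Assumption: grouping is transitive (single linkage), implemented
    here with a small union-find structure.
    """
    parent = {name: name for name in models}

    def find(x):
        # Follow parent links to the group representative.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in combinations(models, 2):
        if jaccard(models[a], models[b]) >= threshold:
            parent[find(a)] = find(b)

    groups = {}
    for name in models:
        groups.setdefault(find(name), []).append(name)
    return sorted(sorted(g) for g in groups.values())


if __name__ == "__main__":
    lpms = {
        "m1": {"register", "check", "pay"},
        "m2": {"register", "check", "reject"},
        "m3": {"ship", "invoice"},
    }
    print(group_models(lpms, threshold=0.5))
```

A stricter threshold yields more, smaller groups; a real pipeline would likely compare model structure or behavior rather than activity sets alone.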

event log, process model, similarity measure, (12 more...)

arXiv.org Artificial Intelligence

2311.0304

Country:

Europe > Germany > North Rhine-Westphalia > Cologne Region > Aachen (0.04)
Europe > Czechia > South Moravian Region > Brno (0.04)

Genre: Research Report (0.64)

Industry: Health & Medicine > Therapeutic Area (0.47)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.34)

Disentangling Voice and Content with Self-Supervision for Speaker Recognition

Liu, Tianchi, Lee, Kong Aik, Wang, Qiongqiong, Li, Haizhou

arXiv.org Artificial Intelligence, Nov-1-2023

For speaker recognition, it is difficult to extract an accurate speaker representation from speech because of its mixture of speaker traits and content. This paper proposes a disentanglement framework that simultaneously models speaker traits and content variability in speech. It is realized with the use of three Gaussian inference layers, each consisting of a learnable transition model that extracts distinct speech components. Notably, a strengthened transition model is specifically designed to model complex speech dynamics. We also propose a self-supervision method to dynamically disentangle content without the use of labels other than speaker identities. The efficacy of the proposed framework is validated via experiments conducted on the VoxCeleb and SITW datasets with 9.56% and 8.24% average reductions in EER and minDCF, respectively. Since neither additional model training nor data is specifically needed, it is easily applicable in practical use.
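For readers unfamiliar with the EER metric cited above, here is a minimal sketch of how an equal error rate can be estimated from verification scores. It is not the paper's evaluation code: the function name, the threshold sweep over observed scores, and the convention of averaging the two error rates at the closest crossing point are all assumptions made for illustration.

```python
def equal_error_rate(target_scores, nontarget_scores):
    """Estimate the EER by sweeping thresholds over the observed scores.

    A trial is accepted when its score is >= the threshold.
    FRR: fraction of same-speaker (target) trials rejected.
    FAR: fraction of different-speaker (nontarget) trials accepted.
    Returns the mean of FAR and FRR where they are closest.
    """
    thresholds = sorted(set(target_scores) | set(nontarget_scores))
    best_gap, eer = None, None
    for t in thresholds:
        frr = sum(s < t for s in target_scores) / len(target_scores)
        far = sum(s >= t for s in nontarget_scores) / len(nontarget_scores)
        gap = abs(far - frr)
        if best_gap is None or gap < best_gap:
            best_gap, eer = gap, (far + frr) / 2
    return eer


if __name__ == "__main__":
    same_speaker = [0.9, 0.8, 0.7, 0.4]
    different_speaker = [0.1, 0.2, 0.3, 0.6]
    print(equal_error_rate(same_speaker, different_speaker))
```

Production toolkits usually interpolate between thresholds rather than picking the nearest observed one, so values from this sketch are only approximate.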

disentangling voice and content, self-supervision, speaker recognition

arXiv.org Artificial Intelligence

2310.01128

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition > Speech Recognition (0.60)

Open-world Semi-supervised Generalized Relation Discovery Aligned in a Real-world Setting

Hogan, William, Li, Jiacheng, Shang, Jingbo

arXiv.org Artificial Intelligence, Nov-1-2023

Open-world Relation Extraction (OpenRE) has recently garnered significant attention. However, existing approaches tend to oversimplify the problem by assuming that all unlabeled texts belong to novel classes, thereby limiting the practicality of these methods. We argue that the OpenRE setting should be more aligned with the characteristics of real-world data. Specifically, we propose two key improvements: (a) unlabeled data should encompass known and novel classes, including hard-negative instances; and (b) the set of novel classes should represent long-tail relation types. Furthermore, we observe that popular relations such as titles and locations can often be implicitly inferred through specific patterns, while long-tail relations tend to be explicitly expressed in sentences. Motivated by these insights, we present a novel method called KNoRD (Known and Novel Relation Discovery), which effectively classifies explicitly and implicitly expressed relations from known and novel classes within unlabeled data. Experimental evaluations on several Open-world RE benchmarks demonstrate that KNoRD consistently outperforms other existing methods, achieving significant performance gains.

dataset, novel class, unlabeled data, (13 more...)

arXiv.org Artificial Intelligence

2305.13533

Country:

North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.14)
Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)
North America > United States > Nebraska > Douglas County > Omaha (0.04)
(9 more...)

Genre:

Research Report > Promising Solution (0.66)
Research Report > New Finding (0.46)

Industry: Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.72)
Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.67)

Google Image Search Will Now Show a Photo's History. Can It Spot Fakes?

WIRED, Oct-25-2023, 16:00:00 GMT

The spread of misinformation is a massive problem online, and generative AI is only helping boost the creation of inauthentic or real-but-repurposed media. Even in the pre-generative-AI era, an image surfaced through a quick Google search might have been used out of context or attached to a less-than-reliable website. Google believes it has at least one solution for this problem. In Google image search results, users will start seeing an information box called "About this image." It rolls out today in the US (and initially only in English).

google, information, search result, (10 more...)

WIRED

AI-Alerts: 2023 > 2023-10 > AAAI AI-Alert for Oct 31, 2023 (1.00)

Country:

North America > United States (0.26)
Europe > Poland (0.06)

Industry: Media > News (0.75)

Technology:

Information Technology > Information Management > Search (0.85)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition > Image Matching (0.64)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.48)

Death to captchas

MIT Technology Review, Oct-24-2023, 09:00:00 GMT

The arms race between humans and machines has been progressing for a while. As early as 2016, researchers at Columbia University showed they could solve Google's image captchas with 70% accuracy using off-the-shelf automated image recognition tools, the sort that could readily be used by bot designers. Captchas have gotten more complex out of necessity: as AI gets more sophisticated, they've become less effective. By now, some captchas have gotten a little surreal.

captcha, death, university, (1 more...)

MIT Technology Review

Technology: Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.60)

Boosting Generalization with Adaptive Style Techniques for Fingerprint Liveness Detection

Zhu, Kexin, Lin, Bo, Qiu, Yang, Yule, Adam, Tang, Yao, Liang, Jiajun

arXiv.org Artificial Intelligence, Oct-24-2023

We introduce a high-performance fingerprint liveness feature extraction technique that secured first place in LivDet 2023 Fingerprint Representation Challenge. Additionally, we developed a practical fingerprint recognition system with 94.68% accuracy, earning second place in LivDet 2023 Liveness Detection in Action. By investigating various methods, particularly style transfer, we demonstrate improvements in accuracy and generalization when faced with limited training data. As a result, our approach achieved state-of-the-art performance in LivDet 2023 Challenges.

augmentation, livdet 2023, liveness, (12 more...)

arXiv.org Artificial Intelligence

2310.13573

Genre: Research Report (0.40)

Industry: Information Technology > Security & Privacy (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.51)

UWB Based Static Gesture Classification

Sebastian, Abhishek

arXiv.org Artificial Intelligence, Oct-23-2023

Our paper presents a robust framework for UWB-based static gesture recognition, leveraging proprietary UWB radar sensor technology. Extensive data collection efforts were undertaken to compile datasets containing five commonly used gestures. Our approach involves a comprehensive data pre-processing pipeline that encompasses outlier handling, aspect ratio-preserving resizing, and false-color image transformation. Both CNN and MobileNet models were trained on the processed images. Remarkably, our best-performing model achieved an accuracy of 96.78%. Additionally, we developed a user-friendly GUI framework to assess the model's system resource usage and processing times, which revealed low memory utilization and real-time task completion in under one second. This research marks a significant step towards enhancing static gesture recognition using UWB technology, promising practical applications in various domains.

accuracy, gesture recognition, recognition, (14 more...)

arXiv.org Artificial Intelligence

2310.15036

Country: Asia > India > Tamil Nadu > Chennai (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Vision > Gesture Recognition (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.49)

EDIS: Entity-Driven Image Search over Multimodal Web Content

Liu, Siqi, Feng, Weixi, Fu, Tsu-jui, Chen, Wenhu, Wang, William Yang

arXiv.org Artificial Intelligence, Oct-23-2023

Making image retrieval methods practical for real-world search applications requires significant progress in dataset scales, entity comprehension, and multimodal information fusion. In this work, we introduce Entity-Driven Image Search (EDIS), a challenging dataset for cross-modal image search in the news domain. EDIS consists of 1 million web images from actual search engine results and curated datasets, with each image paired with a textual description. Unlike datasets that assume a small set of single-modality candidates, EDIS reflects real-world web image search scenarios by including a million multimodal image-text pairs as candidates. EDIS encourages the development of retrieval models that simultaneously address cross-modal information fusion and matching. To achieve accurate ranking results, a model must: 1) understand named entities and events from text queries, 2) ground entities onto images or text descriptions, and 3) effectively fuse textual and visual representations. Our experimental results show that EDIS challenges state-of-the-art methods with dense entities and a large-scale candidate set. The ablation study also proves that fusing textual features with visual features is critical in improving retrieval results.

dataset, query, retrieval, (14 more...)

arXiv.org Artificial Intelligence

2305.13631

Country:

Europe > United Kingdom (0.14)
Asia > South Korea (0.14)
Asia > Cambodia > Preah Sihanouk Province > Sihanoukville (0.04)
(11 more...)

Genre: Research Report > New Finding (0.68)

Industry:

Government > Voting & Elections (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Banking & Finance (0.93)
(2 more...)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)
(2 more...)

Hamming Encoder: Mining Discriminative k-mers for Discrete Sequence Classification

Dong, Junjie, Jiang, Mudi, Hu, Lianyu, He, Zengyou

arXiv.org Artificial Intelligence, Oct-20-2023

Sequence classification has numerous applications in various fields. Despite extensive studies in the last decades, many challenges still exist, particularly in pattern-based methods. Existing pattern-based methods measure the discriminative power of each feature individually during the mining process, leading to the result of missing some combinations of features with discriminative power. Furthermore, it is difficult to ensure the overall discriminative performance after converting sequences into feature vectors. To address these challenges, we propose a novel approach called Hamming Encoder, which utilizes a binarized 1D-convolutional neural network (1DCNN) architecture to mine discriminative k-mer sets. In particular, we adopt a Hamming distance-based similarity measure to ensure consistency in the feature mining and classification procedure. Our method involves training an interpretable CNN encoder for sequential data and performing a gradient-based search for discriminative k-mer combinations. Experiments show that the Hamming Encoder method proposed in this paper outperforms existing state-of-the-art methods in terms of classification accuracy.
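To make the Hamming-distance idea in the abstract concrete, the sketch below scores a candidate k-mer against a sequence by its best positional agreement over all length-k windows, then turns a sequence into a feature vector with one score per k-mer. This is an illustrative stand-in, not the authors' binarized 1D-CNN or their gradient-based search; the function names are hypothetical.

```python
def hamming_similarity(a: str, b: str) -> int:
    """Number of positions at which two equal-length strings agree."""
    if len(a) != len(b):
        raise ValueError("strings must have equal length")
    return sum(x == y for x, y in zip(a, b))


def best_kmer_match(sequence: str, kmer: str) -> int:
    """Best Hamming similarity of kmer over all length-k windows."""
    k = len(kmer)
    if k == 0 or k > len(sequence):
        return 0
    return max(
        hamming_similarity(sequence[i:i + k], kmer)
        for i in range(len(sequence) - k + 1)
    )


def encode(sequence: str, kmers: list) -> list:
    """Feature vector with one best-match score per candidate k-mer."""
    return [best_kmer_match(sequence, km) for km in kmers]


if __name__ == "__main__":
    print(encode("ACGTAC", ["CGT", "TTT"]))
```

Using the same similarity both to score k-mers and to build the features is one simple way to keep mining and classification consistent, which is the property the abstract emphasizes.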

classification, hamming encoder, neural network, (15 more...)

arXiv.org Artificial Intelligence

2310.10321

Country:

Asia > China > Liaoning Province > Dalian (0.05)
Europe > United Kingdom > England > Leicestershire > Leicester (0.04)
Europe > Switzerland (0.04)
(3 more...)

Genre:

Overview (0.88)
Research Report > Promising Solution (0.54)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (0.46)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.42)
Materials > Metals & Mining (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)

Data Augmentation for Time-Series Classification: An Extensive Empirical Study and Comprehensive Survey

Gao, Zijun, Li, Lingbo, Xu, Tianhua

arXiv.org Artificial Intelligence, Oct-19-2023

Data Augmentation (DA) has emerged as an indispensable strategy in Time Series Classification (TSC), primarily due to its capacity to amplify training samples, thereby bolstering model robustness, diversifying datasets, and curtailing overfitting. However, the current landscape of DA in TSC is plagued with fragmented literature reviews, nebulous methodological taxonomies, inadequate evaluative measures, and a dearth of accessible, user-oriented tools. In light of these challenges, this study embarks on an exhaustive dissection of DA methodologies within the TSC realm. Our initial approach involved an extensive literature review spanning a decade, revealing that contemporary surveys scarcely capture the breadth of advancements in DA for TSC, prompting us to meticulously analyze over 100 scholarly articles to distill more than 60 unique DA techniques. This rigorous analysis precipitated the formulation of a novel taxonomy, purpose-built for the intricacies of DA in TSC, categorizing techniques into five principal echelons: Transformation-Based, Pattern-Based, Generative, Decomposition-Based, and Automated Data Augmentation. Our taxonomy promises to serve as a robust navigational aid for scholars, offering clarity and direction in method selection. Addressing the conspicuous absence of holistic evaluations for prevalent DA techniques, we executed an all-encompassing empirical assessment, wherein upwards of 15 DA strategies were subjected to scrutiny across 8 UCR time-series datasets, employing ResNet and a multi-faceted evaluation paradigm encompassing Accuracy, Method Ranking, and Residual Analysis, yielding a benchmark accuracy of 88.94 ± 11.83%. Our investigation underscored the inconsistent efficacies of DA techniques, with...

augmentation, data augmentation, dataset, (13 more...)

arXiv.org Artificial Intelligence

2310.1006

Country:

Europe > United Kingdom > England > West Midlands > Coventry (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
North America > United States > California > Riverside County > Riverside (0.04)
(3 more...)

Genre:

Research Report > New Finding (1.00)
Overview (1.00)
Research Report > Promising Solution (0.92)

Industry:

Health & Medicine > Diagnostic Medicine (0.93)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.46)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(3 more...)
