AITopics

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.58)

#artificialintelligenceNov-27-2021, 21:32:57 GMT

Computer Vision: Python OCR & Object Detection Quick Starter

This is the third course from my Computer Vision series. Image Recognition, Object Detection, Object Recognition and also Optical Character Recognition are among the most used applications of Computer Vision. Using these techniques, the computer will be able to recognize and classify either the whole image, or multiple objects inside a single image predicting the class of the objects with the percentage accuracy score. Using OCR, it can also recognize and convert text in the images to machine readable format like text or a document. Object Detection and Object Recognition is widely used in many simple applications and also complex ones like self driving cars.

image recognition, optical character recognition, recognition, (11 more...)

Genre: Instructional Material (0.31)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.68)
Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.67)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.55)

#artificialintelligenceNov-27-2021, 11:50:25 GMT

Use Cases and Roll-Out Tips for Image Recognition in Retail

Heavily shattered by the pandemic, the retail sector is on the lookout for innovation. Among the many technologies retailers focus on, artificial intelligence is an undeniable leader. The market of artificial intelligence solutions for retail is projected to reach $23.32 billion by 2027, quite a leap compared to $5.06 billion in 2021. Within AI, computer vision and image recognition have become notable areas of interest for the retail sector -- the global market of retail image recognition software is expected to grow at a CAGR of 22% and attain the value of $3.7 billion by 2025. Bringing image recognition into their technology mixes, retailers hope to optimize inventories, simplify checkouts, and boost customer experience.

customer, image recognition, recognition, (10 more...)

Industry: Retail (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition > Image Matching (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.80)

arXiv.org Artificial IntelligenceNov-24-2021

CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition

Zheng, Tianlun, Chen, Zhineng, Fang, Shancheng, Xie, Hongtao, Jiang, Yu-Gang

The attention-based encoder-decoder framework is becoming popular in scene text recognition, largely due to its superiority in integrating recognition clues from both visual and semantic domains. However, recent studies show the two clues might be misaligned in the difficult text (e.g., with rare text shapes) and introduce constraints such as character position to alleviate the problem. Despite certain success, a content-free positional embedding hardly associates with meaningful local image regions stably. In this paper, we propose a novel module called Multi-Domain Character Distance Perception (MDCDP) to establish a visual and semantic related position encoding. MDCDP uses positional embedding to query both visual and semantic features following the attention mechanism. It naturally encodes the positional clue, which describes both visual and semantic distances among characters. We develop a novel architecture named CDistNet that stacks MDCDP several times to guide precise distance modeling. Thus, the visual-semantic alignment is well built even various difficulties presented. We apply CDistNet to two augmented datasets and six public benchmarks. The experiments demonstrate that CDistNet achieves state-of-the-art recognition accuracy. While the visualization also shows that CDistNet achieves proper attention localization in both visual and semantic domains. We will release our code upon acceptance.

cdistnet, recognition, text recognition, (15 more...)

2111.11011

Country: Asia > China (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition > Text Recognition (0.64)

Bergami, Giacomo, Di Francescomarino, Chiara, Ghidini, Chiara, Maggi, Fabrizio Maria, Puura, Joonas

Exploring Business Process Deviance with Sequential and Declarative Patterns

arXiv.org Artificial IntelligenceNov-24-2021

Business process deviance refers to the phenomenon whereby a subset of the executions of a business process deviate, in a negative or positive way, with respect to {their} expected or desirable outcomes. Deviant executions of a business process include those that violate compliance rules, or executions that undershoot or exceed performance targets. Deviance mining is concerned with uncovering the reasons for deviant executions by analyzing event logs stored by the systems supporting the execution of a business process. In this paper, the problem of explaining deviations in business processes is first investigated by using features based on sequential and declarative patterns, and a combination of them. Then, the explanations are further improved by leveraging the data attributes of events and traces in event logs through features based on pure data attribute values and data-aware declarative rules. The explanations characterizing the deviances are then extracted by direct and indirect methods for rule induction. Using real-life logs from multiple domains, a range of feature types and different forms of decision rules are evaluated in terms of their ability to accurately discriminate between non-deviant and deviant executions of a process as well as in terms of understandability of the final outcome returned to the users.

classifier, constraint, event log, (17 more...)

2111.12454

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Europe > Italy > Lazio > Rome (0.04)
South America > Brazil > Rio de Janeiro > Rio de Janeiro (0.04)
(6 more...)

Genre: Research Report > New Finding (0.68)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.46)

Journal of Artificial Intelligence ResearchNov-23-2021

Graph Kernels: A Survey

Nikolentzos, Giannis | Siglidis, Giannis | Vazirgiannis, Michalis (Ecole Polytechnique)

Graph kernels have attracted a lot of attention during the last decade, and have evolved into a rapidly developing branch of learning on structured data. During the past 20 years, the considerable research activity that occurred in the field resulted in the development of dozens of graph kernels, each focusing on specific structural properties of graphs. Graph kernels have proven successful in a wide range of domains, ranging from social networks to bioinformatics. The goal of this survey is to provide a unifying view of the literature on graph kernels. In particular, we present a comprehensive overview of a wide range of graph kernels. Furthermore, we perform an experimental evaluation of several of those kernels on publicly available datasets, and provide a comparative study. Finally, we discuss key applications of graph kernels, and outline some challenges that remain to be addressed.

graph, graph kernel, kernel, (15 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.1.13225

AI Access Foundation

13225

Journal of Artificial Intelligence Research

Country:

Europe > France (0.04)
North America > United States > California (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.67)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.67)
(2 more...)

Chen, Chien-Ming, Chen, Lili, Gan, Wensheng

Flexible Pattern Discovery and Analysis

arXiv.org Artificial IntelligenceNov-23-2021

--Based on the analysis of the proportion of utility in the supporting transactions used in the field of data mining, high utility-occupancy pattern mining (HUOPM) has recently attracted widespread attention. Unlike high-utility pattern mining (HUPM), which involves the enumeration of high-utility (e.g., profitable) patterns, HUOPM aims to find patterns representing a collection of existing transactions. In practical applications, however, not all patterns are used or valuable. For example, a pattern might contain too many items, that is, the pattern might be too specific and therefore lack value for users in real life. T o achieve qualified patterns with a flexible length, we constrain the minimum and maximum lengths during the mining process and introduce a novel algorithm for the mining of flexible high utility-occupancy patterns. In addition, a utility-occupancy nested list (UO-nlist) and a frequency-utility-occupancy table (FUO-table) are employed to avoid multiple scans of the database. Evaluation results of the subsequent experiments confirm that the proposed algorithm can effectively control the length of the derived patterns, for both real-world and synthetic datasets. Moreover, it can decrease the execution time and memory consumption. HE initial motivation for frequent pattern mining (FPM) was to analyze the shopping behavior of customers using transactional databases and recommend frequently purchased patterns to customers [1], [2], [3], [4], [5]. In this case, researchers believed that the item is binary and whether an item appears in a transaction is considered primary. However, frequent purchase patterns are occasionally less profitable than infrequent purchase patterns with high profits, which poses a fundamental problem. Hence, the discovery of high-utility patterns that consider not only the internal utility (e.g., quantity) but also the external utility (e.g., profit, interest, or weight) [6], [7], [8], [9] has gained substantial research attention. Moreover, a framework called high-utility pattern mining (HUPM) [10], [11] was proposed to address this practical issue. In contrast with frequent pattern mining (FPM), the lack of a downward closure property makes HUPM more difficult and intractable. This research was partially supported by National Natural Science Foundation of China (Grant No. 62002136), Guangzhou Basic and Applied Basic Research Foundation (Grant No. 202102020277). Wensheng Gan is with the College of Cyber Security, Jinan University, Guangzhou 510632, China.

algorithm, mining, transaction, (13 more...)

2111.12218

Country:

Asia > China > Guangdong Province > Guangzhou (0.44)
Europe > Finland > Uusimaa > Helsinki (0.04)
Asia > China > Shandong Province > Qingdao (0.04)

Genre: Research Report (0.82)

Industry:

Information Technology (0.54)
Materials (0.34)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (1.00)

Rajeswar, Sai, Rodriguez, Pau, Singhal, Soumye, Vazquez, David, Courville, Aaron

Multi-label Iterated Learning for Image Classification with Label Ambiguity

arXiv.org Artificial IntelligenceNov-23-2021

Transfer learning from large-scale pre-trained models has become essential for many computer vision tasks. Recent studies have shown that datasets like ImageNet are weakly labeled since images with multiple object classes present are assigned a single label. This ambiguity biases models towards a single prediction, which could result in the suppression of classes that tend to co-occur in the data. Inspired by language emergence literature, we propose multi-label iterated learning (MILe) to incorporate the inductive biases of multi-label learning from single labels using the framework of iterated learning. MILe is a simple yet effective procedure that builds a multi-label description of the image by propagating binary predictions through successive generations of teacher and student networks with a learning bottleneck. Experiments show that our approach exhibits systematic benefits on ImageNet accuracy as well as ReaL F1 score, which indicates that MILe deals better with label ambiguity than the standard training procedure, even when fine-tuning from self-supervised weights. We also show that MILe is effective reducing label noise, achieving state-of-the-art performance on real-world large-scale noisy data such as WebVision. Furthermore, MILe improves performance in class incremental settings such as IIRC and it is robust to distribution shifts. Code: https://github.com/rajeswar18/MILe

learning, prediction, student, (16 more...)

2111.12172

Country: North America > Canada > Quebec > Montreal (0.14)

Genre: Research Report (1.00)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Vision > Image Understanding (0.50)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.46)

#artificialintelligenceNov-22-2021, 19:17:15 GMT

Justitia ex Machina: The Case for Automating Morals

This piece was a finalist for the inaugural Gradient Prize. Machine Learning is a powerful technique to automatically learn models from data that have recently been the driving force behind several impressive technological leaps such as self-driving cars, robust speech recognition, and, arguably, better-than-human image recognition. We rely on these machine learning models daily; they influence our lives in ways we did not expect, and they are only going to become even more ubiquitous. Consider a couple of example machine learning models: 1) Detecting cats in images 2) Deciding which ads to show you online 3) Predicting which areas will suffer crime, and 4) Predicting how likely a criminal is to re-offend. The first two seem harmless enough.

fairness, machine learning, prediction, (13 more...)

Country: Europe > Denmark > Capital Region > Copenhagen (0.04)

Industry:

Transportation (0.56)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.47)
Law (0.47)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.55)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition > Image Matching (0.34)

#artificialintelligenceNov-21-2021, 23:47:13 GMT

Artificial intelligence key to Bristlebird recovery

Experts across multiple states and regions are working on cutting edge science like'call recognition' software to help the shy and elusive Eastern Bristlebird recover the devastating Black Summer bushfires. The state-of-the-art deep learning AI pattern recognition tool is one of eight new recovery projects that have received funding through the Morrison Government's multiregional species coordinator. Minister for the Environment Sussan Ley said the projects will cover a range of recovery actions including the use of scientific surveys to record sightings of the birds to improve understanding of subpopulations and habitat connectivity. "Eastern Bristlebirds are a very secretive bird but can be easily recognised by their melodic song and alarm-call, which is why we are developing new listening tools to support the identification and recovery of this endangered species," Minister Ley said. "By creating a tool that automatically and accurately detects the bird's calls from remote field recordings, and updating radio-transmitter attachment methods, we will be able to track remaining and translocated populations to support their recovery in the future. "We will also be using highly-skilled volunteer scientists to collect data that will inform the future recovery actions we need to take to support the recovery of the Bristlebird across its entire range." Other projects for the Eastern Bristlebird will focus on enhancing recovery through habitat restoration, health and disease research, and support for the establishment of a new genetically viable population in Victoria as a safety net in case of extreme weather events or the spread of disease. "One of the key learnings from the Black Summer bushfires was a need for coordinated on-ground action, monitoring and research, across the entire range of a species, to support its recovery," Minister Ley added. "That is why the Australian Government's $200 million investment in bushfire recovery for wildlife and their habitats is seeing states, territories and stakeholders continuing to work together to support the recovery of ecosystems over a year on from the devastating bushfires.

artificial intelligence key, bristlebird recovery, recovery, (6 more...)

Country:

Oceania > Australia > Queensland (0.07)
Oceania > Australia > New South Wales (0.07)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.95)