Goto

Collaborating Authors

 Pattern Recognition


Learning to classify complex patterns using a VLSI network of spiking neurons

Neural Information Processing Systems

We propose a compact, low power VLSI network of spiking neurons which can learn to classify complex patterns of mean firing rates on–line and in real–time. The network of integrate-and-fire neurons is connected by bistable synapses that can change their weight using a local spike–based plasticity mechanism. Learning is supervised by a teacher which provides an extra input to the output neurons during training. The synaptic weights are updated only if the current generated by the plastic synapses does not match the output desired by the teacher (as in the perceptron learning rule). We present experimental results that demonstrate how this VLSI network is able to robustly classify uncorrelated linearly separable spatial patterns of mean firing rates.


Filtering Abstract Senses From Image Search Results

Neural Information Processing Systems

We propose an unsupervised method that, given a word, automatically selects non-abstract senses of that word from an online ontology and generates images depicting the corresponding entities. When faced with the task of learning a visual model based only on the name of an object, a common approach is to find images on the web that are associated with the object name, and then train a visual classifier from the search result. As words are generally polysemous, this approach can lead to relatively noisy models if many examples due to outlier senses are added to the model. We argue that images associated with an abstract word sense should be excluded when training a visual classifier to learn a model of a physical object. While image clustering can group together visually coherent sets of returned images, it can be difficult to distinguish whether an image cluster relates to a desired object or to an abstract sense of the word.


A Convergence Analysis of Log-Linear Training

Neural Information Processing Systems

Log-linear models are widely used probability models for statistical pattern recognition. Typically, log-linear models are trained according to a convex criterion. In recent years, the interest in log-linear models has greatly increased. The optimization of log-linear model parameters is costly and therefore an important topic, in particular for large-scale applications. Different optimization algorithms have been evaluated empirically in many papers.


Term Rewriting Based On Set Automaton Matching

arXiv.org Artificial Intelligence

In this article we investigate how a subterm pattern matching algorithm can be exploited to implement efficient term rewriting procedures. From the left-hand sides of the rewrite system we construct a set automaton, which can be used to find all redexes in a term efficiently. We formally describe a procedure that, given a rewrite strategy, interleaves pattern matching steps and rewriting steps and thus smoothly integrates redex discovery and subterm replacement. We then present an efficient implementation that instantiates this procedure with outermost rewriting, and present the results of some experiments. Our implementation shows to be competitive with comparable tools.


Improving Scene Text Recognition for Character-Level Long-Tailed Distribution

arXiv.org Artificial Intelligence

Despite the recent remarkable improvements in scene text recognition (STR), the majority of the studies focused mainly on the English language, which only includes few number of characters. However, STR models show a large performance degradation on languages with a numerous number of characters (e.g., Chinese and Korean), especially on characters that rarely appear due to the long-tailed distribution of characters in such languages. To address such an issue, we conducted an empirical analysis using synthetic datasets with different character-level distributions (e.g., balanced and long-tailed distributions). While increasing a substantial number of tail classes without considering the context helps the model to correctly recognize characters individually, training with such a synthetic dataset interferes the model with learning the contextual information (i.e., relation among characters), which is also important for predicting the whole word. Based on this motivation, we propose a novel Context-Aware and Free Experts Network (CAFE-Net) using two experts: 1) context-aware expert learns the contextual representation trained with a long-tailed dataset composed of common words used in everyday life and 2) context-free expert focuses on correctly predicting individual characters by utilizing a dataset with a balanced number of characters. By training two experts to focus on learning contextual and visual representations, respectively, we propose a novel confidence ensemble method to compensate the limitation of each expert. Through the experiments, we demonstrate that CAFE-Net improves the STR performance on languages containing numerous number of characters. Moreover, we show that CAFE-Net is easily applicable to various STR models.


Getting Started with AI: How to Use Python for Machine Learning

#artificialintelligence

Artificial Intelligence (AI) and Machine Learning (ML) are two rapidly growing fields in technology, and Python has become the go-to programming language for both. Python has a vast array of libraries and tools available for AI and ML development, making it an ideal language for beginners to get started with these fields. In this article, we will discuss the basics of using Python for machine learning and provide some code samples to help you get started. Machine learning is a subset of AI that involves training machines to learn from data and make predictions or decisions. It is a form of statistical analysis that involves the use of algorithms to find patterns in data and use those patterns to make predictions.


Scalable handwritten text recognition system for lexicographic sources of under-resourced languages and alphabets

arXiv.org Artificial Intelligence

The paper discusses an approach to decipher large collections of handwritten index cards of historical dictionaries. Our study provides a working solution that reads the cards, and links their lemmas to a searchable list of dictionary entries, for a large historical dictionary entitled the Dictionary of the 17th- and 18th-century Polish, which comprizes 2.8 million index cards. We apply a tailored handwritten text recognition (HTR) solution that involves (1) an optimized detection model; (2) a recognition model to decipher the handwritten content, designed as a spatial transformer network (STN) followed by convolutional neural network (RCNN) with a connectionist temporal classification layer (CTC), trained using a synthetic set of 500,000 generated Polish words of different length; (3) a post-processing step using constrained Word Beam Search (WBC): the predictions were matched against a list of dictionary entries known in advance. Our model achieved the accuracy of 0.881 on the word level, which outperforms the base RCNN model. Within this study we produced a set of 20,000 manually annotated index cards that can be used for future benchmarks and transfer learning HTR applications.


Galaxy Classification Using Transfer Learning and Ensemble of CNNs With Multiple Colour Spaces

arXiv.org Artificial Intelligence

Big data has become the norm in astronomy, making it an ideal domain for computer science research. Astronomers typically classify galaxies based on their morphologies, a practice that dates back to Hubble (1936). With small datasets, classification could be performed by individuals or small teams, but the exponential growth of data from modern telescopes necessitates automated classification methods. In December 2013, Winton Capital, Galaxy Zoo, and the Kaggle team created the Galaxy Challenge, which tasked participants with developing models to classify galaxies. The Kaggle Galaxy Zoo dataset has since been widely used by researchers. This study investigates the impact of colour space transformation on classification accuracy and explores the effect of CNN architecture on this relationship. Multiple colour spaces (RGB, XYZ, LAB, etc.) and CNN architectures (VGG, ResNet, DenseNet, Xception, etc.) are considered, utilizing pre-trained models and weights. However, as most pre-trained models are designed for natural RGB images, we examine their performance with transformed, non-natural astronomical images. We test our hypothesis by evaluating individual networks with RGB and transformed colour spaces and examining various ensemble configurations. A minimal hyperparameter search ensures optimal results. Our findings indicate that using transformed colour spaces in individual networks yields higher validation accuracy, and ensembles of networks and colour spaces further improve accuracy. This research aims to validate the utility of colour space transformation for astronomical image classification and serve as a benchmark for future studies.


WiMi Hologram Cloud Develops A CNN Algorithm-Based Image Recognition System

#artificialintelligence

WiMi Hologram Cloud Inc. (NASDAQ: WIMI) ("WiMi" or the "Company"), a leading global Hologram Augmented Reality ("AR") Technology provider, today announced that it has developed a CNN (convolutional neural network) algorithm-based image recognition system. CNN is a highly efficient recognition algorithm based on an artificial neural network. WiMi applies the CNN algorithm to image recognition technology, showing apparent advantages compared to the traditional machine learning algorithm. CNN realizes the construction of features by the computer itself, thus breaking through the bottleneck of the original way of classification. This has brought image recognition to a new level.


Building an Image Recognition Model using TensorFlow and Keras Libraries in Python - Code Armada, LLC

#artificialintelligence

Building an Image Recognition Model using TensorFlow and Keras Libraries in Python Image recognition models are extremely useful in a wide range of applications, from autonomous vehicles and medical diagnosis to social media analysis and e-commerce. By teaching a computer to identify and classify images based on certain features, such as color, shape, and texture, we can automate tasks that would be difficult or impossible for humans to do at scale. For example, an image recognition model can be used to detect objects in images, recognize faces and emotions, identify text in images, and even diagnose medical conditions based on medical images. In e-commerce, image recognition models can be used to recommend products based on visual similarity, allowing for more personalized and relevant product recommendations. Pretty cool, right? Let’s give it a try… Step 1. Install the required libraries: First, you need to install TensorFlow and Keras libraries in Python. You can install them using pip command in the terminal. pip install tensorflow pip install keras Step 2. Import the required libraries: Once the libraries are installed, you need to import them in your Python script. import tensorflow as tf from tensorflow import keras Step 3. Load the dataset: Next, […]