Goto

Collaborating Authors

 Image Matching


Improving Image Recognition to Accelerate Machine Learning - Advanced Science News

#artificialintelligence

Deep learning is a fascinating sub field of machine learning that creates artificially intelligent systems inspired by the structure and function of the brain. The basis of these models are bio-inspired artificial neural networks that mimic the neural connectivity of animal brains to carry out cognitive functions such as problem solving. A field with the most impressive results of neuromorphic computing is that of visual image analysis. Similar to how our brains learn to recognize objects in order to make predictions and act upon them, artificial intelligence must be shown millions of pictures before they are able to generalize them in order to make their best educated guesses for images they have never seen before. Professor Cheol Seong Hwang from the Department of Material Science and Engineering at Seoul National University and his research team have developed a method to accelerate the image recognition process by combining the inherent efficiency of resistive random access memory (ReRAM) and cross-bar array structures, two of the most commonly used hardware. Many of us have performed a reversed image search to find information based on a certain image in order to browse similar results.


A Two-Stage Approach to Few-Shot Learning for Image Recognition

arXiv.org Machine Learning

--This paper proposes a multi-layer neural network structure for few-shot image recognition of novel categories. The proposed multi-layer neural network architecture encodes transferable knowledge extracted from a large annotated dataset of base categories. This architecture is then applied to novel categories containing only a few samples. The transfer of knowledge is carried out at the feature-extraction and the classification levels distributed across the two training stages. In the first-training stage, we introduce the relative feature to capture the structure of the data as well as obtain a low-dimensional discriminative space. Secondly, we account for the variable variance of different categories by using a network to predict the variance of each class. Classification is then performed by computing the Maha-lanobis distance to the mean-class representation in contrast to previous approaches that used the Euclidean distance. In the second-training stage, a category-agnostic mapping is learned from the mean-sample representation to its corresponding class-prototype representation. This is because the mean-sample representation may not accurately represent the novel category prototype. Finally, we evaluate the proposed network structure on four standard few-shot image recognition datasets, where our proposed few-shot learning system produces competitive performance compared to previous work. We also extensively studied and analyzed the contribution of each component of our proposed framework. For the past decade, deep convolutional neural networks (CNN) have produced excellent results in visual recognition tasks such as object recognition, scene classification, etc. [1]- [3]. A CNN learns to recognize a large quantity of visual categories by training on a large collection of annotated images using a gradient-descent technique [4]. Although the training procedure is computationally intensive, it can be parallelized using a Graphics Processing Unit (GPU). Even after a long training period, the CNN can only recognize a fixed set of image categories. To learn to recognize novel categories, one has to collect new training data and retrain the CNN model with further adjustments. Unfortunately, in some cases, there might not be enough labeled data available for training a novel category. This work was supported in part by the National Science Foundation under Grant IIS-1813935. Any opinion, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation. We also gratefully acknowledge the support of NVIDIA Corporation for the donation of a TIT AN XP GPU used for this research. Object categories follow a long tailed distribution with a lot of rare classes and very few common classes. In such a long-tailed distribution, only a few object categories occur frequently.


Artificial intelligence: Towards a better understanding of the underlying mechanisms

#artificialintelligence

The automatic identification of complex features in images has already become a reality thanks to artificial neural networks. Some examples of software exploiting this technique are Facebook's automatic tagging system, Google's image search engine and the animal and plant recognition system used by iNaturalist. We know that these networks are inspired by the human brain, but their working mechanism is still mysterious. New research, conducted by SISSA in association with the Technical University of Munich and published for the 33rd Annual NeurIPS Conference, proposes a new approach for studying deep neural networks and sheds new light on the image elaboration processes that these networks are able to carry out. Similar to what happens in the visual system, neural networks used for automatic image recognition analyse the content progressively, through a chain of processing stages.


Artificial intelligence: Towards a better understanding of the underlying mechanisms

#artificialintelligence

The automatic identification of complex features in images has already become a reality thanks to artificial neural networks. Some examples of software exploiting this technique are Facebook's automatic tagging system, Google's image search engine and the animal and plant recognition system used by iNaturalist. We know that these networks are inspired by the human brain, but their working mechanism is still mysterious. New research, conducted by SISSA in association with the Technical University of Munich and published for the 33rd Annual NeurIPS Conference, proposes a new approach for studying deep neural networks and sheds new light on the image elaboration processes that these networks are able to carry out. Similar to what happens in the visual system, neural networks used for automatic image recognition analyse the content progressively, through a chain of processing stages.


Computer vision API- Skyl.ai

#artificialintelligence

Computer vision APIs let you run computer vision tasks programmatically at scale in real time. Once set up, the computer vision API can run computer vision tasks simultaneously on millions of data. This makes it easy to integrate these APIs into your apps or websites and deliver cutting edge computer vision backed experiences to your customers easily. For example, you might have a reverse image search engine which takes in a photo as an input and returns a set of similar images from the web. You can implement this in no time using computer vision APIs even though you do not have any expertise in machine learning or computer vision.


Alibaba's New AI Chip Can Process Nearly 80K Images Per Second

#artificialintelligence

The Hanguang 800 is being implemented across many application scenarios within Aliyun, ranging from video classification to smart city applications. For example, the company's popular Pailitao platform applies visual image search to e-commerce, allowing customers to search for items by taking a photo of the query object. Using AI-based image recognition & indexing powered by the new Hanguang 800, Aliyun can increase image processing efficiency by 12 times compared to GPUs. With regard to smart city tech, Aliyun says it previously used 40 traditional GPUs to process videos of central Hangzhou with a latency of 300ms. Now the task requires only four Hanguang 800 with a lower latency of 150ms.


Martin's Playtime with Tensorflow Lite / Dr Who image recognition

#artificialintelligence

Sign in to report inappropriate content. Digital Maker's Martin Evans has been experimenting with TensorFlow Lite on the Raspberry Pi 4 to recognise Dr Who character shapes. This is a short video of the Pi camera recognising a Dalek & a Cyberman, with the output going to an Ada Fruit Display Screen.


AI Image Recognition Market-Growth, Trends, and Forecast (2019-2024)

#artificialintelligence

Market Overview The AI image recognition market was valued at USD 1.41 billion in 2018 and is projected to reach a market value of USD 5.32 billion by 2024 at a CAGR of 24.7% over the forecast period (2019 - 2024). Image recognition technologies comprise voice, iris, palm, hand vein pattern, fingerprints, retina, hand geometry, facial pattern recognition, object identification etc. Image recognition based on these indications can be applied across various fields, such as vehicular safety, advertising, security and surveillance, biometric scanning machines, pedestrian recognition, and E-commerce. The adoption of artificial intelligence (AI) technology is rising, owing to its ability to enhance and automate operations and enrich the user experience. Governments are also focusing on increasing their AI capabilities to revolutionize various sectors, from healthcare to transport. EU has committed to invest EUR 1.5 billion in AI to catch up with the United States and Asia.


Microsoft and Graphcore collaborate to accelerate Artificial Intelligence

#artificialintelligence

Our research team at Qwant works at the cutting edge of AI to quickly deliver the best possible results on our users search queries while ensuring the results are neutral, impartial and accurate. We see millions of searches each day for images alone. One of the latest AI innovations that we are implementing is a new class of image recognition model called ResNext, to improve our accuracy and speed when delivering image search results. We have been working closely with Microsoft and Graphcore to use IPU processor technology in Azure and are seeing a significant improvement – with 3.5x higher performance - in our image search capability using ResNext on IPUs, out of the box. There is huge potential for innovation with Graphcore IPUs on new machine intelligence models and we are working on these approaches to refine our search results so that we can deliver exactly what our customers are looking for.


Building a Deep Image Search Engine using tf.Keras

#artificialintelligence

Imagine having a data collection of hundreds of thousands to millions of images without any metadata describing the content of each image. How can we build a system that is able to find a sub-set of those images that best answer a user's search query? What we will basically need is a search engine that is able to rank image results given how well they correspond to the search query, which can be either expressed in a natural language or by another query image. The way we will solve the problem in this post is by training a deep neural model that learns a fixed length representation (or embedding) of any input image and text and makes it so those representations are close in the euclidean space if the pairs text-image or image-image are "similar". I could not find a data-set of search result ranking that is big enough but I was able to get this data-set: http://jmcauley.ucsd.edu/data/amazon/