Image Understanding: Overviews


Viewpoint Invariant Change Captioning

arXiv.org Artificial Intelligence

The ability to detect that something has changed in an environment is valuable, but often only if it can be accurately conveyed to a human operator. We introduce Viewpoint Invariant Change Captioning, and develop models which can both localize and describe via natural language complex changes in an environment. Moreover, we distinguish between a change in a viewpoint and an actual scene change (e.g. a change of objects' attributes). To study this new problem, we collect a Viewpoint Invariant Change Captioning Dataset (VICC), building it off the CLEVR dataset and engine. We introduce 5 types of scene changes, including changes in attributes, positions, etc. To tackle this problem, we propose an approach that distinguishes a viewpoint change from an important scene change, localizes the change between "before" and "after" images, and dynamically attends to the relevant visual features when describing the change. We benchmark a number of baselines on our new dataset, and systematically study the different change types. We show the superiority of our proposed approach in terms of change captioning and localization. Finally, we also show that our approach is general and can be applied to real images and language on the recent Spot-the-diff dataset.


OpenCV Tutorial: A Guide to Learn OpenCV - PyImageSearch

#artificialintelligence

Whether you're interested in learning how to apply facial recognition to video streams, building a complete deep learning pipeline for image classification, or simply want to tinker with your Raspberry Pi and add image recognition to a hobby project, you'll need to learn OpenCV somewhere along the way. The truth is that learning OpenCV used to be quite challenging. The documentation was hard to navigate. The tutorials were hard to follow and incomplete. And even some of the books were a bit tedious to work through. The good news is learning OpenCV isn't as hard as it used to be. And in fact, I'll go as far as to say studying OpenCV has become significantly easier. And to prove it to you (and help you learn OpenCV), I've put together this complete guide to learning the fundamentals of the OpenCV library using the Python programming language. Let's go ahead and get started learning the basics of OpenCV and image processing. By the end of today's blog post, you'll understand the fundamentals of OpenCV.


A guide to AI image recognition

#artificialintelligence

Artificial intelligence is becoming a centralised part of our everyday lives, even if we don't realise it. In fact, half of the people who encounter AI don't know they are doing so.


Fasttrack AI workshop

#artificialintelligence

The OpenPOWER workshop on PowerAI hosted by the NHCE on 19th of December 2017. The Program, led and managed by Ganesan Narayanasamy introduced a wide range of specialist topics ranging from IBM powerAI, deep learning, machine learning, tensorFlow frameworks, Image classification with example. In this session an introduction to OpenPower foundation was delivered.This included an overview of the cooperation of over 300 institutions ranging from academia to industry as well as a more in depth look at some of the success and developments currently underway within the OpenPOWER framework. The Oak Ridge Leadership Computing Facility provides the open scientific community access to America's fastest, most powerful supercomputer and is a key member of the OpenPOWER Founation. Also included an outline of the conventionally used qubit technologies as well as an indication of the current status of the quantum computer projects underway at some of the lead player institutions including IBM, Microsoft and NASA.