20 Analysis of Curved Line Drawings Using Context and Global Information

AI Classics/files/AI/classics/Machine Intelligence 6/MI6-Ch20-Guzman.pdf 

We describe the analysis of visual scenes consisting of black on white drawings formed with curved lines, depicting familiar objects and forms: houses, trees, persons, and so on; for instance, drawings found in coloring books. The goal of such analysis is to recognize (by computer) such forms and shapes when present in the input scene; that is, to name (correctly) as many parts of the scene as possible: finger, hand, girl, dance, and so on. Complications occur because each input scene contains several such objects, partially occluding each other and in varying degrees of orientation, size, and so on. The analysis of these line drawings is an instance of'the context problem', which can be stated as'given that a set (a scene) is formed by components that locally (by their shape) are ambiguous, because each shape allows a component to have one of several possible values (a circle can be sun, ball, eye, hole) or meanings, can we make use of context information stated in the form of models, in order to single out for each component a value in such manner that the whole set (scene) is consistent or makes global sense?' Thus, shape drastically limits the values that a component could have, and further disambiguation is possible only by using global information (derived from several components and their inter-relations or inter-connections) under the assumption that the scene as a whole is meaningful. This paper proposes a way to solve'the context problem' in the paradigm of coloring book drawings. We have not implemented this approach; indeed, a purpose of this paper is to collect criticisms and suggestions. INTRODUCTION 1.1 Statement of the problem An input picture is read into a computer. We would like to analyze it. The input picture The input picture consists of a line drawing (black curved thin lines on white paper) containing familiar objects [figure 1(a)]; one could think of drawings in coloring books for children. The objects forming the picture should be drawn correctly and accurately: no intentional distortions, caricatures, or humanizations of animals (figure 2) will be allowed. Figure 2. Line drawings containing distortions, caricatures, comic strips, humanized animals, and so on, will not be accepted. Thus, it can be said that the class of input pictures we want to analyze is that found in coloring books, except distortions. We could think of a person looking at the input data [figures 1(a) or 1(b)] and saying: there is a straight line from point (30, 40) to point (67, --18.5), The input picture [figure 1(a)] is stored initially in the memory of the computer as a collection of black points (specified by their two-dimensional coordinates) closely spaced along each black line [figure 1(b)]. We will assume that: (a) The points are uniformly spaced along the lines of the drawing.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found