TRAFFIC: Recognizing Objects Using Hierarchical Reference Frame Transformations
Zemel, Richard S., Mozer, Michael C., Hinton, Geoffrey E.
–Neural Information Processing Systems
We describe a model that can recognize two-dimensional shapes in an unsegmented image, independent of their orientation, position, and scale. The model, called TRAFFIC, efficiently represents the structural relation between an object and each of its component features by encoding the fixed viewpoint-invariant transformation from the feature's reference frame to the object's in the weights of a connectionist network. Using a hierarchy of such transformations, with increasing complexity of features at each successive layer, the network can recognize multiple objects in parallel. An implementation of TRAFFIC is described, along with experimental results demonstrating the network's ability to recognize constellations of stars in a viewpoint-invariant manner. 1 INTRODUCTION A key goal of machine vision is to recognize familiar objects in an unsegmented image, independent of their orientation, position, and scale. Massively parallel models have long been used for lower-level vision tasks, such as primitive feature extraction and stereo depth. Models addressing "higher-level" vision have generally been restricted to pattern matching types of problems, in which much of the inherent complexity of the domain has been eliminated or ignored.
Neural Information Processing Systems
Dec-31-1990
- Country:
- North America
- Canada > Ontario
- Toronto (0.30)
- United States (0.94)
- Canada > Ontario
- North America
- Genre:
- Research Report > New Finding (0.34)
- Technology: