Recurrent computations for visual pattern completion

Tang, Hanlin, Schrimpf, Martin, Lotter, Bill, Moerman, Charlotte, Paredes, Ana, Caro, Josue Ortega, Hardesty, Walter, Cox, David, Kreiman, Gabriel

arXiv.org Artificial Intelligence 

These authors contributed equally To whom correspondence should be addressed at gabriel.kreiman@tch.harvard.edu Children's Hospital, Harvard Medical School Text Statistics: Number of words in abstract: 164 Number of words in significance statement: 100 Number of Figures: 4 Number of Tables: 0 Number of Supplementary Figures: 10 Abstract Making inferences from partial information constitutes a critical aspect of cognition. During visual perception, pattern completion enables recognition of poorly visible or occluded objects. We combined psychophysics, physiology and computational models to test the hypothesis that pattern completion is implemented by recurrent computations and present three pieces of evidence that are consistent with this hypothesis. First, subjects robustly recognized objects even when rendered 15% visible, but recognition was largely impaired when processing was interrupted by backward masking. Second, invasive physiological responses along the human ventral cortex exhibited visually selective responses to partially visible objects that were delayed compared to whole objects, suggesting the need for additional computations. These physiological delays were correlated with the effects of backward masking. Third, state-of-the-art feed-forward computational architectures were not robust to partial visibility. However, recognition performance was recovered when the model was augmented with attractor-based recurrent connectivity. These results provide a strong argument of plausibility for the role of recurrent computations in making visual inferences from partial information. Significance Statement The ability to complete patterns and interpret partial information is a central property of intelligence. Deep convolutional network architectures have proved successful in labeling whole objects in images and capturing the initial 150 ms of processing along the ventral visual cortex. This study shows that human object recognition abilities remain robust when only small amounts of information are available due to heavy occlusion, but the performance of bottom-up computational models is impaired under limited visibility.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found