Separating Knowledge and Perception with Procedural Data
Rodríguez-Muñoz, Adrián, Baradad, Manel, Isola, Phillip, Torralba, Antonio
–arXiv.org Artificial Intelligence
We train representation models with procedural data only, and apply them on visual similarity, classification, and semantic segmentation tasks without further training by using visual memory -- an explicit database of reference image embeddings. Unlike prior work on visual memory, our approach achieves full compartmentalization with respect to all real-world images while retaining strong performance. Compared to a model trained on Places, our procedural model performs within $1\%$ on NIGHTS visual similarity, outperforms by $8\%$ and $15\%$ on CUB200 and Flowers102 fine-grained classification, and is within $10\%$ on ImageNet-1K classification. It also demonstrates strong zero-shot segmentation, achieving an $R^2$ on COCO within $10\%$ of the models trained on real data. Finally, we analyze procedural versus real data models, showing that parts of the same object have dissimilar representations in procedural models, resulting in incorrect searches in memory and explaining the remaining performance gap.
arXiv.org Artificial Intelligence
Aug-19-2025
- Country:
- Asia
- India (0.04)
- Middle East
- Israel (0.04)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.04)
- Singapore (0.04)
- Europe
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- France > Provence-Alpes-Côte d'Azur
- Alpes-Maritimes > Nice (0.04)
- United Kingdom > England
- Greater London > London (0.04)
- Belgium > Brussels-Capital Region
- North America
- Canada > British Columbia
- Vancouver (0.04)
- Dominican Republic (0.04)
- United States
- California > San Diego County
- San Diego (0.04)
- Florida > Orange County
- Orlando (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- California > San Diego County
- Canada > British Columbia
- Asia
- Genre:
- Research Report (0.41)
- Industry:
- Health & Medicine (0.46)
- Information Technology > Security & Privacy (0.68)
- Technology: