Complexity in Complexity: Understanding Visual Complexity Through Structure, Color, and Surprise

Sarıtaş, Karahan, Dayan, Peter, Shen, Tingke, Nath, Surabhi S

Feb-5-2025–arXiv.org Artificial Intelligence

Understanding human perception of visual complexity is crucial in visual cognition. Recently (Shen, et al. 2024) proposed an interpretable segmentation-based model that accurately predicted complexity across various datasets, supporting the idea that complexity can be explained simply. In this work, we investigate the failure of their model to capture structural, color and surprisal contributions to complexity. To this end, we propose Multi-Scale Sobel Gradient which measures spatial intensity variations, Multi-Scale Unique Color which quantifies colorfulness across multiple scales, and surprise scores generated using a Large Language Model. We test our features on existing benchmarks and a novel dataset containing surprising images from Visual Genome. Our experiments demonstrate that modeling complexity accurately is not as simple as previously thought, requiring additional perceptual and semantic factors to address dataset biases. Thus our results offer deeper insights into how humans assess visual complexity.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

Feb-5-2025

arXiv.org PDF

Add feedback

Country:
- Europe > Germany (0.28)
- North America > Mexico
  - Mexico City (0.14)

Genre:
- Research Report > New Finding (0.88)

Industry:
- Education (0.54)

Technology:
- Information Technology
  - Artificial Intelligence
    - Machine Learning > Neural Networks
      - Deep Learning (0.69)
    - Natural Language > Large Language Model (0.69)
  - Human Computer Interaction (0.93)
  - Sensing and Signal Processing > Image Processing (0.93)