Open-vocabulary Attribute Detection

Bravo, María A., Mittal, Sudhanshu, Ging, Simon, Brox, Thomas

Mar-8-2023, arXiv.org Artificial Intelligence

Vision-language modeling has enabled open-vocabulary tasks where predictions can be queried using any text prompt in a zero-shot manner. Existing open-vocabulary tasks focus on object classes, whereas research on object attributes is limited due to the lack of a reliable attribute-focused evaluation benchmark. This paper introduces the Open-Vocabulary Attribute Detection (OVAD) task and the corresponding OVAD benchmark. The objective of the novel task and benchmark is to probe object-level attribute information learned by vision-language models. To this end, we created a clean and densely annotated test set covering 117 attribute classes on the 80 object classes of MS COCO. It includes positive and negative annotations, which enables open-vocabulary evaluation. Overall, the benchmark consists of 1.4 million annotations. For reference, we provide a first baseline method for open-vocabulary attribute detection. Moreover, we demonstrate the benchmark's value by studying the attribute detection performance of several foundation models. Project page: https://ovad-benchmark.github.io
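To illustrate why explicit positive and negative annotations matter for evaluation, the following is a minimal sketch of a per-attribute average precision computation that scores only objects with an explicit label and skips unannotated ones. It is an illustration under stated assumptions, not the benchmark's official evaluation code: the label encoding (1 = positive, 0 = negative, -1 = unannotated), the function names, and the attribute names in the example are all hypothetical.

```python
import math
from typing import Dict, Sequence, Tuple


def average_precision(scores: Sequence[float], labels: Sequence[int]) -> float:
    """Average precision for one attribute over explicitly annotated objects.

    Assumed (hypothetical) label encoding:
    1 = positive, 0 = negative, -1 = unannotated (ignored).
    """
    # Keep only objects that carry an explicit positive or negative label.
    pairs = [(s, l) for s, l in zip(scores, labels) if l in (0, 1)]
    # Rank objects from the highest to the lowest predicted score.
    pairs.sort(key=lambda p: p[0], reverse=True)

    num_pos = sum(l for _, l in pairs)
    if num_pos == 0:
        return float("nan")  # AP is undefined without any positive

    hits = 0
    precision_sum = 0.0
    for rank, (_, label) in enumerate(pairs, start=1):
        if label == 1:
            hits += 1
            precision_sum += hits / rank  # precision at this recall point
    return precision_sum / num_pos


def mean_average_precision(
    per_attribute: Dict[str, Tuple[Sequence[float], Sequence[int]]]
) -> float:
    """Mean of the per-attribute APs, skipping attributes with undefined AP."""
    aps = [average_precision(s, l) for s, l in per_attribute.values()]
    valid = [ap for ap in aps if not math.isnan(ap)]
    return sum(valid) / len(valid) if valid else float("nan")


# Hypothetical attribute names and scores, for illustration only.
example = {
    "red": ([0.9, 0.8, 0.7, 0.6], [1, -1, 0, 1]),
    "wooden": ([0.2, 0.9], [1, 0]),
}
```

In this sketch, an object without an annotation for a given attribute neither rewards nor penalizes the model, whereas an explicit negative that is ranked highly lowers the precision of every positive ranked below it.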

annotation, machine learning, natural language

Country:
- North America > United States (0.28)
- Europe > Germany
  - Baden-Württemberg > Freiburg (0.04)
- Asia
  - Middle East > Iran
    - Tehran Province > Tehran (0.04)
  - China > Guangxi Province
    - Nanning (0.04)

Genre:
- Research Report (0.50)

Industry:
- Transportation (0.68)
- Leisure & Entertainment > Sports (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Vision (1.00)
  - Natural Language (1.00)
  - Representation & Reasoning > Object-Oriented Architecture (0.90)
  - Machine Learning > Performance Analysis
    - Accuracy (0.46)
