Object Detection with Grammar Models

Mar-15-2024, 03:59:20 GMT–Neural Information Processing Systems

Compositional models provide an elegant formalism for representing the visual appearance of highly variable objects. While such models are appealing from a theoretical point of view, it has been difficult to demonstrate that they lead to performance advantages on challenging datasets. Here we develop a grammar model for person detection and show that it outperforms previous high-performance systems on the PASCAL benchmark. Our model represents people using a hierarchy of deformable parts, variable structure and an explicit model of occlusion for partially visible objects. To train the model, we introduce a new discriminative framework for learning structured prediction models from weakly-labeled data.

artificial intelligence, inductive learning, machine learning, (16 more...)

Neural Information Processing Systems

Mar-15-2024, 03:59:20 GMT

Conferences PDF

Add feedback

Country:
- North America > United States (0.14)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning
    - Inductive Learning (0.89)
    - Learning Graphical Models > Undirected Networks
      - Markov Models (0.46)
    - Supervised Learning (0.67)
  - Vision (1.00)