Human-Object Interaction Detection Collaborated with Large Relation-driven Diffusion Models
Neural Information Processing Systems
Prevalent human-object interaction (HOI) detection approaches typically leverage large-scale visual-linguistic models to help recognize events involving humans and objects. Though promising, models trained via contrastive learning on text-image pairs often neglect mid- and low-level visual cues and struggle with compositional reasoning.
May-28-2025, 21:32:58 GMT
- Genre:
- Research Report > Experimental Study (0.93)
- Industry:
- Information Technology > Security & Privacy (0.46)
- Technology: