RLIP: Relational Language-Image Pre-training for Human-Object Interaction Detection
