Supplementary: Opening the Vocabulary of Egocentric Actions

Neural Information Processing Systems 

Open-vocabulary object detection via vision and language knowledge distillation.