RegionCLIP: Region-based Language-Image Pretraining

Open in new window