Cross-domain Few-shot Object Detection with Multi-modal Textual Enrichment
