Multi-modal Queried Object Detection in the Wild

Neural Information Processing Systems 

We introduce MQ-Det, an efficient architecture and pre-training strategy design to utilize both textual description with open-set generalization and visual exemplars with rich description granularity as category queries, namely, Multi-modal Queried object Detection, for real-world detection with both open-vocabulary categories and various granularity.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found