Zero-shot Generalizable Incremental Learning for Vision-Language Object Detection
–Neural Information Processing Systems
Loss and reparameterization techniques to tackle IVLOD without incurring a significant increase in memory usage. Comprehensive experiments on COCO and ODinW-13 datasets demonstrate that ZiRa effectively safeguards the zero-shot generalization ability of VLODMs while continuously adapting to new tasks. Specifically, after training on ODinW-13 datasets, ZiRa exhibits superior performance compared to CL-DETR and iDETR, boosting zero-shot generalizabil-ity by substantial 13.91 and 8.74 AP, respectively.
Neural Information Processing Systems
Oct-10-2025, 21:47:03 GMT
- Country:
- Africa
- Central African Republic > Ombella-M'Poko
- Bimbo (0.04)
- Rwanda > Kigali
- Kigali (0.04)
- Central African Republic > Ombella-M'Poko
- Asia
- China > Shanghai
- Shanghai (0.04)
- Middle East > Israel
- Tel Aviv District > Tel Aviv (0.04)
- South Korea > Seoul
- Seoul (0.04)
- China > Shanghai
- Europe
- France > Île-de-France
- Italy (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Romania > Sud - Muntenia Development Region
- Giurgiu County > Giurgiu (0.04)
- Switzerland > Zürich
- Zürich (0.14)
- North America
- Canada
- British Columbia > Vancouver (0.04)
- Quebec > Montreal (0.04)
- United States
- California > Los Angeles County
- Long Beach (0.14)
- District of Columbia > Washington (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Utah > Salt Lake County
- Salt Lake City (0.04)
- California > Los Angeles County
- Canada
- Africa
- Genre:
- Research Report > Experimental Study (0.93)
- Industry:
- Information Technology (0.46)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning > Neural Networks (0.93)
- Natural Language > Large Language Model (1.00)
- Vision (1.00)
- Information Technology > Artificial Intelligence