AITopics | unsupervised object-level representation learning

Collaborating Authors

unsupervised object-level representation learning

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Unsupervised Object-Level Representation Learning from Scene Images

Neural Information Processing SystemsDec-25-2025, 05:50:20 GMT

Contrastive self-supervised learning has largely narrowed the gap to supervised pre-training on ImageNet. However, its success highly relies on the object-centric priors of ImageNet, i.e., different augmented views of the same image correspond to the same object. Such a heavily curated constraint becomes immediately infeasible when pre-trained on more complex scene images with many objects. To overcome this limitation, we introduce Object-level Representation Learning (ORL), a new self-supervised learning framework towards scene images. Our key insight is to leverage image-level self-supervised pre-training as the prior to discover object-level semantic correspondence, thus realizing object-level representation learning from scene images. Extensive experiments on COCO show that ORL significantly improves the performance of self-supervised learning on scene images, even surpassing supervised ImageNet pre-training on several downstream tasks. Furthermore, ORL improves the downstream performance when more unlabeled scene images are available, demonstrating its great potential of harnessing unlabeled data in the wild. We hope our approach can motivate future research on more general-purpose unsupervised representation learning from scene data.

name change, scene image, unsupervised object-level representation learning, (3 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Unsupervised Object-Level Representation Learning from Scene Images Supplementary Material

Neural Information Processing SystemsAug-18-2025, 19:38:28 GMT

The results are averaged across five independent runs. The learning rate is decayed by 0.2 at 12 and 16 epochs. The learning rate is initialized as 0.02 with a linear warmup for The implementation details of our most essential image-level baseline, i.e., BYOL [5], are provided Our reproduced results vs. existing results for BYOL. All are based on 800-epoch pre-training on COCO with ResNet-50. Figure 2 visualizes more attention maps generated by BYOL and ORL.

artificial intelligence, machine learning, unsupervised object-level representation learning, (11 more...)

Neural Information Processing Systems

Country:

Asia > Singapore (0.05)
Asia > China > Hong Kong (0.05)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Unsupervised Object-Level Representation Learning from Scene Images

Neural Information Processing SystemsJan-19-2025, 13:09:40 GMT

scene image, self-supervised learning, unsupervised object-level representation learning, (1 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback