Self-Supervised Visual Representation Learning from Hierarchical Grouping

Neural Information Processing Systems 

We create a framework for bootstrapping visual representation learning from a primitive visual grouping capability. We operationalize grouping via a contour detector that partitions an image into regions, followed by merging of those regions into a tree hierarchy.