Reviews: Learning Perceptual Inference by Contrasting

Neural Information Processing Systems 

One benefit of the contrast module over the RN model that immediately comes to mind is scaling: the contrast module seems to scale linearly in the number of answer choices, whereas the RN produces a quadratic set of pairs.
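
To make the scaling comparison concrete, here is a rough counting sketch. It is not the authors' implementation. It assumes that a contrast-style module compares each answer choice once against a shared summary of all choices, and that an RN-style module processes every unordered pair of choices; the function names are hypothetical.

```python
from itertools import combinations


def contrast_comparisons(num_choices: int) -> int:
    """Comparisons for a contrast-style module (assumed: one per choice,
    each against a single shared summary of all choices)."""
    return num_choices


def pairwise_comparisons(num_choices: int) -> int:
    """Comparisons for an RN-style module (assumed: one per unordered pair)."""
    return sum(1 for _ in combinations(range(num_choices), 2))


if __name__ == "__main__":
    # Print how the two counts grow as the number of answer choices doubles.
    for n in (4, 8, 16):
        print(n, contrast_comparisons(n), pairwise_comparisons(n))
```

Under these assumptions, doubling the number of choices doubles the contrast count but roughly quadruples the pairwise count.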