Exploring the Spectrum of Visio-Linguistic Compositionality and Recognition