Learning in Audio-visual Context: A Review, Analysis, and New Perspective