Applying statistical learning theory to deep learning