Unsupervised Deep Learning of Mid-Level Video Representation for Action Recognition