Reading Scene Text in Deep Convolutional Sequences