Large-Batch Training for LSTM and Beyond