Towards Gradient-based Time-Series Explanations through a SpatioTemporal Attention Network