Distributed solar generation forecasting using attention-based deep neural networks for cloud movement prediction