Adversarial Sparse Transformer for Time Series Forecasting

Neural Information Processing Systems 

Even probabilistic prediction based on likelihood estimation suffers from these problems in the same way.