Retrospective Sparse Attention for Efficient Long-Context Generation