Stateful Memory-Augmented Transformers for Efficient Dialogue Modeling