Accelerating Retrieval-Augmented Language Model Serving with Speculation