Exploring Language Model Generalization in Low-Resource Extractive QA