LLM-as-a-Judge: Reassessing the Performance of LLMs in Extractive QA