Context Filtering with Reward Modeling in Question Answering