MiGrATe: Mixed-Policy GRPO for Adaptation at Test-Time