On the Robustness of Question Rewriting Systems to Questions of Varying Hardness