Towards Generalizable Reinforcement Learning for Trade Execution