Conditioning Hierarchical Reinforcement Learning on Flexible Constraints