Weakly-Supervised ReinforcementLearningfor ControllableBehavior