Imitation Learning from Vague Feedback The University of Tokyo, Tokyo, Japan