Graph-Structured Policy Learning for Multi-Goal Manipulation Tasks