Composite Gaussian Processes Flows for Learning Discontinuous Multimodal Policies