Unsupervised deep clustering and reinforcement learning can accurately segment MRI brain tumors with very small training sets