Merge or Not? Learning to Group Faces via Imitation Learning