Neural Networks Should Be Wide Enough to Learn Disconnected Decision Regions