Why Less is More (Sometimes): A Theory of Data Curation