Dataset Representativeness and Downstream Task Fairness