Cross-validation failure: small sample sizes lead to large error bars