Generalized Emphatic Temporal Difference Learning: Bias-Variance Analysis