Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates