High-confidence error estimates for learned value functions