Model Assessment and Selection under Temporal Distribution Shift