Reliable Confidence Intervals for Information Retrieval Evaluation Using Generative A.I.