Can LLMs Detect Their Confabulations? Estimating Reliability in Uncertainty-Aware Language Models