A Framework for Evaluating LLMs Under Task Indeterminacy