Large Language Models as User-Agents for Evaluating Task-Oriented-Dialogue Systems