Evaluating Task-Oriented Dialogue Consistency through Constraint Satisfaction