Towards Automated Error Discovery: A Study in Conversational AI