task interesting [R1,R2], novel, and compelling [R1,R4]; our approach elegant [R4] and interesting [R2]; and our

Neural Information Processing Systems 

We also asked the human players to rate the fluency and relevance of the questions.