Compression, transduction, and creation: a unified framework for evaluating natural language generation