LMStyle Benchmark: Evaluating Text Style Transfer for Chatbots