Towards Better Instruction Following Language Models for Chinese: Investigating the Impact of Training Data and Evaluation