Investigating Ensemble Methods for Model Robustness Improvement of Text Classifiers