Outlier Detection for Text Data : An Extended Version