Combining NLP and Machine Learning for Document Classification