Hacking Scikit-Learn's Vectorizers – Towards Data Science
Natural Language Processing is a fascinating field. Since all predictors are extracted from the text, data cleaning, preprocessing and feature engineering have an even more significant impact on the model's performance. Having worked for a few months on a machine learning project of my own involving NLP, I've learned one thing or two about Scikit-Learn's vectorizers that I would like to share. Hopefully, by the end of this post, you will have some new ideas to use on your next project. As you know machines, as advanced as they may be, are not capable of understanding words and sentences in the same manner as humans do.
Apr-4-2018, 22:44:29 GMT
- Technology: