A Short Introduction to Using Word2Vec for Text Classification
Machine learning applications on natural language are an extremely important tool in the data scientist's toolbox. Use cases can include auto-detecting the language of a website, detecting spam in your spam filter, or auto-completing search queries. When you're working with text data, an important use case is text classification, where the data scientist is tasked with creating an algorithm that can figure out what a bit of text is all about (what is the tagline) based on what is written in the document. This can be used in a myriad of examples we see everyday, tagging things such as blog articles, app descriptions, and reviews. In many cases traditional text classification can be difficult to scale, because as the order of the taxonomy count increases, the amount of training required increases as well.
Mar-26-2016, 17:25:26 GMT
- Technology: