Filtering startup news with Machine Learning MonkeyLearn Blog
On this new post series, we will analyze hundreds of thousands of articles from TechCrunch, VentureBeat and Recode to discover cool trends and insights about startups. These are the types of questions we aim to answer with this analysis. On this first post, we will cover how Scrapy can be used to get all the articles ever published on these tech news sites and how MonkeyLearn can be used for filtering these crawled articles by whether they are about startups or not. We want to create a dataset of startup news articles that can be used for studying trends later on. On the second post, we will create text classifiers that do analysis on the actual content of the startup articles. Is it a news about acquisition? Finally, on the third post we will use the data we got here, and the classifiers from the second post, to answer our questions.
Mar-21-2017, 21:45:11 GMT