Who Made the News? Text Analysis using R, in 7 steps
The dataset used for the analysis was obtained from Kaggle Datasets and is attributed to UCI Machine Learning. The raw tabular data includes information about news category (business, science and technology, entertainment, etc.). The R language has some useful packages for text pre-processing and natural language processing. To read the file, I prefer fread() from the data.table package over read.csv(). For the scope of this program, we limit ourselves to only the headline text and publisher name. We use a "for loop" to filter the data and a custom function to aggregate the headline texts for each publisher.
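The loading, filtering and aggregation steps described above might look roughly like the sketch below. It is not the original program: the column names TITLE, PUBLISHER and CATEGORY, the three sample rows, and the helper aggregate_headlines() are all assumptions made for illustration. The sketch writes a tiny stand-in file so it runs without the Kaggle download, and falls back to read.csv() if data.table is not installed.

```r
# Toy stand-in for the Kaggle file. The column names (TITLE, PUBLISHER,
# CATEGORY) and the rows are assumptions made for this illustration.
csv_path <- tempfile(fileext = ".csv")
writeLines(c(
  "ID,TITLE,PUBLISHER,CATEGORY",
  "1,Fed signals rate pause,Reuters,b",
  "2,New phone unveiled,TechCrunch,t",
  "3,Oil prices climb,Reuters,b"
), csv_path)

# Prefer data.table::fread() when it is installed; otherwise use base read.csv().
if (requireNamespace("data.table", quietly = TRUE)) {
  news <- data.table::fread(csv_path, data.table = FALSE)
} else {
  news <- read.csv(csv_path, stringsAsFactors = FALSE)
}

# Keep only the headline text and the publisher name.
news <- news[, c("TITLE", "PUBLISHER")]

# Hypothetical helper: join all headlines of one publisher into a single string.
aggregate_headlines <- function(df, publisher) {
  paste(df$TITLE[df$PUBLISHER == publisher], collapse = " ")
}

# Loop over the publishers and store one aggregated text per publisher.
publishers <- unique(news$PUBLISHER)
headline_text <- character(length(publishers))
names(headline_text) <- publishers
for (p in publishers) {
  headline_text[p] <- aggregate_headlines(news, p)
}
```

The result, headline_text, is a named character vector with one entry per publisher, which is a convenient input for later text pre-processing. On the full dataset, a grouped operation such as tapply() would likely be faster than an explicit loop; the loop is kept here only to mirror the description.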
Jan-1-2017, 03:10:05 GMT