GoodReads: Webscraping and Text Analysis with R (Part 1)

Sep-9-2016, 15:50:37 GMT–#artificialintelligence

Inspired by this article about sentiment analysis and this guide to webscraping, I decided to get my hands dirty by scraping and analyzing a sample of reviews from the website Goodreads. The goal of this project is to demonstrate a complete example, from data collection to machine learning analysis, and to illustrate a few of the dead ends and mistakes I encountered along the way. We'll be looking at the reviews for five popular romance books. I deliberately chose books in the same genre so that the review text would be more homogeneous a priori. These five books are also popular enough that I can easily pull a few thousand reviews for each, yielding a sizable corpus with minimal effort. If you don't like romance books, feel free to replicate the analysis with your genre of choice!

artificial intelligence, natural language, text processing

