Data Wonderland: Christmas songs from the viewpoint of a data scientist
Whether „Driving Home for Christmas", „Winter Wonderland", „Let it snow!" or „Last Christmas" – every year christmas songs are taking over the charts again. While average Joe is joyfully putting on the next christmas song, the data scientist starts his journey of discovery through the snowy music history. The data set comes from 55000 Song Lyrics, which contains over 55,000 songs. Our goal is to perform a comprehensive analysis of the song texts to identify the Christmas songs. In order to do so, first we add an additional column to the data frame to give each song a label of either Christmas or Not Christmas, where every song which contains the words Christmas, Xmas or X-mas will be labeled as Christmas and otherwise as Not Christmas. This is just the initialization of the labels, later we will apply Naive Bayes to a training set to identify the other Christmas songs.
Dec-19-2017, 23:51:03 GMT
- Country:
- Asia > Middle East > Jordan (0.05)
- Industry:
- Leisure & Entertainment (0.55)
- Media > Music (0.55)
- Technology: