Comparison Between Global Vs Local Normalization of Tweets, and Various Distances
In the previous example we used clustering to see if an apparent pattern exists within Brexit tweets. We found out that we have three distinct patterns, the leave, the referendum, and Brexit. This in itself helps us think that we may even create a classifier that can identify if the tweet writer is pro or agains an issue automatically, with no human intervention. Let's get back to the issues related to clustering. To use the clustering algorithm we had to map 2 tweets at the time to a binary vector.
Nov-7-2016, 15:15:03 GMT