Canonical Trends: Detecting Trend Setters in Web Data
Biessmann, Felix, Papaioannou, Jens-Michalis, Braun, Mikio, Harth, Andreas
Much information available on the web is copied, reused or rephrased. The phenomenon that multiple web sources pick up certain information is often called trend. A central problem in the context of web data mining is to detect those web sources that are first to publish information which will give rise to a trend. We present a simple and efficient method for finding trends dominating a pool of web sources and identifying those web sources that publish the information relevant to a trend before others. We validate our approach on real data collected from influential technology news feeds.
Jun-27-2012
- Country:
- Europe > United Kingdom > Scotland (0.14)
- Genre:
- Research Report (1.00)
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning > Statistical Learning (0.46)
- Natural Language (1.00)
- Communications > Web (1.00)
- Data Science > Data Mining (1.00)
- Artificial Intelligence
- Information Technology