Canonical Trends: Detecting Trend Setters in Web Data

Biessmann, Felix, Papaioannou, Jens-Michalis, Braun, Mikio, Harth, Andreas

Jun-27-2012–arXiv.org Machine Learning

Much of the information available on the web is copied, reused, or rephrased. When multiple web sources pick up the same piece of information, the phenomenon is often called a trend. A central problem in web data mining is detecting the web sources that are first to publish information that later gives rise to a trend. We present a simple and efficient method for finding the trends that dominate a pool of web sources and for identifying the sources that publish trend-relevant information before others. We validate our approach on real data collected from influential technology news feeds.
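The abstract does not describe the method itself, so the following is only a toy illustration of what "publishing before others" can mean, not the authors' algorithm. It assumes, hypothetically, that each source is reduced to a time series of keyword counts and that the rest of the pool is summarized by a single aggregate series. The names `pearson`, `best_lead`, `early`, and `pool` are invented for this sketch. It scans a range of time lags and reports the lag at which the source's activity correlates best with the pool's later activity.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mx = sum(x) / n
    my = sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / sqrt(vx * vy)


def best_lead(source, pool, max_lag):
    """Find the lag in 0..max_lag that maximizes corr(source[t], pool[t + lag]).

    A large positive lag with high correlation suggests that the source
    mentions a topic before the rest of the pool does.
    """
    best_lag, best_r = 0, -1.0
    for lag in range(max_lag + 1):
        x = source[:len(source) - lag]  # source activity at time t
        y = pool[lag:]                  # pool activity at time t + lag
        r = pearson(x, y)
        if r > best_r:
            best_lag, best_r = lag, r
    return best_lag, best_r


# Hypothetical keyword counts per time step (made-up numbers, not from the paper).
early = [0, 5, 9, 4, 1, 0, 0, 0, 0, 0]  # one candidate trend-setting source
pool = [0, 0, 0, 5, 9, 4, 1, 0, 0, 0]   # aggregate activity of the other sources

lag, r = best_lead(early, pool, max_lag=4)
print(f"best lag: {lag} steps, correlation: {r:.2f}")
```

In this toy data the pool repeats the source's burst two steps later, so the scan selects a lag of two steps. Real feeds would need a richer text representation than a single keyword count, and a comparison across many sources rather than one.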

data mining, machine learning, natural language

Country:
- Europe (0.68)

Genre:
- Research Report (1.00)

Industry:
- Media > News (0.52)

Technology:
- Information Technology
  - Data Science > Data Mining (1.00)
  - Communications > Web (1.00)
  - Artificial Intelligence
    - Natural Language (1.00)
    - Machine Learning > Statistical Learning (0.46)
