Interactions in Information Spread
–arXiv.org Artificial Intelligence
Since the development of writing 5000 years ago, human-generated data gets produced at an ever-increasing pace. Classical archival methods aimed at easing information retrieval. Nowadays, archiving is not enough anymore. The amount of data that gets generated daily is beyond human comprehension, and appeals for new information retrieval strategies. Instead of referencing every single data piece as in traditional archival techniques, a more relevant approach consists in understanding the overall ideas conveyed in data flows. To spot such general tendencies, a precise comprehension of the underlying data generation mechanisms is required. In the rich literature tackling this problem, the question of information interaction remains nearly unexplored. First, we investigate the frequency of such interactions. Building on recent advances made in Stochastic Block Modelling, we explore the role of interactions in several social networks. We find that interactions are rare in these datasets. Then, we wonder how interactions evolve over time. Earlier data pieces should not have an everlasting influence on ulterior data generation mechanisms. We model this using dynamic network inference advances. We conclude that interactions are brief. Finally, we design a framework that jointly models rare and brief interactions based on Dirichlet-Hawkes Processes. We argue that this new class of models fits brief and sparse interaction modelling. We conduct a large-scale application on Reddit and find that interactions play a minor role in this dataset. From a broader perspective, our work results in a collection of highly flexible models and in a rethinking of core concepts of machine learning. Consequently, we open a range of novel perspectives both in terms of real-world applications and in terms of technical contributions to machine learning.
arXiv.org Artificial Intelligence
Sep-16-2022
- Country:
- Africa
- Middle East > Egypt (0.04)
- Mozambique (0.04)
- Asia
- Europe
- France > Île-de-France
- Ukraine
- Crimea (0.04)
- Kyiv Oblast > Chernobyl (0.04)
- Eastern Europe (0.04)
- Russia (0.04)
- Italy
- United Kingdom > England
- Greater London > London (0.04)
- Finland (0.04)
- Netherlands
- North Holland > Amsterdam (0.04)
- South Holland > Dordrecht (0.04)
- Spain (0.04)
- Germany (0.04)
- Austria (0.04)
- North America
- Canada (0.04)
- Greenland (0.04)
- Mexico (0.04)
- Puerto Rico (0.04)
- The Bahamas (0.04)
- United States
- California (0.04)
- Hawaii (0.04)
- Louisiana (0.04)
- Utah (0.04)
- Oceania
- Australia (0.04)
- Samoa (0.04)
- Solomon Islands (0.04)
- South America
- Bolivia (0.04)
- Brazil (0.14)
- Chile > Santiago Metropolitan Region
- Santiago Province > Santiago (0.04)
- Africa
- Genre:
- Overview (0.92)
- Research Report > New Finding (1.00)
- Industry:
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning
- Learning Graphical Models > Directed Networks
- Bayesian Learning (1.00)
- Neural Networks (1.00)
- Performance Analysis > Accuracy (0.92)
- Statistical Learning > Clustering (1.00)
- Learning Graphical Models > Directed Networks
- Natural Language
- Information Retrieval (1.00)
- Text Processing (0.92)
- Representation & Reasoning
- Mathematical & Statistical Methods (0.67)
- Optimization (1.00)
- Personal Assistant Systems (0.67)
- Uncertainty > Bayesian Inference (1.00)
- Machine Learning
- Communications > Social Media (1.00)
- Data Science > Data Mining (1.00)
- Artificial Intelligence
- Information Technology