Systematic assessment of the quality of fit of the stochastic block model for empirical networks
Vaca-Ramírez, Felipe, Peixoto, Tiago P.
We perform a systematic analysis of the quality of fit of the stochastic block model (SBM) for 275 empirical networks spanning a wide range of domains and orders of size magnitude. We employ posterior predictive model checking as a criterion to assess the quality of fit, which involves comparing networks generated by the inferred model with the empirical network, according to a set of network descriptors. We observe that the SBM is capable of providing an accurate description for the majority of networks considered, but falls short of saturating all modeling requirements. In particular, networks possessing a large diameter and slow-mixing random walks tend to be badly described by the SBM. However, contrary to what is often assumed, networks with a high abundance of triangles can be well described by the SBM in many cases. We demonstrate that simple network descriptors can be used to evaluate whether or not the SBM can provide a sufficiently accurate representation, potentially pointing to possible model extensions that can systematically improve the expressiveness of this class of models.
Jan-5-2022
- Country:
- Africa > Uganda
- Eastern Region > Mayuge District (0.04)
- Asia
- China > Heilongjiang Province
- Daqing (0.04)
- Japan > Honshū
- Kansai > Kyoto Prefecture > Kyoto (0.04)
- Philippines (0.04)
- China > Heilongjiang Province
- Europe
- Hungary
- Budapest > Budapest (0.04)
- Hajdú-Bihar County > Debrecen (0.04)
- Belgium > Flanders
- Flemish Brabant > Leuven (0.04)
- Middle East > Cyprus
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Netherlands (0.04)
- Spain (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Austria > Vienna (0.14)
- Hungary
- North America > United States
- Massachusetts (0.04)
- Colorado (0.04)
- Illinois > Cook County
- Chicago (0.04)
- District of Columbia (0.04)
- Oregon (0.04)
- Arizona (0.04)
- New York (0.04)
- Wisconsin (0.04)
- California > Orange County
- Irvine (0.04)
- Florida (0.04)
- Maine (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Oceania
- Australia > Victoria (0.04)
- New Zealand (0.04)
- South America > Brazil (0.04)
- Africa > Uganda
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Education > Educational Setting (0.68)
- Government > Regional Government
- Health & Medicine > Therapeutic Area
- Immunology (0.68)
- Infections and Infectious Diseases (0.68)
- Information Technology (1.00)
- Law (1.00)
- Leisure & Entertainment (1.00)
- Transportation
- Air (0.67)
- Infrastructure & Services (0.93)
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning
- Performance Analysis > Accuracy (1.00)
- Statistical Learning (0.68)
- Natural Language (0.93)
- Representation & Reasoning (1.00)
- Machine Learning
- Communications
- Networks (1.00)
- Social Media (1.00)
- Data Science > Data Mining (1.00)
- Information Management (1.00)
- Artificial Intelligence
- Information Technology