Learning Joint and Individual Structure in Network Data with Covariates

James, Carson, Yuan, Dongbang, Gaynanova, Irina, Arroyo, Jesús

Jun-12-2024–arXiv.org Machine Learning

Network data is ubiquitous in many disciplines and application domains, including computer science, statistics, biology, and physics. These data, encoding relationships between units represented as nodes, are often accompanied by additional information about the nodes, usually referred to as node covariates, attributes, or metadata (Newman and Clauset, 2016; Liu, 2019; Chunaev, 2020). In these situations, a common goal is to understand the associations between the network connectivity and the node covariates. In our example, we consider international food commodity trade data represented as a network, where the nodes correspond to different countries and edge weights encode food commodity trade volumes between corresponding countries. The covariates at each node consist of economic and geographic information for each country, such as gross domestic product (GDP) per capita, birth rate and region. We wish to exploit that both datasets contain information about the nodes in order to better understand the structure of the network, node covariates and their relationship. Specifically, we seek to understand how economic and geographic factors explain the observed trade between countries, and identify additional information in the network that cannot be explained solely by these variables. There has been substantial work that incorporates network and node covariate information. Some examples include methods that use node covariates to improve community detection (Binkiewicz et al., 2017; Huang et al., 2023), dimensionality reduction (Zhao et al., 2022), regression with network information (Li et al., 2019) and mixed effect models for network edges (Hoff, 2005).

covariate, individual component, matrix, (17 more...)

arXiv.org Machine Learning

Jun-12-2024

arXiv.org PDF

Add feedback

Country:
- Oceania (0.04)
- Africa (0.04)
- Asia > Russia (0.04)
- South America
  - Peru (0.04)
  - Guyana (0.04)
  - Ecuador (0.04)
  - Bolivia (0.04)
  - Brazil (0.04)
  - Chile (0.04)
  - Paraguay (0.04)
  - Argentina (0.04)
  - Colombia > Meta Department (0.04)
- North America
  - Aruba (0.04)
  - Costa Rica (0.04)
  - Bermuda (0.04)
  - Dominican Republic (0.04)
  - Honduras (0.04)
  - Belize (0.04)
  - Nicaragua (0.04)
  - Guatemala (0.04)
  - Barbados (0.04)
  - Mexico (0.04)
  - Jamaica (0.04)
  - Canada (0.04)
  - El Salvador (0.04)
  - Saint Lucia (0.04)
  - The Bahamas (0.04)
  - Cuba (0.04)
  - Saint Vincent and the Grenadines (0.04)
  - United States
    - Michigan (0.04)
    - Texas > Brazos County
      - College Station (0.04)
    - Tennessee > Anderson County
      - Oak Ridge (0.14)
    - California > San Diego County
      - San Diego (0.04)
- Europe
  - Austria (0.04)
  - Bulgaria (0.04)
  - Andorra (0.04)
  - Poland (0.04)
  - Germany (0.04)
  - Spain (0.04)
  - Netherlands (0.04)
  - Iceland (0.04)
  - Switzerland (0.04)
  - Denmark (0.04)
  - Finland (0.04)
  - Albania (0.04)
  - Norway (0.04)
  - Slovakia (0.04)
  - Slovenia (0.04)
  - Serbia (0.04)
  - France (0.04)
  - Italy (0.04)
  - Russia (0.04)
  - Greece (0.04)
  - Latvia (0.04)
  - Lithuania (0.04)
  - Estonia (0.04)
  - Romania (0.04)
  - Belgium (0.04)
  - Ukraine (0.04)
  - Croatia (0.04)
  - Sweden (0.04)
  - Czechia (0.04)
  - United Kingdom (0.04)
  - Portugal (0.04)
  - Hungary (0.04)
  - Belarus (0.04)
  - Ireland (0.04)
  - Middle East
    - Malta (0.04)
    - Cyprus > Nicosia
      - Nicosia (0.04)

Genre:
- Research Report (1.00)

Industry:
- Banking & Finance (0.86)
- Government (0.67)
- Health & Medicine > Therapeutic Area (0.67)
- Telecommunications > Networks (0.62)
- Information Technology > Networks (0.62)

Technology:
- Information Technology
  - Data Science (1.00)
  - Communications > Networks (1.00)
  - Artificial Intelligence
    - Representation & Reasoning > Optimization (0.93)
    - Machine Learning > Statistical Learning (0.87)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found