Multimodal learning with graphs
Ektefaie, Yasha, Dasoulas, George, Noori, Ayush, Farhat, Maha, Zitnik, Marinka
–arXiv.org Artificial Intelligence
Deep learning on graphs has contributed to breakthroughs in biology [1, 2], chemistry [3, 4], physics [5, 6], and the social sciences [7]. The predominant use of graph neural networks [8] is to learn representations of various graph components--such as nodes, edges, subgraphs, and entire graphs--based on neural message passing strategies. The learned representations are used for downstream tasks, including label prediction via semi-supervised learning [9], self-supervised learning [10], and graph design and generation [11, 12]. In most existing applications, datasets explicitly describe graphs in the form of nodes, edges, and additional information representing contextual knowledge, such as node, edge, and graph attributes. Modeling complex systems requires measurements that describe the same objects from different perspectives, at different scales, or through multiple modalities, such as images, sensor readings, language sequences, and compact mathematical statements. Multimodal learning [13] studies how such heterogeneous, complex descriptors can be optimized to create learning systems that are broadly generalizable, robust to changes in the underlying data distributions, and can train more with less labeled data.
arXiv.org Artificial Intelligence
Jan-23-2023
- Country:
- North America > United States
- Wisconsin > Dane County
- Madison (0.04)
- New York > New York County
- New York City (0.04)
- Massachusetts
- Suffolk County > Boston (0.04)
- Middlesex County > Cambridge (0.04)
- California > San Diego County
- San Diego (0.04)
- Wisconsin > Dane County
- Europe > Italy
- Asia > China
- Hong Kong (0.04)
- Guangdong Province > Shenzhen (0.04)
- Africa > Middle East
- Egypt (0.04)
- North America > United States
- Genre:
- Research Report (0.64)
- Overview (0.46)
- Industry:
- Technology: