Toward Robust Graph Semi-Supervised Learning against Extreme Data Scarcity

Ding, Kaize, Nouri, Elnaz, Zheng, Guoqing, Liu, Huan, White, Ryen

Dec-11-2022–arXiv.org Artificial Intelligence

The success of graph neural networks on graph-based web mining highly relies on abundant human-annotated data, which is laborious to obtain in practice. When only few labeled nodes are available, how to improve their robustness is a key to achieve replicable and sustainable graph semi-supervised learning. Though self-training has been shown to be powerful for semi-supervised learning, its application on graph-structured data may fail because (1) larger receptive fields are not leveraged to capture long-range node interactions, which exacerbates the difficulty of propagating feature-label patterns from labeled nodes to unlabeled nodes; and (2) limited labeled data makes it challenging to learn well-separated decision boundaries for different node classes without explicitly capturing the underlying semantic structure. To address the challenges of capturing informative structural and semantic knowledge, we propose a new graph data augmentation framework, AGST (Augmented Graph Self-Training), which is built with two new (i.e., structural and semantic) augmentation modules on top of a decoupled GST backbone. In this work, we investigate whether this novel framework can learn a robust graph predictive model under the low-data context. We conduct comprehensive evaluations on semi-supervised node classification under different scenarios of limited labeled-node data. The experimental results demonstrate the unique contributions of the novel data augmentation framework for node classification with few labeled data.

artificial intelligence, machine learning, node, (19 more...)

arXiv.org Artificial Intelligence

Dec-11-2022

arXiv.org PDF

Add feedback

Genre:
- Research Report > New Finding (0.48)

Industry:
- Education (0.48)
- Information Technology (0.46)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Inductive Learning (0.91)
  - Statistical Learning (0.88)
  - Unsupervised or Indirectly Supervised Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found