A Survey on Graph Neural Networks and Graph Transformers in Computer Vision: A Task-Oriented Perspective
Chen, Chaoqi, Wu, Yushuang, Dai, Qiyuan, Zhou, Hong-Yu, Xu, Mutian, Yang, Sibei, Han, Xiaoguang, Yu, Yizhou
–arXiv.org Artificial Intelligence
Graph Neural Networks (GNNs) have gained momentum in graph representation learning and boosted the state of the art in a variety of areas, such as data mining (\emph{e.g.,} social network analysis and recommender systems), computer vision (\emph{e.g.,} object detection and point cloud learning), and natural language processing (\emph{e.g.,} relation extraction and sequence learning), to name a few. With the emergence of Transformers in natural language processing and computer vision, graph Transformers embed a graph structure into the Transformer architecture to overcome the limitations of local neighborhood aggregation while avoiding strict structural inductive biases. In this paper, we present a comprehensive review of GNNs and graph Transformers in computer vision from a task-oriented perspective. Specifically, we divide their applications in computer vision into five categories according to the modality of input data, \emph{i.e.,} 2D natural images, videos, 3D data, vision + language, and medical images. In each category, we further divide the applications according to a set of vision tasks. Such a task-oriented taxonomy allows us to examine how each task is tackled by different GNN-based approaches and how well these approaches perform. Based on the necessary preliminaries, we provide the definitions and challenges of the tasks, in-depth coverage of the representative approaches, as well as discussions regarding insights, limitations, and future directions.
arXiv.org Artificial Intelligence
Oct-23-2022
- Country:
- North America (0.14)
- Asia
- Vietnam > Long An Province
- Tân An (0.04)
- China
- Hong Kong (0.04)
- Shanghai > Shanghai (0.04)
- Guangdong Province > Shenzhen (0.04)
- Vietnam > Long An Province
- Genre:
- Overview (1.00)
- Industry:
- Health & Medicine
- Diagnostic Medicine > Imaging (1.00)
- Health Care Technology (0.93)
- Therapeutic Area
- Psychiatry/Psychology (1.00)
- Neurology (1.00)
- Health & Medicine
- Technology: