Meet JEANIE: a Similarity Measure for 3D Skeleton Sequences via Temporal-Viewpoint Alignment
Wang, Lei, Liu, Jun, Zheng, Liang, Gedeon, Tom, Koniusz, Piotr
–arXiv.org Artificial Intelligence
Video sequences exhibit significant nuisance the-art results on NTU-60, NTU-120, Kinetics-skeleton and variations (undesired effects) of speed of actions, temporal UWA3D Multiview Activity II on supervised and unsupervised locations, and subjects' poses, leading to temporalviewpoint FSAR, and their meta-learning inspired fusion. Thus, we propose Joint tEmporal and cAmera viewpoiNt alIgnmEnt 1 Introduction (JEANIE) for sequence pairs. In particular, we focus on 3D skeleton sequences whose camera and subjects' poses can be Action recognition is a key topic in computer vision, easily manipulated in 3D. We evaluate JEANIE on skeletal with applications in video surveillance [105, 109, 120], Few-shot Action Recognition (FSAR), where matching well human-computer interaction, sport analysis and robotics. Given a query sequence, we create its several views labeling videos for 3D skeleton sequences is laborious, and by simulating several camera locations. For a support sequence, such pipelines need to be retrained or finetuned for new class we match it with view-simulated query sequences, concepts. Specifically, two-stream neural network [24, 23, 124] and 3D Convolutional each support temporal block can be matched to the Neural Network (3D CNN) [99, 9] aggregate framewise query temporal block with the same or adjacent (next) temporal and temporal block representations, respectively. However, index, and adjacent camera views to achieve joint local such networks are trained on large-scale datasets such temporal-viewpoint warping. JEANIE selects the smallest as Kinetics [9, 116, 110, 118] under a fixed set of training distance among matching paths with different temporalviewpoint classes. We also propose an Few-shot Learning (FSL) models for action recognition, unsupervised FSAR akin to clustering of sequences with termed Few-shot Action Recognition (FSAR), that rapidly JEANIE as a distance measure. JEANIE achieves state-of-adapt to novel classes given few training samples [77, 129, 31, 19, 138, 7, 112]. L. Wang is a Research Fellow at the School of Computing, the Australian J. Liu is an Assistant Professor at the Singapore University of Technology L. Zheng is an Associate Professor in the School of Computing, ANU.
arXiv.org Artificial Intelligence
Feb-7-2024
- Country:
- North America > United States (0.67)
- Genre:
- Research Report (0.63)
- Industry:
- Education (0.85)
- Technology: