Two-StreamNetworkforSignLanguageRecognition andTranslation
–Neural Information Processing Systems
Weadoptidentical dataaugmentationsforRGBvideos andheatmap sequences to maintain spatial and temporal consistency. SingleStream-SLTwhich only utilizes asingle video encoder without modelling keypoints serves as our baseline. TwoStream-SLT-V/K/J denotes the network where only one translation network is attached onto the video head/keypoint head/joint head. The averaged probabilities are used to decode text sequences. In each of the variants, only a single translation network is appended onto the video head, keypoint head, or joint head.
Neural Information Processing Systems
Feb-9-2026, 15:45:10 GMT
- Country:
- South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.05)
- Technology: