Language Supervised Human Action Recognition with Salient Fusion: Construction Worker Action Recognition as a Use Case