Unified Speech Recognition: A Single Model for Auditory, Visual, and Audiovisual Inputs

Open in new window