AI that understands speech by looking as well as hearing