An artificial neural network to acquire grounded representations of robot actions and language