tagE: Enabling an Embodied Agent to Understand Human Instructions