One Policy to Run Them All: an End-to-end Learning Approach to Multi-Embodiment Locomotion