Can we hop in general? A discussion of benchmark selection and design using the Hopper environment