Look Ma, No Hands! Agent-Environment Factorization of Egocentric Videos