Feature selection with gradient descent on two-layer networks in low-rotation regimes