Efficient Symbolic Policy Learning with Differentiable Symbolic Expression