STEEL: Singularity-aware Reinforcement Learning