Fidelity-Induced Interpretable Policy Extraction for Reinforcement Learning