Compiler Optimization for Quantum Computing Using Reinforcement Learning