Reinforcement Learning for Optimizing Large Qubit Array based Quantum Sensor Circuits