Probing and Enhancing the Robustness of GNN-based QEC Decoders with Reinforcement Learning