Advancing LLM-Based Security Automation with Customized Group Relative Policy Optimization for Zero-Touch Networks