Unleashing Region Understanding in Intermediate Layers for MLLM-based Referring Expression Generation