Reinforcing LLM Agents via Policy Optimization with Action Decomposition