Reinforcing Language Agents via Policy Optimization with Action Decomposition