EPO: Hierarchical LLM Agents with Environment Preference Optimization

Open in new window