MindFlow+: A Self-Evolving Agent for E-Commerce Customer Service
Gong, Ming, Huang, Xucheng, Xu, Ziheng, Asari, Vijayan K.
–arXiv.org Artificial Intelligence
High-quality dialogue is crucial for e-commerce customer service, yet traditional intent-based systems struggle with dynamic, multi-turn interactions. We present MindFlow+, a self-evolving dialogue agent that learns domain-specific behavior by combining large language models (LLMs) with imitation learning and offline reinforcement learning (RL). MindFlow+ introduces two data-centric mechanisms to guide learning: tool-augmented demonstration construction, which exposes the model to knowledge-enhanced and agentic (ReAct-style) interactions for effective tool use; and reward-conditioned data modeling, which aligns responses with task-specific goals using reward signals. To evaluate the model's role in response generation, we introduce the AI Contribution Ratio, a novel metric quantifying AI involvement in dialogue. Experiments on real-world e-commerce conversations show that MindFlow+ outperforms strong baselines in contextual relevance, flexibility, and task accuracy. These results demonstrate the potential of combining LLMs tool reasoning, and reward-guided learning to build domain-specialized, context-aware dialogue systems.
arXiv.org Artificial Intelligence
Jul-28-2025
- Country:
- Asia > China
- Europe > Germany
- Bavaria > Middle Franconia > Nuremberg (0.04)
- North America > United States
- Ohio > Montgomery County
- Dayton (0.04)
- Virginia > Norfolk City County
- Norfolk (0.04)
- Ohio > Montgomery County
- Genre:
- Research Report > New Finding (0.48)
- Industry:
- Information Technology > Services > e-Commerce Services (0.96)
- Technology: