Customer-R1: Personalized Simulation of Human Behaviors via RL-based LLM Agent in Online Shopping