Echoes of Human Malice in Agents: Benchmarking LLMs for Multi-Turn Online Harassment Attacks