TokenFlow: Responsive LLM Text Streaming Serving under Request Burst via Preemptive Scheduling