DuetServe: Harmonizing Prefill and Decode for LLM Serving via Adaptive GPU Multiplexing