CleanS2S: Single-file Framework for Proactive Speech-to-Speech Interaction