Fluent and Low-latency Simultaneous Speech-to-Speech Translation with Self-adaptive Training