A Unit-based System and Dataset for Expressive Direct Speech-to-Speech Translation