Integrating Language Models into Direct Speech Translation: An Inference-Time Solution to Control Gender Inflection