Investigating the Robustness of Counterfactual Learning to Rank Models: A Reproducibility Study