A Comparative Benchmark of Large Language Models for Labelling Wind Turbine Maintenance Logs