Evaluating Large Language Models as Virtual Annotators for Time-series Physical Sensing Data