Human Still Wins over LLM: An Empirical Study of Active Learning on Domain-Specific Annotation Tasks