Large Language Models For Text Classification: Case Study And Comprehensive Review
Kostina, Arina, Dikaiakos, Marios D., Stefanidis, Dimosthenis, Pallis, George
–arXiv.org Artificial Intelligence
Unlocking the potential of Large Language Models (LLMs) in data classification represents a promising frontier in natural language processing. In this work, we evaluate the performance of different LLMs in comparison with state-of-the-art deep-learning and machine-learning models, in two different classification scenarios: i) the classification of employees' working locations based on job reviews posted online (multiclass classification), and 2) the classification of news articles as fake or not (binary classification). Our analysis encompasses a diverse range of language models differentiating in size, quantization, and architecture. We explore the impact of alternative prompting techniques and evaluate the models based on the weighted F1-score. Also, we examine the trade-off between performance (F1-score) and time (inference response time) for each language model to provide a more nuanced understanding of each model's practical applicability. Our work reveals significant variations in model responses based on the prompting strategies. We find that LLMs, particularly Llama3 and GPT-4, can outperform traditional methods in complex classification tasks, such as multiclass classification, though at the cost of longer inference times. In contrast, simpler ML models offer better performance-to-time trade-offs in simpler binary classification tasks.
arXiv.org Artificial Intelligence
Jan-14-2025
- Country:
- Europe > Middle East > Cyprus (0.14)
- Genre:
- Overview (1.00)
- Research Report > New Finding (0.46)
- Technology: