GR-NLP-TOOLKIT: An Open-Source NLP Toolkit for Modern Greek
Loukas, Lefteris, Smyrnioudis, Nikolaos, Dikonomaki, Chrysa, Barbakos, Spyros, Toumazatos, Anastasios, Koutsikakis, John, Kyriakakis, Manolis, Georgiou, Mary, Vassos, Stavros, Pavlopoulos, John, Androutsopoulos, Ion
–arXiv.org Artificial Intelligence
We present GR-NLP-TOOLKIT, an open-source natural language processing (NLP) toolkit developed specifically for modern Greek. The toolkit provides state-of-the-art performance in five core NLP tasks, namely part-of-speech tagging, morphological tagging, dependency parsing, named entity recognition, and Greeklishto-Greek transliteration. The toolkit is based on pre-trained Transformers, it is freely available, and can be easily installed in Python (pip install gr-nlp-toolkit). It is also accessible through a demonstration platform on HuggingFace, along with a publicly available API for non-commercial use. We discuss the functionality provided for each task, the underlying methods, experiments against comparable open-source toolkits, and future possible enhancements. The toolkit is available at: https://github.com/nlpaueb/gr-nlp-toolkit
arXiv.org Artificial Intelligence
Dec-11-2024
- Country:
- North America
- Europe
- Middle East > Cyprus (0.04)
- Sweden > Vaestra Goetaland
- Gothenburg (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Norway > Western Norway
- Italy > Liguria
- Genoa (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Greece
- Central Macedonia > Thessaloniki (0.04)
- Attica > Athens (0.04)
- Genre:
- Research Report (0.40)
- Technology: