AITopics | ethiopian language

Enhancing Multi-Label Emotion Analysis and Corresponding Intensities for Ethiopian Languages

Belay, Tadesse Destaw, Gete, Dawit Ketema, Ayele, Abinew Ali, Kolesnikova, Olga, Sidorov, Grigori, Yimam, Seid Muhie

arXiv.org Artificial Intelligence, Mar-23-2025

In this digital world, people freely express their emotions using different social media platforms. As a result, building and integrating emotion-understanding models is vital for various human-computer interaction tasks such as decision-making, product and customer feedback analysis, political promotions, marketing research, and social media monitoring. As users express different emotions simultaneously in a single instance, annotating emotions in a multi-label setting, as in the EthioEmo (Belay et al., 2025) dataset, effectively captures this dynamic. Additionally, incorporating intensity, or the degree of emotion, is crucial, as emotions can differ significantly in their expressive strength and impact. This intensity is significant for assessing whether further action is necessary in decision-making processes, especially concerning negative emotions in applications such as healthcare and mental health studies. To enhance the EthioEmo dataset, we include annotations for the intensity of each labeled emotion. Furthermore, we evaluate various state-of-the-art encoder-only Pretrained Language Models (PLMs) and decoder-only Large Language Models (LLMs) to provide comprehensive benchmarking.

large language model, machine learning, natural language, (20 more...)

arXiv:2503.18253

Country:

Europe > Austria > Vienna (0.14)
Asia > Thailand > Bangkok > Bangkok (0.05)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)
(8 more...)

Genre: Research Report (0.64)

Industry: Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.54)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.30)

Evaluating the Capabilities of Large Language Models for Multi-label Emotion Understanding

Belay, Tadesse Destaw, Azime, Israel Abebe, Ayele, Abinew Ali, Sidorov, Grigori, Klakow, Dietrich, Slusallek, Philipp, Kolesnikova, Olga, Yimam, Seid Muhie

arXiv.org Artificial Intelligence, Jan-3-2025

Large Language Models (LLMs) show promising learning and reasoning abilities. Compared to other NLP tasks, multilingual and multi-label emotion evaluation tasks are under-explored in LLMs. In this paper, we present EthioEmo, a multi-label emotion classification dataset for four Ethiopian languages, namely, Amharic (amh), Afan Oromo (orm), Somali (som), and Tigrinya (tir). We perform extensive experiments with an additional English multi-label emotion dataset from SemEval 2018 Task 1. Our evaluation includes encoder-only, encoder-decoder, and decoder-only language models. We compare zero-shot and few-shot approaches of LLMs to fine-tuning smaller language models. The results show that the accuracy of multi-label emotion classification is still insufficient even for high-resource languages such as English, and there is a large gap between the performance of high-resource and low-resource languages. The results also show varying performance levels depending on the language and model type. EthioEmo is available publicly to further improve the understanding of emotions in language models and how people convey emotions through various languages.

large language model, machine learning, natural language, (20 more...)

arXiv:2412.17837

Country:

Europe (1.00)
Africa (1.00)
North America > United States (0.93)
Asia > Middle East > UAE (0.28)

Genre: Research Report > New Finding (0.34)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.96)

EthioLLM: Multilingual Large Language Models for Ethiopian Languages with Task Evaluation

Tonja, Atnafu Lambebo, Azime, Israel Abebe, Belay, Tadesse Destaw, Yigezu, Mesay Gemeda, Mehamed, Moges Ahmed, Ayele, Abinew Ali, Jibril, Ebrahim Chekol, Woldeyohannis, Michael Melese, Kolesnikova, Olga, Slusallek, Philipp, Klakow, Dietrich, Xiong, Shengwu, Yimam, Seid Muhie

arXiv.org Artificial Intelligence, Jun-23-2024

Large language models (LLMs) have gained popularity recently due to their outstanding performance in various downstream Natural Language Processing (NLP) tasks. However, low-resource languages are still lagging behind current state-of-the-art (SOTA) developments in the field of NLP due to insufficient resources to train LLMs. Ethiopian languages exhibit remarkable linguistic diversity, encompassing a wide array of scripts, and are imbued with profound religious and cultural significance. This paper introduces EthioLLM -- multilingual large language models for five Ethiopian languages (Amharic, Ge'ez, Afan Oromo, Somali, and Tigrinya) and English, and Ethiobenchmark -- a new benchmark dataset for various downstream NLP tasks. We evaluate the performance of these models across five downstream NLP tasks. We open-source our multilingual language models, new benchmark datasets for various downstream tasks, and task-specific fine-tuned language models and discuss the performance of the models. Our dataset and models are available at the https://huggingface.co/EthioNLP repository.

dataset, ethiopian language, language model, (13 more...)

arXiv:2403.13737

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Asia > Middle East > Israel (0.04)
North America > Mexico (0.04)
(16 more...)

Genre: Research Report (1.00)

Industry: Media > News (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

EthioMT: Parallel Corpus for Low-resource Ethiopian Languages

Tonja, Atnafu Lambebo, Kolesnikova, Olga, Gelbukh, Alexander, Kalita, Jugal

arXiv.org Artificial Intelligence, Mar-28-2024

Recent research in natural language processing (NLP) has achieved impressive performance in tasks such as machine translation (MT), news classification, and question-answering in high-resource languages. However, the performance of MT leaves much to be desired for low-resource languages. This is due to the smaller size of available parallel corpora in these languages, if such corpora are available at all. NLP in Ethiopian languages suffers from the same issues due to the unavailability of publicly accessible datasets for NLP tasks, including MT. To help the research community and foster research for Ethiopian languages, we introduce EthioMT -- a new parallel corpus for 15 languages. We also create a new benchmark by collecting a dataset for better-researched languages in Ethiopia. We evaluate the newly collected corpus and the benchmark dataset for 23 Ethiopian languages using transformer and fine-tuning approaches.

ethiopian language, machine translation, translation, (14 more...)

arXiv:2403.19365

Country:

Africa > Kenya (0.05)
Africa > Eritrea (0.05)
Africa > Middle East > Somalia (0.05)
(22 more...)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Natural Language Processing in Ethiopian Languages: Current State, Challenges, and Opportunities

Tonja, Atnafu Lambebo, Belay, Tadesse Destaw, Azime, Israel Abebe, Ayele, Abinew Ali, Mehamed, Moges Ahmed, Kolesnikova, Olga, Yimam, Seid Muhie

arXiv.org Artificial Intelligence, Mar-25-2023

This survey delves into the current state of natural language processing (NLP) for four Ethiopian languages: Amharic, Afaan Oromo, Tigrinya, and Wolaytta. Through this paper, we identify key challenges and opportunities for NLP research in Ethiopia. Furthermore, we provide a centralized repository on GitHub that contains publicly available resources for various NLP tasks in these languages. This repository can be updated periodically with contributions from other researchers. Our objective is to identify research gaps and disseminate the information to NLP researchers interested in Ethiopian languages and encourage future research in this domain.

ethiopian language, machine learning, natural language, (20 more...)

arXiv:2303.14406

Country:

Asia > Middle East > Israel (0.04)
Africa > Ethiopia > Addis Ababa > Addis Ababa (0.04)
Africa > Ethiopia > Southern Nations, Nationalities, and Peoples' Region > Hawassa (0.04)
(14 more...)

Genre:

Research Report (1.00)
Overview (1.00)

Industry: Media > News (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
