AITopics | Gelbukh, Alexander

Gelbukh, Alexander

Comparative Approaches to Sentiment Analysis Using Datasets in Major European and Arabic Languages

Krasitskii, Mikhail, Kolesnikova, Olga, Hernandez, Liliana Chanona, Sidorov, Grigori, Gelbukh, Alexander

arXiv.org Artificial Intelligence, Jan-21-2025

This study explores transformer-based models such as BERT, mBERT, and XLM-R for multilingual sentiment analysis across diverse linguistic structures. Key contributions include the identification of XLM-R's superior adaptability in morphologically complex languages, achieving accuracy levels above 88%. The work highlights fine-tuning strategies and emphasizes their significance for improving sentiment classification in underrepresented languages.

machine learning, natural language, xlm-r, (18 more...)

doi: 10.5121/csit.2024.150112

2501.1254

Country: North America > Mexico (0.14)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.99)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.50)

PRKAN: Parameter-Reduced Kolmogorov-Arnold Networks

Ta, Hoang-Thang, Thai, Duy-Quy, Tran, Anh, Sidorov, Grigori, Gelbukh, Alexander

arXiv.org Artificial Intelligence, Jan-12-2025

MLPs have been one of the key components in modern neural network architectures for years. Their simplicity makes them widely used for capturing complex relationships through multiple layers of non-linear transformations. However, their role has been reconsidered recently with the revival of Kolmogorov-Arnold Networks (KANs) [1, 2]. In these papers, fixed activation functions used in MLPs are described as "nodes," and the authors proposed replacing them with learnable activation functions like B-splines, referred to as "edges," to improve performance in mathematical and physical examples. To address Hilbert's 13th problem [3], the Kolmogorov-Arnold Representation Theorem (KART) [4] was introduced. It posits that any continuous function involving multiple variables can be decomposed into a sum of continuous functions of single variables, thus inspiring the creation of KANs. The work of Liu et al. [1] on KANs has inspired numerous studies exploring the use of various basis and polynomial functions as replacements for B-splines [5, 6, 7, 8, 9, 10, 11, 12, 13], investigating the model's performance compared to MLPs. Several studies have shown that KANs do not always outperform MLPs when using the same training parameters [14, 15]. Moreover, while KANs achieve better performance than MLPs with the same network structure, they often require a significantly larger number of parameters [7, 16, 17, 18, 19].
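
The decomposition that KART describes is usually written as below. This is the standard textbook statement rather than a formula quoted from the paper; the symbols $n$, $\Phi_q$ and $\phi_{q,p}$ are conventional notation assumed here.

```latex
% Kolmogorov-Arnold representation of a continuous f : [0,1]^n -> R.
% \Phi_q (outer) and \phi_{q,p} (inner) are continuous single-variable functions.
f(x_1, \dots, x_n) \;=\; \sum_{q=0}^{2n} \Phi_q\!\left( \sum_{p=1}^{n} \phi_{q,p}(x_p) \right)
```

Read this way, the "sum of single-variable functions" involves one level of composition: the inner functions act on individual inputs, and the outer functions act on their sums.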

artificial intelligence, machine learning, normalization, (15 more...)

2501.07032

Country: Asia > Vietnam (0.28)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Synthetic Time Series Data Generation for Healthcare Applications: A PCG Case Study

Jamshidi, Ainaz, Arif, Muhammad, Kalhoro, Sabir Ali, Gelbukh, Alexander

arXiv.org Artificial Intelligence, Dec-17-2024

The generation of high-quality medical time series data is essential for advancing healthcare diagnostics and safeguarding patient privacy. Specifically, synthesizing realistic phonocardiogram (PCG) signals offers significant potential as a cost-effective and efficient tool for cardiac disease pre-screening. Despite this potential, the synthesis of PCG signals for this specific application has received limited attention in research. In this study, we employ and compare three state-of-the-art generative models from different categories - WaveNet, DoppelGANger, and DiffWave - to generate high-quality PCG data. We use data from the George B. Moody PhysioNet Challenge 2022. Our methods are evaluated using various metrics widely used in the previous literature in the domain of time series data generation, such as mean absolute error and maximum mean discrepancy. Our results demonstrate that the generated PCG data closely resemble the original datasets, indicating the effectiveness of our generative models in producing realistic synthetic PCG data. In our future work, we plan to incorporate this method into a data augmentation pipeline to synthesize abnormal PCG signals with heart murmurs, in order to address the current scarcity of abnormal data. We hope to improve the robustness and accuracy of diagnostic tools in cardiology, enhancing their effectiveness in detecting heart murmurs.
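
The two metrics named in the abstract can be sketched in a few lines. This is a minimal illustration, not the authors' evaluation code: the function names, the Gaussian (RBF) kernel, the `bandwidth` parameter, and the biased MMD estimator are all assumptions chosen for simplicity.

```python
import math


def mean_absolute_error(real, synthetic):
    """Average absolute sample-wise difference between two equal-length signals."""
    if len(real) != len(synthetic):
        raise ValueError("signals must have the same length")
    return sum(abs(r - s) for r, s in zip(real, synthetic)) / len(real)


def rbf_kernel(x, y, bandwidth=1.0):
    """Gaussian kernel between two equal-length vectors (assumed kernel choice)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq_dist / (2.0 * bandwidth ** 2))


def mmd_squared(xs, ys, bandwidth=1.0):
    """Biased estimate of the squared maximum mean discrepancy between two sample sets."""
    def mean_kernel(a_set, b_set):
        total = sum(rbf_kernel(a, b, bandwidth) for a in a_set for b in b_set)
        return total / (len(a_set) * len(b_set))

    return mean_kernel(xs, xs) + mean_kernel(ys, ys) - 2.0 * mean_kernel(xs, ys)


# Hypothetical toy data: each inner list stands in for one short signal window.
real_windows = [[0.0, 0.1, 0.2], [0.1, 0.2, 0.3]]
synthetic_windows = [[0.0, 0.1, 0.25], [0.1, 0.15, 0.3]]
print(mean_absolute_error(real_windows[0], synthetic_windows[0]))
print(mmd_squared(real_windows, synthetic_windows))
```

A lower value on either metric means the synthetic windows sit closer to the real ones; the MMD estimate is zero when the two sample sets are identical.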

artificial intelligence, data mining, machine learning, (18 more...)

2412.16207

Country: North America > United States > Maryland (0.28)

Genre: Research Report > New Finding (1.00)

Industry:

Information Technology > Security & Privacy (0.68)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.68)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Social Support Detection from Social Media Texts

Ahani, Zahra, Tash, Moein Shahiki, Balouchzahi, Fazlourrahman, Ramos, Luis, Sidorov, Grigori, Gelbukh, Alexander

arXiv.org Artificial Intelligence, Nov-4-2024

Social support, conveyed through a multitude of interactions and platforms such as social media, plays a pivotal role in fostering a sense of belonging, aiding resilience in the face of challenges, and enhancing overall well-being. This paper introduces Social Support Detection (SSD) as a natural language processing (NLP) task aimed at identifying supportive interactions within online communities. The study presents the SSD task in three subtasks: two binary classification tasks and one multiclass task, with labels detailed in the dataset section. We conducted experiments on a dataset comprising 10,000 YouTube comments. Traditional machine learning models were employed, utilizing various feature combinations that encompass linguistic, psycholinguistic, emotional, and sentiment information. Additionally, we experimented with neural network-based models using various word embeddings to enhance the performance of our models across these subtasks. The results reveal a prevalence of group-oriented support in online dialogues, reflecting broader societal patterns. The findings demonstrate the effectiveness of integrating psycholinguistic, emotional, and sentiment features with n-grams in detecting social support and distinguishing whether it is directed toward an individual or a group. The best results for different subtasks across all experiments range from 0.72 to 0.82.

artificial intelligence, machine learning, natural language, (21 more...)

2411.0258

Country:

North America > Mexico (0.28)
Asia > Middle East (0.28)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Therapeutic Area > Psychiatry/Psychology > Mental Health (0.93)
Law (0.67)
Health & Medicine > Consumer Health (0.67)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Ethio-Fake: Cutting-Edge Approaches to Combat Fake News in Under-Resourced Languages Using Explainable AI

Yigezu, Mesay Gemeda, Mersha, Melkamu Abay, Bade, Girma Yohannis, Kalita, Jugal, Kolesnikova, Olga, Gelbukh, Alexander

arXiv.org Artificial Intelligence, Oct-3-2024

The proliferation of fake news has emerged as a significant threat to the integrity of information dissemination, particularly on social media platforms. Misinformation can spread quickly due to the ease of creating and disseminating content, affecting public opinion and sociopolitical events. Identifying false information is therefore essential to reducing its negative consequences and maintaining the reliability of online news sources. Traditional approaches to fake news detection often rely solely on content-based features, overlooking the crucial role of social context in shaping the perception and propagation of news articles. In this paper, we propose a comprehensive approach that integrates social context-based features with news content features to enhance the accuracy of fake news detection in under-resourced languages. We perform several experiments utilizing a variety of methodologies, including traditional machine learning, neural networks, ensemble learning, and transfer learning. Assessment of the outcomes of the experiments shows that the ensemble learning approach has the highest accuracy, achieving a 0.99 F1 score. Additionally, when compared with monolingual models, the fine-tuned model with the target language outperformed others, achieving a 0.94 F1 score. We analyze the functioning of the models, considering the important features that contribute to model performance, using explainable AI techniques.

artificial intelligence, machine learning, natural language, (18 more...)

2410.02609

Country:

North America > United States > Colorado (0.14)
North America > Mexico > Mexico City (0.14)

Genre:

Research Report > New Finding (0.69)
Research Report > Promising Solution (0.50)

Industry: Media > News (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.86)
Information Technology > Artificial Intelligence > Natural Language > Explanation & Argumentation (0.85)

ThangDLU at #SMM4H 2024: Encoder-decoder models for classifying text data on social disorders in children and adolescents

Ta, Hoang-Thang, Rahman, Abu Bakar Siddiqur, Najjar, Lotfollah, Gelbukh, Alexander

arXiv.org Artificial Intelligence, Apr-30-2024

This paper describes our participation in Task 3 and Task 5 of the #SMM4H (Social Media Mining for Health) 2024 Workshop, explicitly targeting the classification challenges within tweet data. Task 3 is a multi-class classification task centered on tweets discussing the impact of outdoor environments on symptoms of social anxiety. Task 5 involves a binary classification task focusing on tweets reporting medical disorders in children. We applied transfer learning from pre-trained encoder-decoder models such as BART-base and T5-small to identify the labels of a set of given tweets. We also presented some data augmentation methods to see their impact on the model performance. Finally, the systems obtained the best F1 score of 0.627 in Task 3 and the best F1 score of 0.841 in Task 5.

disorder, machine learning, natural language, (20 more...)

2404.19714

Country:

North America (0.15)
Asia > Thailand (0.14)

Genre: Research Report (0.83)

Industry:

Health & Medicine > Therapeutic Area > Neurology (0.73)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.49)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)

EthioMT: Parallel Corpus for Low-resource Ethiopian Languages

Tonja, Atnafu Lambebo, Kolesnikova, Olga, Gelbukh, Alexander, Kalita, Jugal

arXiv.org Artificial Intelligence, Mar-28-2024

Recent research in natural language processing (NLP) has achieved impressive performance in tasks such as machine translation (MT), news classification, and question-answering in high-resource languages. However, the performance of MT leaves much to be desired for low-resource languages. This is due to the smaller size of available parallel corpora in these languages, if such corpora are available at all. NLP in Ethiopian languages suffers from the same issues due to the unavailability of publicly accessible datasets for NLP tasks, including MT. To help the research community and foster research for Ethiopian languages, we introduce EthioMT -- a new parallel corpus for 15 languages. We also create a new benchmark by collecting a dataset for better-researched languages in Ethiopia. We evaluate the newly collected corpus and the benchmark dataset for 23 Ethiopian languages using transformer and fine-tuning approaches.

artificial intelligence, natural language, translation, (16 more...)

2403.19365

Country:

Africa > Middle East (0.47)
Africa > Ethiopia (0.41)
North America > United States > Colorado (0.15)
Africa > South Sudan > Greater Upper Nile > Greater Pibor Administrative Area (0.14)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Evaluating Embeddings for One-Shot Classification of Doctor-AI Consultations

Ojo, Olumide Ebenezer, Adebanji, Olaronke Oluwayemisi, Gelbukh, Alexander, Calvo, Hiram, Feldman, Anna

arXiv.org Artificial Intelligence, Feb-6-2024

Effective communication between healthcare providers and patients is crucial to providing high-quality patient care. In this work, we investigate how Doctor-written and AI-generated texts in healthcare consultations can be classified using state-of-the-art embeddings and one-shot classification systems. By analyzing embeddings such as bag-of-words, character n-grams, Word2Vec, GloVe, fastText, and GPT2 embeddings, we examine how well our one-shot classification systems capture semantic information within medical consultations. Results show that the embeddings are capable of capturing semantic features from text in a reliable and adaptable manner. Overall, Word2Vec, GloVe and Character n-grams embeddings performed well, indicating their suitability for modeling targeted to this task. GPT2 embedding also shows notable performance, indicating its suitability for models tailored to this task as well. Our machine learning architectures significantly improved the quality of health conversations when training data are scarce, improving communication between patients and healthcare providers.

large language model, machine learning, natural language, (20 more...)

2402.04442

Country: North America > Mexico > Mexico City (0.15)

Genre:

Research Report > New Finding (1.00)
Overview (1.00)

Industry:

Health & Medicine > Therapeutic Area (1.00)
Health & Medicine > Health Care Technology (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

GuReT: Distinguishing Guilt and Regret related Text

Butt, Sabur, Balouchzahi, Fazlourrahman, Meque, Abdul Gafar Manuel, Amjad, Maaz, Cancino, Hector G. Ceballos, Sidorov, Grigori, Gelbukh, Alexander

arXiv.org Artificial Intelligence, Jan-29-2024

The intricate relationship between human decision-making and emotions, particularly guilt and regret, has significant implications for behavior and well-being. Yet these emotions' subtle distinctions and interplay are often overlooked in computational models. This paper introduces a dataset tailored to dissect the relationship between guilt and regret and their unique textual markers, filling a notable gap in affective computing research. Our approach treats guilt and regret recognition as a binary classification task and employs three machine learning and six transformer-based deep learning techniques to benchmark the newly created dataset. The study further implements innovative reasoning methods like chain-of-thought and tree-of-thought to assess the models' interpretive logic. The results indicate a clear performance edge for transformer-based models, achieving a 90.4% macro F1 score compared to the 85.3% scored by the best machine learning classifier, demonstrating their superior capability in distinguishing complex emotional states.
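
For readers unfamiliar with the macro F1 score reported above, the sketch below shows how it is computed: F1 is calculated per class and then averaged with equal weight. The function name and the toy labels are hypothetical and are not drawn from the GuReT dataset.

```python
def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 scores over all labels seen in either list."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never appears.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Hypothetical gold labels and predictions for a guilt-vs-regret classifier.
gold = ["guilt", "guilt", "regret", "regret"]
predicted = ["guilt", "regret", "regret", "regret"]
print(macro_f1(gold, predicted))
```

Because each class counts equally, macro F1 penalizes a model that does well on the majority class but poorly on the minority one.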

large language model, machine learning, natural language, (18 more...)

2401.16541

Country: North America > United States > Texas (0.14)

Genre: Research Report (0.82)

Industry: Health & Medicine > Therapeutic Area (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Leveraging the power of transformers for guilt detection in text

Meque, Abdul Gafar Manuel, Angel, Jason, Sidorov, Grigori, Gelbukh, Alexander

arXiv.org Artificial Intelligence, Jan-14-2024

In recent years, language models and deep learning techniques have revolutionized natural language processing tasks, including emotion detection. However, the specific emotion of guilt has received limited attention in this field. In this research, we explore the applicability of three transformer-based language models for detecting guilt in text and compare their performance for general emotion detection and guilt detection. Our proposed model outperformed the BERT and RoBERTa models by two points and one point, respectively. Additionally, we analyze the challenges in developing accurate guilt-detection models and evaluate our model's effectiveness in detecting related emotions like "shame" through qualitative analysis of results.

detection, machine learning, natural language, (21 more...)

2401.07414

Country: North America > Mexico > Mexico City (0.14)

Genre: Research Report (0.64)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)
