COVID-19 on YouTube: A Data-Driven Analysis of Sentiment, Toxicity, and Content Recommendations
–arXiv.org Artificial Intelligence
This study presents a data-driven analysis of COVID-19 discourse on YouTube, examining the sentiment, toxicity, and thematic patterns of video content published between January 2023 and October 2024. The analysis involved applying advanced natural language processing (NLP) techniques: sentiment analysis with VADER, toxicity detection with Detoxify, and topic modeling using Latent Dirichlet Allocation (LDA). The sentiment analysis revealed that 49.32% of video descriptions were positive, 36.63% were neutral, and 14.05% were negative, indicating a generally informative and supportive tone in pandemic-related content. Toxicity analysis identified only 0.91% of content as toxic, suggesting minimal exposure to toxic content. Topic modeling revealed two main themes, with 66.74% of the videos covering general health information and pandemic-related impacts and 33.26% focused on news and real-time updates, highlighting the dual informational role of YouTube. A recommendation system was also developed using TF-IDF vectorization and cosine similarity, refined by sentiment, toxicity, and topic filters to ensure relevant and context-aligned video recommendations. This system achieved 69% aggregate coverage, with monthly coverage rates consistently above 85%, demonstrating robust performance and adaptability over time. Evaluation across recommendation sizes showed coverage reaching 69% for five video recommendations and 79% for ten video recommendations per video. In summary, this work presents a framework for understanding COVID-19 discourse on YouTube and a recommendation system that supports user engagement while promoting responsible and relevant content related to COVID-19.
arXiv.org Artificial Intelligence
Dec-22-2024
- Country:
- Asia
- Pakistan (0.04)
- Nepal (0.04)
- Vietnam (0.04)
- Japan (0.04)
- Indonesia (0.04)
- Middle East
- Jordan (0.04)
- Republic of Türkiye (0.04)
- Philippines (0.04)
- Russia (0.04)
- China (0.04)
- South Korea (0.04)
- Taiwan (0.04)
- Thailand (0.04)
- India (0.05)
- Europe
- France (0.04)
- Germany (0.04)
- Russia (0.04)
- Switzerland > Basel-City
- Basel (0.04)
- United Kingdom (0.04)
- North America
- Canada (0.28)
- Mexico (0.04)
- United States
- Georgia > Fulton County
- Atlanta (0.04)
- South Dakota > Pennington County
- Rapid City (0.04)
- Georgia > Fulton County
- Oceania > New Zealand (0.04)
- South America > Brazil (0.05)
- Asia
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Technology: