BTPD: A Multilingual Hand-curated Dataset of Bengali Transnational Political Discourse Across Online Communities
Das, Dipto, Ahmed, Syed Ishtiaque, Guha, Shion
–arXiv.org Artificial Intelligence
Understanding political discourse in online spaces is crucial for analyzing public opinion and ideological polarization. While social computing and computational linguistics have explored such discussions in English, such research efforts are significantly limited in major yet under-resourced languages like Bengali due to the unavailability of datasets. In this paper, we present a multilingual dataset of Bengali transnational political discourse (BTPD) collected from three online platforms, each representing distinct community structures and interaction dynamics. Besides describing how we hand-curated the dataset through community-informed keyword-based retrieval, this paper also provides a general overview of its topics and multilingual content.
arXiv.org Artificial Intelligence
Jun-10-2025
- Country:
- Africa > Nigeria (0.04)
- Asia
- Bangladesh > Dhaka Division
- Dhaka District > Dhaka (0.05)
- China (0.05)
- India
- Tripura (0.04)
- West Bengal > Kolkata (0.05)
- Indonesia > Bali (0.04)
- Pakistan (0.04)
- Russia (0.04)
- Vietnam (0.04)
- Bangladesh > Dhaka Division
- Europe
- Russia (0.04)
- Spain > Aragón (0.04)
- Ukraine (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- North America
- Canada > Ontario
- Toronto (0.14)
- Costa Rica > Heredia Province
- Heredia (0.04)
- United States
- District of Columbia > Washington (0.04)
- Massachusetts (0.04)
- Canada > Ontario
- South America > Brazil (0.04)
- Genre:
- Research Report (0.50)
- Industry:
- Government
- Regional Government (0.68)
- Voting & Elections (0.47)
- Information Technology > Services (0.46)
- Media > News (0.49)
- Government
- Technology: