SoMeR: Multi-View User Representation Learning for Social Media
Guo, Siyi, Burghardt, Keith, Pantè, Valeria, Lerman, Kristina
arXiv.org Artificial Intelligence
User representation learning aims to capture user preferences, interests, and behaviors in low-dimensional vector representations. These representations have widespread applications in recommendation systems and advertising; however, existing methods typically rely on specific features such as text content, activity patterns, or platform metadata, and so fail to model user behavior holistically across modalities. To address this limitation, we propose SoMeR, a Social Media user Representation learning framework that incorporates temporal activities, text content, profile information, and network interactions to learn comprehensive user portraits. SoMeR encodes each user's post stream as a sequence of timestamped textual features, uses transformers to embed these sequences along with profile data, and trains jointly with link prediction and contrastive learning objectives to capture user similarity. We demonstrate SoMeR's versatility through two applications: (1) identifying inauthentic accounts involved in coordinated influence operations by detecting users who post similar content simultaneously, and (2) measuring increased polarization in online discussions after major events by quantifying how far users with different beliefs moved apart in the embedding space. SoMeR's ability to model users holistically enables new solutions to important problems around disinformation, societal tensions, and the understanding of online behavior.
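As a rough illustration of the second application, the following sketch shows one way that movement apart in an embedding space could be quantified. It is not the authors' implementation: the choice of cosine distance, the mean over cross-group pairs, and the function names `cosine_distance`, `mean_cross_group_distance`, and `polarization_shift` are all assumptions made for this example, and the toy vectors stand in for learned user embeddings.

```python
import math
from itertools import product


def cosine_distance(u, v):
    """Return 1 - cosine similarity of two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("cosine distance is undefined for a zero vector")
    return 1.0 - dot / (norm_u * norm_v)


def mean_cross_group_distance(group_a, group_b):
    """Average cosine distance over every (user in A, user in B) pair."""
    pairs = list(product(group_a, group_b))
    if not pairs:
        raise ValueError("both groups must contain at least one embedding")
    return sum(cosine_distance(u, v) for u, v in pairs) / len(pairs)


def polarization_shift(before_a, before_b, after_a, after_b):
    """Hypothetical polarization measure: change in mean cross-group distance.

    A positive value means the two groups sit farther apart after the event.
    """
    before = mean_cross_group_distance(before_a, before_b)
    after = mean_cross_group_distance(after_a, after_b)
    return after - before


if __name__ == "__main__":
    # Toy 2-D embeddings for two belief groups, before and after an event.
    before_a, before_b = [[1.0, 0.0]], [[1.0, 1.0]]
    after_a, after_b = [[1.0, 0.0]], [[0.0, 1.0]]
    shift = polarization_shift(before_a, before_b, after_a, after_b)
    print(f"shift in mean cross-group distance: {shift:.4f}")
```

In this toy case the groups start 45 degrees apart and end orthogonal, so the shift is positive. With real embeddings, the same comparison would need the before and after vectors to live in a shared space, which is an assumption of this sketch.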
May 2, 2024
- Country:
- North America > United States > California (0.14)
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Health & Medicine (0.47)
- Information Technology (0.47)
- Media > News (0.66)
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning
- Neural Networks > Deep Learning (0.68)
- Statistical Learning (1.00)
- Natural Language (1.00)
- Representation & Reasoning > Personal Assistant Systems (0.86)
- Communications > Social Media (1.00)
- Data Science > Data Mining (1.00)
- Information Management (1.00)