social message
Towards Effective, Efficient and Unsupervised Social Event Detection in the Hyperbolic Space
Yu, Xiaoyan, Wei, Yifan, Zhou, Shuaishuai, Yang, Zhiwei, Sun, Li, Peng, Hao, Zhu, Liehuang, Yu, Philip S.
The vast, complex, and dynamic nature of social message data has posed challenges to social event detection (SED). Despite considerable effort, these challenges persist, often resulting in inadequately expressive message representations (ineffective) and prolonged learning durations (inefficient). In response to the challenges, this work introduces an unsupervised framework, HyperSED (Hyperbolic SED). Specifically, the proposed framework first models social messages into semantic-based message anchors, and then leverages the structure of the anchor graph and the expressiveness of the hyperbolic space to acquire structure- and geometry-aware anchor representations. Finally, HyperSED builds the partitioning tree of the anchor message graph by incorporating differentiable structural information as the reflection of the detected events. Extensive experiments on public datasets demonstrate HyperSED's competitive performance, along with a substantial improvement in efficiency compared to the current state-of-the-art unsupervised paradigm. Statistically, HyperSED boosts incremental SED by an average of 2%, 2%, and 25% in NMI, AMI, and ARI, respectively; enhancing efficiency by up to 37.41 times and at least 12.10 times, illustrating the advancement of the proposed framework. Our code is publicly available at https://github.com/XiaoyanWork/HyperSED.
Unsupervised Social Event Detection via Hybrid Graph Contrastive Learning and Reinforced Incremental Clustering
Guo, Yuanyuan, Zang, Zehua, Gao, Hang, Xu, Xiao, Wang, Rui, Liu, Lixiang, Li, Jiangmeng
Detecting events from social media data streams is gradually attracting researchers. The innate challenge for detecting events is to extract discriminative information from social media data thereby assigning the data into different events. Due to the excessive diversity and high updating frequency of social data, using supervised approaches to detect events from social messages is hardly achieved. To this end, recent works explore learning discriminative information from social messages by leveraging graph contrastive learning (GCL) and embedding clustering in an unsupervised manner. However, two intrinsic issues exist in benchmark methods: conventional GCL can only roughly explore partial attributes, thereby insufficiently learning the discriminative information of social messages; for benchmark methods, the learned embeddings are clustered in the latent space by taking advantage of certain specific prior knowledge, which conflicts with the principle of unsupervised learning paradigm. In this paper, we propose a novel unsupervised social media event detection method via hybrid graph contrastive learning and reinforced incremental clustering (HCRC), which uses hybrid graph contrastive learning to comprehensively learn semantic and structural discriminative information from social messages and reinforced incremental clustering to perform efficient clustering in a solidly unsupervised manner. We conduct comprehensive experiments to evaluate HCRC on the Twitter and Maven datasets. The experimental results demonstrate that our approach yields consistent significant performance boosts. In traditional incremental setting, semi-supervised incremental setting and solidly unsupervised setting, the model performance has achieved maximum improvements of 53%, 45%, and 37%, respectively.
Streaming Social Event Detection and Evolution Discovery in Heterogeneous Information Networks
Peng, Hao, Li, Jianxin, Song, Yangqiu, Yang, Renyu, Ranjan, Rajiv, Yu, Philip S., He, Lifang
Events are happening in real-world and real-time, which can be planned and organized for occasions, such as social gatherings, festival celebrations, influential meetings or sports activities. Social media platforms generate a lot of real-time text information regarding public events with different topics. However, mining social events is challenging because events typically exhibit heterogeneous texture and metadata are often ambiguous. In this paper, we first design a novel event-based meta-schema to characterize the semantic relatedness of social events and then build an event-based heterogeneous information network (HIN) integrating information from external knowledge base. Second, we propose a novel Pairwise Popularity Graph Convolutional Network, named as PP-GCN, based on weighted meta-path instance similarity and textual semantic representation as inputs, to perform fine-grained social event categorization and learn the optimal weights of meta-paths in different tasks. Third, we propose a streaming social event detection and evolution discovery framework for HINs based on meta-path similarity search, historical information about meta-paths, and heterogeneous DBSCAN clustering method. Comprehensive experiments on real-world streaming social text data are conducted to compare various social event detection and evolution discovery algorithms. Experimental results demonstrate that our proposed framework outperforms other alternative social event detection and evolution discovery techniques.
Detecting Cybertrolls using deep learning
Learn to build cybertrolls detection engine with CNN, keras, Glove and popular programming language Python.NEW by Evergreen Technologies What you'll learn Detect cybertroll in social messages using CNN, Glove embeddings and Keras Description Course Description Learn to build cybertrolls detection engine with CNN, keras, Glove and popular programming language Python. Understanding of cybertrolls classification Understand the world of world embeddings Learn CNN from scratch Leverage CNN, Keras, Glove to classify cybertrolls in social messages Learn how to represent text as numeric vectors using glove embeddings Learn how to evaluate model from scratch User Jupyter Notebook for programming Build a real life web application to classify social messages A Powerful Skill at Your Fingertips Learning the fundamentals of text classification puts a powerful and very useful tool at your fingertips. Python and Jupyter are free, easy to learn, has excellent documentation. No prior knowledge of deep learning or Machine learning is assumed. I a, covering topics like CNN, Word Embeddings Precision, Recall in depth so that even beginners can understand this course very well.
Dynamic time warping distance for message propagation classification in Twitter
Jendoubi, Siwar, Martin, Arnaud, Liétard, Ludovic, Yaghlane, Boutheina Ben, Hadji, Hend Ben
Social messages classification is a research domain that has attracted the attention of many researchers in these last years. Indeed, the social message is different from ordinary text because it has some special characteristics like its shortness. Then the development of new approaches for the processing of the social message is now essential to make its classification more efficient. In this paper, we are mainly interested in the classification of social messages based on their spreading on online social networks (OSN). We proposed a new distance metric based on the Dynamic Time Warping distance and we use it with the probabilistic and the evidential k Nearest Neighbors (k-NN) classifiers to classify propagation networks (PrNets) of messages. The propagation network is a directed acyclic graph (DAG) that is used to record propagation traces of the message, the traversed links and their types. We tested the proposed metric with the chosen k-NN classifiers on real world propagation traces that were collected from Twitter social network and we got good classification accuracies.