Using LLMs to Infer Non-Binary COVID-19 Sentiments of Chinese Micro-bloggers
Hu, Jerry Chongyi, Modi, Mohammed Shahid, Szymanski, Boleslaw K.
Studying public sentiment during crises is crucial for understanding how opinions and sentiments shift, resulting in polarized societies. We study Weibo, the most popular microblogging site in China, using posts made during the outbreak of the COVID-19 crisis. The study period includes the pre-COVID-19 stage, the outbreak stage, and the early stage of epidemic prevention. We use Llama 3 8B, a Large Language Model, to analyze users' sentiments on the platform by classifying them into positive, negative, sarcastic, and neutral categories. Analyzing sentiment shifts on Weibo provides insights into how social events and government actions influence public opinion. This study contributes to understanding the dynamics of social sentiments during health crises, fulfilling a gap in sentiment analysis for Chinese platforms. By examining these dynamics, we aim to offer valuable perspectives on digital communication's role in shaping society's responses during unprecedented global challenges.
- Asia > South Korea (0.14)
- Asia > China > Hubei Province > Wuhan (0.04)
- North America > United States > New York > Rensselaer County > Troy (0.04)
- (3 more...)
- Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
- Health & Medicine > Therapeutic Area > Immunology (1.00)
- Health & Medicine > Epidemiology (1.00)
When does MAML Work the Best? An Empirical Study on Model-Agnostic Meta-Learning in NLP Applications
Liu, Zequn, Zhang, Ruiyi, Song, Yiping, Ju, Wei, Zhang, Ming
Model-Agnostic Meta-Learning (MAML), a model-agnostic meta-learning method, is successfully employed in NLP applications including few-shot text classification and multi-domain low-resource language generation. Many impacting factors, including data quantity, similarity among tasks, and the balance between general language model and task-specific adaptation, can affect the performance of MAML in NLP, but few works have thoroughly studied them. In this paper, we conduct an empirical study to investigate these impacting factors and conclude when MAML works the best based on the experimental results.
MCFEND: A Multi-source Benchmark Dataset for Chinese Fake News Detection
Li, Yupeng, He, Haorui, Bai, Jin, Wen, Dacheng
The prevalence of fake news across various online sources has had a significant influence on the public. Existing Chinese fake news detection datasets are limited to news sourced solely from Weibo. However, fake news originating from multiple sources exhibits diversity in various aspects, including its content and social context. Methods trained on purely one single news source can hardly be applicable to real-world scenarios. Our pilot experiment demonstrates that the F1 score of the state-of-the-art method that learns from a large Chinese fake news detection dataset, Weibo-21, drops significantly from 0.943 to 0.470 when the test data is changed to multi-source news data, failing to identify more than one-third of the multi-source fake news. To address this limitation, we constructed the first multi-source benchmark dataset for Chinese fake news detection, termed MCFEND, which is composed of news we collected from diverse sources such as social platforms, messaging apps, and traditional online news outlets. Notably, such news has been fact-checked by 14 authoritative fact-checking agencies worldwide. In addition, various existing Chinese fake news detection methods are thoroughly evaluated on our proposed dataset in cross-source, multi-source, and unseen source ways. MCFEND, as a benchmark dataset, aims to advance Chinese fake news detection approaches in real-world scenarios.
- Asia > China > Hong Kong (0.05)
- Asia > Singapore > Central Region > Singapore (0.05)
- Asia > Taiwan (0.04)
- (5 more...)
Language-based Valence and Arousal Expressions between the United States and China: a Cross-Cultural Examination
Cho, Young-Min, Pang, Dandan, Thapa, Stuti, Sherman, Garrick, Ungar, Lyle, Tay, Louis, Guntuku, Sharath Chandra
Although affective expressions of individuals have been extensively studied using social media, research has primarily focused on the Western context. There are substantial differences among cultures that contribute to their affective expressions. This paper examines the differences between Twitter (X) in the United States and Sina Weibo posts in China on two primary dimensions of affect - valence and arousal. We study the difference in the functional relationship between arousal and valence (so-called V-shaped) among individuals in the US and China and explore the associated content differences. Furthermore, we correlate word usage and topics in both platforms to interpret their differences. We observe that for Twitter users, the variation in emotional intensity is less distinct between negative and positive emotions compared to Weibo users, and there is a sharper escalation in arousal corresponding with heightened emotions. From language features, we discover that affective expressions are associated with personal life and feelings on Twitter, while on Weibo such discussions are about socio-political topics in the society. These results suggest a West-East difference in the V-shaped relationship between valence and arousal of affective expressions on social media influenced by content differences. Our findings have implications for applications and theories related to cultural differences in affective expressions.
- Asia > China (0.81)
- North America > United States > Pennsylvania (0.04)
- North America > Canada > Quebec > Montreal (0.04)
- (2 more...)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (0.94)
- Health & Medicine (0.93)
- Information Technology > Services (0.89)
KTRL+F: Knowledge-Augmented In-Document Search
Oh, Hanseok, Shin, Haebin, Ko, Miyoung, Lee, Hyunji, Seo, Minjoon
We introduce a new problem KTRL+F, a knowledge-augmented in-document search task that necessitates real-time identification of all semantic targets within a document with the awareness of external sources through a single natural query. This task addresses following unique challenges for in-document search: 1) utilizing knowledge outside the document for extended use of additional information about targets to bridge the semantic gap between the query and the targets, and 2) balancing between real-time applicability with the performance. We analyze various baselines in KTRL+F and find there are limitations of existing models, such as hallucinations, low latency, or difficulties in leveraging external knowledge. Therefore we propose a Knowledge-Augmented Phrase Retrieval model that shows a promising balance between speed and performance by simply augmenting external knowledge embedding in phrase embedding. Additionally, we conduct a user study to verify whether solving KTRL+F can enhance search experience of users. It demonstrates that even with our simple model users can reduce the time for searching with less queries and reduced extra visits to other sources for collecting evidence. We encourage the research community to work on KTRL+F to enhance more efficient in-document information access.
- North America > United States > California > Santa Clara County > San Jose (0.14)
- Asia > China (0.08)
- North America > United States > New Jersey > Monmouth County (0.04)
- (8 more...)
- Leisure & Entertainment > Sports > Soccer (1.00)
- Law Enforcement & Public Safety (1.00)
- Information Technology > Services (0.73)
How Chinese influencers use AI digital clones of themselves to pump out content
His followers were suitably wowed – until some started to question if such a feat was humanly possible. The small print on the video stream confirmed their suspicions: "For display purposes only, not a real person." Many of Chen's fans were outraged, and he reportedly lost more than 7,000 followers between 24 and 26 September. Even the legal community weighed in. Quoted in Chinese media reports, Dong Yuanyuan, a senior partner at Tiantai, a Beijing law firm, said that AI avatars could not be "completely untied from the celebrity himself" and that "virtual live broadcasts … do not exempt celebrities from legal liability".
Explicit Time Embedding Based Cascade Attention Network for Information Popularity Prediction
Sun, Xigang, Zhou, Jingya, Liu, Ling, Wei, Wenqi
Predicting information cascade popularity is a fundamental problem in social networks. Capturing temporal attributes and cascade role information (e.g., cascade graphs and cascade sequences) is necessary for understanding the information cascade. Current methods rarely focus on unifying this information for popularity predictions, which prevents them from effectively modeling the full properties of cascades to achieve satisfactory prediction performances. In this paper, we propose an explicit Time embedding based Cascade Attention Network (TCAN) as a novel popularity prediction architecture for large-scale information networks. TCAN integrates temporal attributes (i.e., periodicity, linearity, and non-linear scaling) into node features via a general time embedding approach (TE), and then employs a cascade graph attention encoder (CGAT) and a cascade sequence attention encoder (CSAT) to fully learn the representation of cascade graphs and cascade sequences. We use two real-world datasets (i.e., Weibo and APS) with tens of thousands of cascade samples to validate our methods. Experimental results show that TCAN obtains mean logarithm squared errors of 2.007 and 1.201 and running times of 1.76 hours and 0.15 hours on both datasets, respectively. Furthermore, TCAN outperforms other representative baselines by 10.4%, 3.8%, and 10.4% in terms of MSLE, MAE, and R-squared on average while maintaining good interpretability.
- North America > United States (0.14)
- Asia > China > Jiangsu Province (0.04)
Interpretable Multimodal Misinformation Detection with Logic Reasoning
Liu, Hui, Wang, Wenya, Li, Haoliang
Multimodal misinformation on online social platforms is becoming a critical concern due to increasing credibility and easier dissemination brought by multimedia content, compared to traditional text-only information. While existing multimodal detection approaches have achieved high performance, the lack of interpretability hinders these systems' reliability and practical deployment. Inspired by NeuralSymbolic AI which combines the learning ability of neural networks with the explainability of symbolic learning, we propose a novel logic-based neural model for multimodal misinformation detection which integrates interpretable logic clauses to express the reasoning process of the target task. To make learning effective, we parameterize symbolic logical elements using neural representations, which facilitate the automatic generation and evaluation of meaningful logic clauses. Additionally, to make our framework generalizable across diverse misinformation sources, we introduce five meta-predicates that can be instantiated with different correlations. Results on three public datasets (Twitter, Weibo, and Sarcasm) demonstrate the feasibility and versatility of our model.
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- Asia > China > Hong Kong (0.04)
- North America > United States > California > San Francisco County > San Francisco (0.04)
- (21 more...)
- Information Technology > Communications > Social Media (1.00)
- Information Technology > Artificial Intelligence > Natural Language (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (0.90)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Russia Praises Chinese Drone Maker On Weibo; Deletes Post Following Protest
The Russian Embassy in Beijing has pulled down a controversial post on Weibo, praising China's top drone maker DJI for its products, which allegedly helped Kremlin in "modern warfare." The post went missing after DJI rebuffed the claim, stating its drones were not meant for military specifications. The embassy's post on Weibo cited a report from Russian state media Sputnik, based on a new book by Army General Yuri Baluyevsky, the former Chief of the General Staff of the Armed Forces of the Russian Federation, according to South China Morning Post. In his book, Baluyevsky said Chinese commercial drones have brought "a real revolution" to traditional artillery weapons. "When drones hover over a target area to guide the artillery, its pinpoint accuracy and efficiency are comparable to precision-guided missiles, according to the Russian embassy's Weibo post quoting Baluyevsky. "The Mavic quadcopter drone made by China's DJI has become a true symbol of modern warfare," he said. However, the post soon snowballed into a major controversy, as many netizens called out Russian Embassy for uploading something with "malicious intent." "What do you want by saying this?
- Government > Regional Government > Europe Government > Russia Government (0.73)
- Government > Regional Government > Asia Government > Russia Government (0.73)
China roundup: Alibaba's sexual assault scandal and more delayed IPOs – TechCrunch
Hello and welcome back to TechCrunch's China roundup, a digest of recent events shaping the Chinese tech landscape and what they mean to people in the rest of the world. A sexual assault case at Alibaba has sparked a new round of #MeToo reckoning in China. Industry observers believe this is a watershed moment for the fight against China's allegedly misogynist tech industry. Meanwhile, social media operators are still undecided on how to deal with the unprecedented public uproar against the powerful internet giant. In other news, more Chinese tech companies have delayed plans to go public overseas after Didi's fallout with Chinese regulators over its rushed IPO, including Tencent's music streaming empire and one of China's highest-valued autonomous driving startups.
- Asia > China > Beijing > Beijing (0.07)
- North America > United States > New York (0.05)
- Asia > China > Hong Kong (0.05)
- Law > Criminal Law (1.00)
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
- Information Technology (1.00)
- Health & Medicine > Therapeutic Area > Psychiatry/Psychology > Mental Health (1.00)