fake account
Graph-based Fake Account Detection: A Survey
Dehkordi, Ali Safarpoor, Zehmakan, Ahad N.
In recent years, there has been a growing effort to develop effective and efficient algorithms for fake account detection in online social networks. This survey comprehensively reviews existing methods, with a focus on graph-based techniques that utilise topological features of social graphs (in addition to account information, such as their shared contents and profile data) to distinguish between fake and real accounts. We provide several categorisations of these methods (for example, based on techniques used, input data, and detection time), discuss their strengths and limitations, and explain how these methods connect in the broader context. We also investigate the available datasets, including both real-world data and synthesised models. We conclude the paper by proposing several potential avenues for future research.
Uncovering the Dark Side of Telegram: Fakes, Clones, Scams, and Conspiracy Movements
La Morgia, Massimo, Mei, Alessandro, Mongardini, Alberto Maria, Wu, Jie
Telegram is one of the most used instant messaging apps worldwide. Some of its success lies in providing high privacy protection and social network features like the channels -- virtual rooms in which only the admins can post and broadcast messages to all its subscribers. However, these same features contributed to the emergence of borderline activities and, as is common with Online Social Networks, the heavy presence of fake accounts. Telegram started to address these issues by introducing the verified and scam marks for the channels. Unfortunately, the problem is far from being solved. In this work, we perform a large-scale analysis of Telegram by collecting 35,382 different channels and over 130,000,000 messages. We study the channels that Telegram marks as verified or scam, highlighting analogies and differences. Then, we move to the unmarked channels. Here, we find some of the infamous activities also present on privacy-preserving services of the Dark Web, such as carding, sharing of illegal adult and copyright protected content. In addition, we identify and analyze two other types of channels: the clones and the fakes. Clones are channels that publish the exact content of another channel to gain subscribers and promote services. Instead, fakes are channels that attempt to impersonate celebrities or well-known services. Fakes are hard to identify even by the most advanced users. To detect the fake channels automatically, we propose a machine learning model that is able to identify them with an accuracy of 86%. Lastly, we study Sabmyk, a conspiracy theory that exploited fakes and clones to spread quickly on the platform reaching over 1,000,000 users.
A Systematic Review of Machine Learning Approaches for Detecting Deceptive Activities on Social Media: Methods, Challenges, and Biases
Liu, Yunchong, Shen, Xiaorui, Zhang, Yeyubei, Wang, Zhongyan, Tian, Yexin, Dai, Jianglai, Cao, Yuchen
Social media platforms like Twitter, Facebook, and Instagram have facilitated the spread of misinformation, necessitating automated detection systems. This systematic review evaluates 36 studies that apply machine learning (ML) and deep learning (DL) models to detect fake news, spam, and fake accounts on social media. Using the Prediction model Risk Of Bias ASsessment Tool (PROBAST), the review identified key biases across the ML lifecycle: selection bias due to non-representative sampling, inadequate handling of class imbalance, insufficient linguistic preprocessing (e.g., negations), and inconsistent hyperparameter tuning. Although models such as Support Vector Machines (SVM), Random Forests, and Long Short-Term Memory (LSTM) networks showed strong potential, over-reliance on accuracy as an evaluation metric in imbalanced data settings was a common flaw. The review highlights the need for improved data preprocessing (e.g., resampling techniques), consistent hyperparameter tuning, and the use of appropriate metrics like precision, recall, F1 score, and AUROC. Addressing these limitations can lead to more reliable and generalizable ML/DL models for detecting deceptive content, ultimately contributing to the reduction of misinformation on social media.
OpenAI Says Russia, China, and Israel Are Using Its Tools for Foreign Influence Campaigns
OpenAI identified and removed five covert influence operations based in Russia, China, Iran and Israel that were using its artificial intelligence tools to manipulate public opinion, the company said on Thursday. In a new report, OpenAI detailed how these groups, some of which are linked to known propaganda campaigns, used the company's tools for a variety of "deceptive activities." These included generating social media comments, articles, and images in multiple languages, creating names and biographies for fake accounts, debugging code, and translating and proofreading texts. These networks focused on a range of issues, including defending the war in Gaza and Russia's invasion of Ukraine, criticizing Chinese dissidents, and commenting on politics in India, Europe, and the U.S. in their attempts to sway public opinion. While these influence operations targeted a wide range of online platforms, including X (formerly known as Twitter), Telegram, Facebook, Medium, Blogspot, and other sites, "none managed to engage a substantial audience" according to OpenAI analysts.
Preemptive Detection of Fake Accounts on Social Networks via Multi-Class Preferential Attachment Classifiers
Breuer, Adam, Khosravani, Nazanin, Tingley, Michael, Cottel, Bradford
In this paper, we describe a new algorithm called Preferential Attachment k-class Classifier (PreAttacK) for detecting fake accounts in a social network. Recently, several algorithms have obtained high accuracy on this problem. However, they have done so by relying on information about fake accounts' friendships or the content they share with others--the very things we seek to prevent. PreAttacK represents a significant departure from these approaches. We provide some of the first detailed distributional analyses of how new fake (and real) accounts first attempt to request friends after joining a major network (Facebook). We show that even before a new account has made friends or shared content, these initial friend request behaviors evoke a natural multi-class extension of the canonical Preferential Attachment model of social network growth. We use this model to derive a new algorithm, PreAttacK. We prove that in relevant problem instances, PreAttacK near-optimally approximates the posterior probability that a new account is fake under this multi-class Preferential Attachment model of new accounts' (not-yet-answered) friend requests. These are the first provable guarantees for fake account detection that apply to new users, and that do not require strong homophily assumptions. This principled approach also makes PreAttacK the only algorithm with provable guarantees that obtains state-of-the-art performance on new users on the global Facebook network, where it converges to AUC=0.9 after new users send + receive a total of just 20 not-yet-answered friend requests. For comparison, state-of-the-art benchmarks do not obtain this AUC even after observing additional data on new users' first 100 friend requests. Thus, unlike mainstream algorithms, PreAttacK converges before the median new fake account has made a single friendship (accepted friend request) with a human.
The Looming Threat of Fake and LLM-generated LinkedIn Profiles: Challenges and Opportunities for Detection and Prevention
Ayoobi, Navid, Shahriar, Sadat, Mukherjee, Arjun
In this paper, we present a novel method for detecting fake and Large Language Model (LLM)-generated profiles in the LinkedIn Online Social Network immediately upon registration and before establishing connections. Early fake profile identification is crucial to maintaining the platform's integrity since it prevents imposters from acquiring the private and sensitive information of legitimate users and from gaining an opportunity to increase their credibility for future phishing and scamming activities. This work uses textual information provided in LinkedIn profiles and introduces the Section and Subsection Tag Embedding (SSTE) method to enhance the discriminative characteristics of these data for distinguishing between legitimate profiles and those created by imposters manually or by using an LLM. Additionally, the dearth of a large publicly available LinkedIn dataset motivated us to collect 3600 LinkedIn profiles for our research. We will release our dataset publicly for research purposes. This is, to the best of our knowledge, the first large publicly available LinkedIn dataset for fake LinkedIn account detection. Within our paradigm, we assess static and contextualized word embeddings, including GloVe, Flair, BERT, and RoBERTa. We show that the suggested method can distinguish between legitimate and fake profiles with an accuracy of about 95% across all word embeddings. In addition, we show that SSTE has a promising accuracy for identifying LLM-generated profiles, despite the fact that no LLM-generated profiles were employed during the training phase, and can achieve an accuracy of approximately 90% when only 20 LLM-generated profiles are added to the training set. It is a significant finding since the proliferation of several LLMs in the near future makes it extremely challenging to design a single system that can identify profiles created with various LLMs.
Hinge is adding video identity verification to combat fake accounts
Starting next month, dating app Hinge will begin rolling out a new profile verification feature to combat a surge in fake accounts. Dubbed "Selfie Verification," the feature will prompt users to upload a video of themselves, which the app, with a combination of machine learning and human oversight, will use to confirm they look like the person pictured in their profile. People who complete the process will get a "Verified" badge on their dating profile. Hinge parent company Match Group told Wired, the first publication to report on the feature, that Selfie Verification would roll out to all users by December. "As romance scammers find new ways to defraud people, we are committed to investing in new updates and technologies that prevent harm to our daters," Hinge spokesperson Jarryd Boyd told the outlet.
Synthetic media: How AI-generated characters spread disinformation
In the last few years, many strategies and tactics have been used to generate and spread online misinformation. But a recent approach that taps into the power of artificial intelligence to create photos with high accuracy of fictitious personas that purport to be journalists or field experts poses a serious and novel threat to our society. The AI-generated characters fall under a broad umbrella called synthetic media that relies on a technique called generative adversarial network (GAN), in which two networks compete to create photos that are cross-checked to determine whether they are realistic or not. Many websites and applications are now available to generate these photos without the need of any technical background, and they are incredibly convincing. In regard to disinformation campaigns, AI-generated characters have been utilized in three main ways.
Pro-China Propaganda Act Used Fake Followers Made With AI-Generated Images
A pro-China propaganda campaign that's been bashing the US on social media created fake followers with the help of AI-generated images. Since June, the campaign has been posting English-language videos critical of the Trump administration on Facebook, Twitter, and YouTube, according to research company Graphika, which has been tracking the group's activities. Graphika dubs the campaign "Spamouflage Dragon." And like other propaganda activities, the pro-China group uses fake accounts to share and post comments on its content to help it gain wider circulation. However, Graphika noticed something odd with the profile photos belonging to these fake accounts: In some cases, the headshots appear to be the work of an AI program designed to create artificial human faces. At first glance, the profile photos look legitimate.
The new AI tools spreading fake news in politics and business
When Camille François, a longstanding expert on disinformation, sent an email to her team late last year, many were perplexed. Her message began by raising some seemingly valid concerns: that online disinformation -- the deliberate spreading of false narratives typically designed to sow mayhem -- "could get out of control and become a huge threat to democratic norms". But the text from the chief innovation officer at social media intelligence group Graphika soon became rather more wacky. Disinformation, it read, is the "grey goo of the internet", a reference to a nightmarish, end-of-the world scenario in molecular nanotechnology. The solution the email proposed was to make a "holographic holographic hologram". The bizarre email was not actually written by François, but by computer code; she had created the message -- from her basement -- using text-generating artificial intelligence technology.