A Unified Model for Unsupervised Opinion Spamming Detection Incorporating Text Generality

Xu, Yinqing (The Chinese University of Hong Kong) | Shi, Bei (The Chinese University of Hong Kong) | Tian, Wentao (The Chinese University of Hong Kong) | Lam, Wai (The Chinese University of Hong Kong)

Jul-15-2015–AAAI Conferences

Unlike other forms of spamming, it is difficult to collect a large amount of gold-standard labels for reviews Many existing methods on review spam detection by means of manual effort. Thus, most of these methods considering text content merely utilize simple text [Mukherjee et al., 2013; Li et al., 2013a; Sun et al., features such as content similarity. We explore a 2013] just rely on the ad-hoc or pseudo fake or non-fake novel idea of exploiting text generality for improving labels for model training, such as the labels annotated by spam detection. Besides, apart from the task the Amazon anonymous online workers [Ott et al., 2011; of review spam detection, although there have also Li et al., 2014]. On the other hand, some unsupervised been some works on identifying the review spammers methods have been proposed to detect the individual review (users) and the manipulated offerings (items), spammer [Mukherjee et al., 2013; Lim et al., 2010; no previous works have attempted to solve these Wang et al., 2011] and review spammer groups [Mukherjee et three tasks in a unified model. We have proposed al., 2012]. In addition, time series pattern [Xie et al., 2012], a unified probabilistic graphical model to detect rating distribution [Feng et al., 2012], reviewer graph [Wang et the suspicious review spams, the review spammers al., 2011], and reviewing burstiness [Fei et al., 2013] have also and the manipulated offerings in an unsupervised been applied to identify the review spams in an unsupervised manner.

abnormal feature, review spam, spamicity, (16 more...)

AAAI Conferences

Jul-15-2015

Conferences PDF

Add feedback

Country:
- North America > United States
  - New York (0.04)
  - Michigan (0.04)
  - Pennsylvania > Allegheny County
    - Pittsburgh (0.04)
  - New Jersey > Hudson County
    - Secaucus (0.04)
  - Illinois > Cook County
    - Chicago (0.04)
- Asia
  - China > Hong Kong (0.04)
  - Middle East > Jordan (0.04)

Genre:
- Research Report > Promising Solution (0.34)

Technology:
- Information Technology
  - Security & Privacy > Spam Filtering (1.00)
  - Artificial Intelligence > Machine Learning
    - Statistical Learning (0.68)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found