Institute of Computing Technology, Chinese Academy of Science
Locally Adaptive Translation for Knowledge Graph Embedding
Jia, Yantao (Institute of Computing Technology, Chinese Academy of Science) | Wang, Yuanzhuo (Institute of Computing Technology, Chinese Academy of Science) | Lin, Hailun (Institute of Information Engineering, Chinese Academy of Science) | Jin, Xiaolong (Institute of Computing Technology, Chinese Academy of Science) | Cheng, Xueqi (Institute of Computing Technology, Chinese Academy of Science)
Knowledge graph embedding aims to represent entities and relations in a large-scale knowledge graph as elements in a continuous vector space. Existing methods, e.g., TransE and TransH, learn embedding representation by defining a global margin-based loss function over the data. However, the optimal loss function is determined during experiments whose parameters are examined among a closed set of candidates. Moreover, embeddings over two knowledge graphs with different entities and relations share the same set of candidate loss functions, ignoring the locality of both graphs. This leads to the limited performance of embedding related applications. In this paper, we propose a locally adaptive translation method for knowledge graph embedding, called TransA, to find the optimal loss function by adaptively determining its margin over different knowledge graphs. Experiments on two benchmark data sets demonstrate the superiority of the proposed method, as compared to the-state-of-the-art ones.
A Probabilistic Model for Bursty Topic Discovery in Microblogs
Yan, Xiaohui (Institute of Computing Technology, Chinese Academy of Science) | Guo, Jiafeng (Institute of Computing Technology, Chinese Academy of Science) | Lan, Yanyan (Institute of Computing Technology, Chinese Academy of Science) | Xu, Jun (Institute of Computing Technology, Chinese Academy of Science) | Cheng, Xueqi (Institute of Computing Technology, Chinese Academy of Science)
Bursty topics discovery in microblogs is important for people to grasp essential and valuable information. However, the task is challenging since microblog posts are particularly short and noisy. This work develops a novel probabilistic model, namely Bursty Biterm Topic Model (BBTM), to deal with the task. BBTM extends the Biterm Topic Model (BTM) by incorporating the burstiness of biterms as prior knowledge for bursty topic modeling, which enjoys the following merits: 1) It can well solve the data sparsity problem in topic modeling over short texts as the same as BTM; 2) It can automatical discover high quality bursty topics in microblogs in a principled and efficient way. Extensive experiments on a standard Twitter dataset show that our approach outperforms the state-of-the-art baselines significantly.