Media
Cross-Lingual Propagation for Deep Sentiment Analysis
Dong, Xin (Rutgers University) | Melo, Gerard de (Rutgers University)
For many languages and domains, there is a paucity of available Given such valuable data, modern deep learning-based sentiment data and resources. In some cases, it may be challenging analysis methods excel at determining the sentiment to obtain sufficient in-domain training data, both because polarity of what is being said about companies, products, etc. there may be less data available online and because it may be (Wang et al. 2015). Unfortunately, such deep methods require somewhat harder to find annotators. Hence, a question that substantial amounts of training data, because multiple levels arises is whether one can assist deep networks by incorporating of computation, each with additional weights and parameters, external cues that enable the model to generalize better.
Incorporating Discriminator in Sentence Generation: a Gibbs Sampling Method
Su, Jinyue (Fudan University) | Xu, Jiacheng (Fudan University) | Qiu, Xipeng (Fudan University) | Huang, Xuanjing (Fudan University)
Generating plausible and fluent sentence with desired properties has long been a challenge. Most of the recent works use recurrent neural networks (RNNs) and their variants to predict following words given previous sequence and target label. In this paper, we propose a novel framework to generate constrained sentences via Gibbs Sampling. The candidate sentences are revised and updated iteratively, with sampled new words replacing old ones. Our experiments show the effectiveness of the proposed method to generate plausible and diverse sentences.
A Novel Embedding Method for News Diffusion Prediction
Liu, Ruoran (Institute of Automation, Chinese Academy of Sciences, Beijing) | Li, Qiudan (Institute of Automation, Chinese Academy of Sciences, Beijing) | Wang, Can (Institute of Automation, Chinese Academy of Sciences, Beijing) | Wang, Lei (Institute of Automation, Chinese Academy of Sciences, Beijing) | Zeng, Daniel Dajun (Institute of Automation, Chinese Academy of Sciences, Beijing)
News diffusion prediction aims to predict a sequence of news sites which will quote a particular piece of news. Most of previous propagation models make efforts to estimate propagation probabilities along observed links and ignore the characteristics of news diffusion processes, and they fail to capture the implicit relationships between news sites. In this paper, we propose an algorithm to model the news diffusion processes in a continuous space and take the attributes of news into account. Experiments performed on a real-world news dataset show that our model can take advantage of news’ attributes and predict news diffusion accurately.
Explainable Cross-Domain Recommendations Through Relational Learning
Sopchoke, Sirawit (Osaka University) | Fukui, Ken-ichi (The Institute of Scientific and Industrial Research, Osaka University) | Numao, Masayuki (The Institute of Scientific and Industrial Research, Osaka University)
We propose a method to generate explainable recommendation rules on cross-domain problems. Our two main contributions are: i) using relational learning to generate the rules which are able to explain clearly why the items were recommended to the particular user, ii) using the user's preferences of items on different domains and item attributes to generate novel or unexpected recommendations for the user. To illustrate that our method is indeed feasible and applicable, we conducted experiments on music and movie domains.
Sequential Decision Making in Artificial Musical Intelligence
Liebman, Elad (The University of Texas at Austin)
My main research motivation is to develop complete autonomous agents that interact with people socially. For an agent to be social with respect to humans, it needs to be able to parse and process the multitude of aspects that comprise the human cultural experience. That in itself gives rise to many fascinating learning problems. I am interested in tackling these fundamental problems from an empirical as well as a theoretical perspective. Music, as a general target domain, serves as an excellent testbed for these research ideas. Musical skills---playing music (alone or in a group), analyzing music or composing it---all involve extremely advanced knowledge representation and problem solving tools. Creating "musical agents"---agents that can interact richly with people in the music domain---is a challenge that holds the potential of advancing social agents research, and contributing important and broadly applicable AI knowledge. This belief is fueled not just by my background in computer science and artificial intelligence, but also by my deep passion for music as well as my extensive musical training. One key aspect of musical intelligence which hasn’t been sufficiently studied is that of sequential decision-making. My thesis strives to answer the following question: How can a sequential decision making perspective guide us in the creation of better music agents, and social agents in general? More specifically, this thesis focuses on two aspects of musical intelligence: music recommendation and multiagent interaction in the context of music.
Aida: Intelligent Image Analysis to Automatically Detect Poems in Digital Archives of Historic Newspapers
Soh, Leen-Kiat (University of Nebraska, Lincoln) | Lorang, Elizabeth (University of Nebraska, Lincoln) | Liu, Yi (University of Nebraska, Lincoln)
We describe an intelligent image analysis approach to automatically detect poems in digitally archived historic newspapers. Our application, Image Analysis for Archival Discovery, or Aida, integrates computer vision to capture visual cues based on visual structures of poetic works—instead of the meaning or content—and machine learning to train an artificial neural network to determine whether an image has poetic text. We have tested our application on almost 17,000 image snippets and obtained promising accuracies, precision, and recall. The application is currently being deployed at two institutions for digital library and literary research.
TipMaster: A Knowledge Base of Authoritative Local News Sources on Social Media
Shuai, Xin (Thomson Reuters) | Liu, Xiaomo (Thomson Reuters) | Nourbakhsh, Armineh (Thomson Reuters) | Shah, Sameena (Thomson Reuters) | Curtis, Tonya (Thomson Reuters)
Twitter has become an important online source for real-time news dissemination. Especially, official accounts of local government and media outlets have provided newsworthy and authoritative information, revealing local trends and breaking news. In this paper, we describe TipMaster an automatically constructed knowledge base of Twitter accounts that are likely to report local news, from government agencies to local media outlets. First, we implement classifiers for detecting these accounts by integrating heterogeneous information from the accounts' textual metadata, profile images, and their tweet messages. Next, we demonstrate two use cases for TipMaster: 1) as a platform that monitors real-time social media messages for local breaking news, and 2) as an authoritative source for verifying nascent rumors. Experimental results show that our account classification algorithms achieve both high precision and recall (around 90%). The demonstrated case studies prove that our platform is able to detect local breaking news or debunk emergent rumors faster than mainstream media sources.
Computer-Assisted Authoring for Natural Language Story Scripts
Sanghrajka, Rushit (Disney Research) | Witoń, Wojciech (Disney Research) | Schriber, Sasha (Disney Research) | Gross, Markus (Disney Research) | Kapadia, Mubbasir (Rutgers University, Disney Research)
In order to assist scriptwriters during the process of story-writing, we have developed a system that can extract information from natural language stories, and allow for story-centric as well as character-centric reasoning. These inferencing capabilities are exposed to the user through intuitive querying systems, allowing the scriptwriter to ask the system questions about story and character information. We introduce knowledge bytes as atoms of information and demonstrate that the system can parse text into a stream of knowledge bytes and use these mentioned reasoning capabilities through logical reasoning.
A Deep Ranking Model for Spatio-Temporal Highlight Detection From a 360◦ Video
Yu, Youngjae (Seoul National University Vision and Learning Lab) | Lee, Sangho (Seoul National University Vision and Learning Lab) | Na, Joonil (Seoul National University Vision and Learning Lab) | Kang, Jaeyun (Seoul National University) | Kim, Gunhee (Seoul National University Vision and Learning Lab)
We address the problem of highlight detection from a 360◦ video by summarizing it both spatially and temporally. Given a long 360◦ video, we spatially select pleasantly-looking normal field-of-view (NFOV) segments from unlimited field of views (FOV) of the 360◦ video, and temporally summarize it into a concise and informative highlight as a selected subset of subshots. We propose a novel deep ranking model named as Composition View Score (CVS) model, which produces a spherical score map of composition per video segment, and determines which view is suitable for highlight via a sliding window kernel at inference. To evaluate the proposed framework, we perform experiments on the Pano2Vid benchmark dataset (Su, Jayaraman, and Grauman 2016) and our newly collected 360◦ video highlight dataset from YouTube and Vimeo. Through evaluation using both quantitative summarization metrics and user studies via Amazon Mechanical Turk, we demonstrate that our approach outperforms several state-of-the-art highlight detection methods.We also show that our model is 16 times faster at inference than AutoCam (Su, Jayaraman, and Grauman 2016), which is one of the first summarization algorithms of 360◦ videos.
Understanding Image Impressiveness Inspired by Instantaneous Human Perceptual Cues
Yang, Jufeng (Nankai University) | Sun, Yan (Nankai University) | Liang, Jie (Nankai University) | Yang, Yong-Liang (University of Bath) | Cheng, Ming-Ming (Nankai University)
With the explosion of visual information nowadays, millions of digital images are available to the users. How to efficiently explore a large set of images and retrieve useful information thus becomes extremely important. Unfortunately only some of the images can impress the user at first glance. Others that make little sense in human perception are often discarded, while still costing valuable time and space. Therefore, it is significant to identify these two kinds of images for relieving the load of online repositories and accelerating information retrieval process. However, most of the existing image properties, e.g., memorability and popularity, are based on repeated human interactions, which limit the research and application of evaluating image quality in terms of instantaneous impression. In this paper, we propose a novel image property, called impressiveness, that measures how images impress people with a short-term contact. This is based on an impression-driven model inspired by a number of important human perceptual cues. To achieve this, we first collect three datasets in various domains, which are labeled according to the instantaneous sensation of the annotators. Then we investigate the impressiveness property via six established human perceptual cues as well as the corresponding features from pixel to semantic levels. Sequentially, we verify the consistency of the impressiveness which can be quantitatively measured by multiple visual representations, and evaluate their latent relationships. Finally, we apply the proposed impressiveness property to rank the images for an efficient image recommendation system.