Deep Learning
Deep Learning Quadcopter Control via Risk-Aware Active Learning
Andersson, Olov (Linköping University) | Wzorek, Mariusz (Linköping University) | Doherty, Patrick (Linköping University)
Modern optimization-based approaches to control increasingly allow automatic generation of complex behavior from only a model and an objective. Recent years has seen growing interest in fast solvers to also allow real-time operation on robots, but the computational cost of such trajectory optimization remains prohibitive for many applications. In this paper we examine a novel deep neural network approximation and validate it on a safe navigation problem with a real nano-quadcopter. As the risk of costly failures is a major concern with real robots, we propose a risk-aware resampling technique. Contrary to prior work this active learning approach is easy to use with existing solvers for trajectory optimization, as well as deep learning. We demonstrate the efficacy of the approach on a difficult collision avoidance problem with non-cooperative moving obstacles. Our findings indicate that the resulting neural network approximations are least 50 times faster than the trajectory optimizer while still satisfying the safety requirements. We demonstrate the potential of the approach by implementing a synthesized deep neural network policy on the nano-quadcopter microcontroller.
Community-Based Question Answering via Asymmetric Multi-Faceted Ranking Network Learning
Zhao, Zhou (Zhejiang University) | Lu, Hanqing (Zhejiang University) | Zheng, Vincent W. (Advanced Digital Sciences Center) | Cai, Deng (Zhejiang University) | He, Xiaofei (Zhejiang University) | Zhuang, Yueting (Zhejiang University)
Nowadays the community-based question answering (CQA) sites become the popular Internet-based web service, which have accumulated millions of questions and their posted answers over time. Thus, question answering becomes an essential problem in CQA sites, which ranks the high-quality answers to the given question. Currently, most of the existing works study the problem of question answering based on the deep semantic matching model to rank the answers based on their semantic relevance, while ignoring the authority of answerers to the given question. In this paper, we consider the problem of community-based question answering from the viewpoint of asymmetric multi-faceted ranking network embedding. We propose a novel asymmetric multi-faceted ranking network learning framework for community-based question answering by jointly exploiting the deep semantic relevance between question-answer pairs and the answerers' authority to the given question. We then develop an asymmetric ranking network learning method with deep recurrent neural networks by integrating both answers' relative quality rank to the given question and the answerers' following relations in CQA sites. The extensive experiments on a large-scale dataset from a real world CQA site show that our method achieves better performance than other state-of-the-art solutions to the problem.
Word Embedding Based Correlation Model for Question/Answer Matching
Shen, Yikang (Beihang University) | Rong, Wenge (Beihang University) | Jiang, Nan (Beihang University) | Peng, Baolin (The Chinese University of Hong Kong) | Tang, Jie (Tsinghua University) | Xiong, Zhang (Beihang University)
The large scale of Q&A archives accumulated in community based question answering (CQA) servivces are important information and knowledge resource on the web. Question and answer matching task has been attached much importance to for its ability to reuse knowledge stored in these systems: it can be useful in enhancing user experience with recurrent questions. In this paper, a Word Embedding based Correlation (WEC) model is proposed by integrating advantages of both the translation model and word embedding. Given a random pair of words, WEC can score their co-occurrence probability in Q&A pairs, while it can also leverage the continuity and smoothness of continuous space word representation to deal with new pairs of words that are rare in the training parallel text. An experimental study on Yahoo! Answers dataset and Baidu Zhidao dataset shows this new method's promising potential.
Salience Estimation via Variational Auto-Encoders for Multi-Document Summarization
Li, Piji (The Chinese University of Hong Kong) | Wang, Zihao (The Chinese University of Hong Kong) | Lam, Wai (The Chinese University of Hong Kong) | Ren, Zhaochun (University College London) | Bing, Lidong (AI Platform Department, Tencent Inc.)
We propose a new unsupervised sentence salience framework for Multi-Document Summarization (MDS), which can be divided into two components: latent semantic modeling and salience estimation. For latent semantic modeling, a neural generative model called Variational Auto-Encoders (VAEs) is employed to describe the observed sentences and the corresponding latent semantic representations. Neural variational inference is used for the posterior inference of the latent variables. For salience estimation, we propose an unsupervised data reconstruction framework, which jointly considers the reconstruction for latent semantic space and observed term vector space. Therefore, we can capture the salience of sentences from these two different and complementary vector spaces. Thereafter, the VAEs-based latent semantic model is integrated into the sentence salience estimation component in a unified fashion, and the whole framework can be trained jointly by back-propagation via multi-task learning. Experimental results on the benchmark datasets DUC and TAC show that our framework achieves better performance than the state-of-the-art models.
Improving Event Causality Recognition with Multiple Background Knowledge Sources Using Multi-Column Convolutional Neural Networks
Kruengkrai, Canasai (National Institute of Information and Communications Technology) | Torisawa, Kentaro (National Institute of Information and Communications Technology) | Hashimoto, Chikara (National Institute of Information and Communications Technology) | Kloetzer, Julien (National Institute of Information and Communications Technology) | Oh, Jong-Hoon (National Institute of Information and Communications Technology) | Tanaka, Masahiro (National Institute of Information and Communications Technology)
We propose a method for recognizing such event causalities as "smoke cigarettes" → "die of lung cancer" using background knowledge taken from web texts as well as original sentences from which candidates for the causalities were extracted. We retrieve texts related to our event causality candidates from four billion web pages by three distinct methods, including a why-question answering system, and feed them to our multi-column convolutional neural networks. This allows us to identify the useful background knowledge scattered in web texts and effectively exploit the identified knowledge to recognize event causalities. We empirically show that the combination of our neural network architecture and background knowledge significantly improves average precision, while the previous state-of-the-art method gains just a small benefit from such background knowledge.
What Happens Next? Future Subevent Prediction Using Contextual Hierarchical LSTM
Hu, Linmei (Tsinghua University) | Li, Juanzi (Tsinghua University) | Nie, Liqiang (Shandong University) | Li, Xiao-Li (A*STAR) | Shao, Chao (Tsinghua University)
Events are typically composed of a sequence of subevents. Predicting a future subevent of an event is of great importance for many real-world applications. Most previous work on event prediction relied on hand-crafted features and can only predict events that already exist in the training data. In this paper, we develop an end-to-end model which directly takes the texts describing previous subevents as input and automatically generates a short text describing a possible future subevent. Our model captures the two-level sequential structure of a subevent sequence, namely, the word sequence for each subevent and the temporal order of subevents. In addition, our model incorporates the topics of the past subevents to make context-aware prediction of future subevents. Extensive experiments on a real-world dataset demonstrate the superiority of our model over several state-of-the-art methods.
Recurrent Neural Networks with Auxiliary Labels for Cross-Domain Opinion Target Extraction
Ding, Ying (Singapore Management University) | Yu, Jianfei (Singapore Management University) | Jiang, Jing (Singapore Management University)
Opinion target extraction is a fundamental task in opinion mining. In recent years, neural network based supervised learning methods have achieved competitive performance on this task. However, as with any supervised learning method, neural network based methods for this task cannot work well when the training data comes from a different domain than the test data. On the other hand, some rule-based unsupervised methods have shown to be robust when applied to different domains. In this work, we use rule-based unsupervised methods to create auxiliary labels and use neural network models to learn a hidden representation that works well for different domains. When this hidden representation is used for opinion target extraction, we find that it can outperform a number of strong baselines with a large margin.
Automatic Emphatic Information Extraction from Aligned Acoustic Data and Its Application on Sentence Compression
Chen, Yanju (Sun Yat-sen University) | Pan, Rong (Sun Yat-sen University)
In this paper we address the following question: can useful Specific words can be prosodically emphasized in an utterance emphatic information be automatically extracted from by a speaker in order to draw attentions on them, the prevailing acoustic data without any manual feature extraction which can be modeled by pitch accents of words (Bolinger and be used to help improve the performance of natural 1958). Also referred as prosodic prominence, pitch accent language processing tasks such as sentence compression? is found to emphasize several semantic information in an utterance While sentence compression requires the models of a such as uncertainty, contrast, turn-taking cues and so good comprehension of the semantic context and the exact on, whose changes in an utterance can be perceived by listeners intention of the input sentence, we believe the supervision and thus convey certain kinds of emphasis (Terken of additional emphatic data can be a boost to the later, and 1991). The detection of prosodic prominence shows improvements the LSTM structures dealing with the former, which will be on different tasks, such as Text-to-Speech synthesis supported by our evidence. Meanwhile, with the Speech-To- and spoken language summarization. With most of Text alignment techniques, we present a faster approach to the detections of prosodic prominence are done by using automatically extract approximate emphatic patterns from acoustic features (acoustic durations and intensities, extremity aligned acoustic data, thus lowering the cost of manual of fundamental frequency minima and maxima), there feature extraction in emphatic words detection and prediction are also works investigating predictions of emphatic words and providing weak supervision as an auxiliary task using only lexical features (Brenier, Cer, and Jurafsky 2005; to improve sentence compression performance using LSTM Brenier 2008), which shows promising results and potential structures.
Mechanism-Aware Neural Machine for Dialogue Response Generation
Zhou, Ganbin (Institute of Computing Technology, Chinese Academy of Sciences) | Luo, Ping (Institute of Computing Technology, Chinese Academy of Sciences) | Cao, Rongyu (Institute of Computing Technology, Chinese Academy of Sciences) | Lin, Fen (Tencent) | Chen, Bo (Tencent) | He, Qing (Institute of Computing Technology, Chinese Academy of Sciences)
To the same utterance, people's responses in everyday dialogue may be diverse largely in terms of content semantics, speaking styles, communication intentions and so on. Previous generative conversational models ignore these 1-to-n relationships between a post to its diverse responses, and tend to return high-frequency but meaningless responses. In this study we propose a mechanism-aware neural machine for dialogue response generation. It assumes that there exists some latent responding mechanisms, each of which can generate different responses for a single input post. With this assumption we model different responding mechanisms as latent embeddings, and develop a encoder-diverter-decoder framework to train its modules in an end-to-end fashion. With the learned latent mechanisms, for the first time these decomposed modules can be used to encode the input into mechanism-aware context, and decode the responses with the controlled generation styles and topics. Finally, the experiments with human judgements, intuitive examples, detailed discussions demonstrate the quality and diversity of the generated responses with 9.80% increase of acceptable ratio over the best of six baseline methods.
Active Discriminative Text Representation Learning
Zhang, Ye (University of Texas at Austin) | Lease, Matthew (University of Texas at Austin) | Wallace, Byron C. (Northeastern University)
We propose a new active learning (AL) method for text classification with convolutional neural networks (CNNs). In AL, one selects the instances to be manually labeled with the aim of maximizing model performance with minimal effort. Neural models capitalize on word embeddings as representations (features), tuning these to the task at hand. We argue that AL strategies for multi-layered neural models should focus on selecting instances that most affect the embedding space (i.e., induce discriminative word representations). This is in contrast to traditional AL approaches (e.g., entropy-based uncertainty sampling), which specify higher level objectives. We propose a simple approach for sentence classification that selects instances containing words whose embeddings are likely to be updated with the greatest magnitude, thereby rapidly learning discriminative, task-specific embeddings. We extend this approach to document classification by jointly considering: (1) the expected changes to the constituent word representations; and (2) the model’s current overall uncertainty regarding the instance. The relative emphasis placed on these criteria is governed by a stochastic process that favors selecting instances likely to improve representations at the outset of learning, and then shifts toward general uncertainty sampling as AL progresses. Empirical results show that our method outperforms baseline AL approaches on both sentence and document classification tasks. We also show that, as expected, the method quickly learns discriminative word embeddings. To the best of our knowledge, this is the first work on AL addressing neural models for text classification.