Country
GoodNewsEveryone: A Corpus of News Headlines Annotated with Emotions, Semantic Roles, and Reader Perception
Bostan, Laura, Kim, Evgeny, Klinger, Roman
Most research on emotion analysis from text focuses on the task of emotion classification or emotion intensity regression. Fewer works address emotions as structured phenomena, which can be explained by the lack of relevant datasets and methods. We fill this gap by releasing a dataset of 5000 English news headlines annotated via crowdsourcing with their dominant emotions, emotion experiencers and textual cues, emotion causes and targets, as well as the reader's perception and emotion of the headline. We propose a multiphase annotation procedure which leads to high quality annotations on such a task via crowdsourcing. Finally, we develop a baseline for the task of automatic prediction of structures and discuss results. The corpus we release enables further research on emotion classification, emotion intensity prediction, emotion cause detection, and supports further qualitative studies.
Strategic Abstention based on Preference Extensions: Positive Results and Computer-Generated Impossibilities
Brandl, Florian (Stanford University) | Brandt, Felix (Technical University of Munich) | Geist, Christian (Technical University of Munich) | Hofbauer, Johannes (Technical University of Munich)
Voting rules allow multiple agents to aggregate their preferences in order to reach joint decisions. A common flaw of some voting rules, known as the no-show paradox, is that agents may obtain a more preferred outcome by abstaining from an election. We study strategic abstention for set-valued voting rules based on Kelly's and Fishburn's preference extensions. Our contribution is twofold. First, we show that, whenever there are at least five alternatives and seven agents, every Pareto-optimal majoritarian voting rule suffers from the no-show paradox with respect to Fishburn's extension. This is achieved by reducing the statement to a finite - yet very large - problem, which is encoded as a formula in propositional logic and then shown to be unsatisfiable by a SAT solver. We also provide a human-readable proof which we extracted from a minimal unsatisfiable core of the formula. Secondly, we prove that every voting rule that satisfies two natural conditions cannot be manipulated by strategic abstention with respect to Kelly's extension and give examples of well-known Pareto-optimal majoritarian voting rules that meet these requirements.
An Attention-based Graph Neural Network for Heterogeneous Structural Learning
Hong, Huiting, Guo, Hantao, Lin, Yucheng, Yang, Xiaoqing, Li, Zang, Ye, Jieping
In this paper, we focus on graph representation learning of heterogeneous information network (HIN), in which various types of vertices are connected by various types of relations. Most of the existing methods conducted on HIN revise homogeneous graph embedding models via meta-paths to learn low-dimensional vector space of HIN. In this paper, we propose a novel Heterogeneous Graph Structural Attention Neural Network (HetSANN) to directly encode structural information of HIN without meta-path and achieve more informative representations. With this method, domain experts will not be needed to design meta-path schemes and the heterogeneous information can be processed automatically by our proposed model. Specifically, we implicitly represent heterogeneous information using the following two methods: 1) we model the transformation between heterogeneous vertices through a projection in low-dimensional entity spaces; 2) afterwards, we apply the graph neural network to aggregate multi-relational information of projected neighborhood by means of attention mechanism. We also present three extensions of HetSANN, i.e., voices-sharing product attention for the pairwise relationships in HIN, cycle-consistency loss to retain the transformation between heterogeneous entity spaces, and multi-task learning with full use of information. The experiments conducted on three public datasets demonstrate that our proposed models achieve significant and consistent improvements compared to state-of-the-art solutions.
Practical Solutions for Machine Learning Safety in Autonomous Vehicles
Mohseni, Sina, Pitale, Mandar, Singh, Vasu, Wang, Zhangyang
Autonomous vehicles rely on machine learning to solve challenging tasks in perception and motion planning. However, automotive software safety standards have not fully evolved to address the challenges of machine learning safety such as interpretability, verification, and performance limitations. In this paper, we review and organize practical machine learning safety techniques that can complement engineering safety for machine learning based software in autonomous vehicles. Our organization maps safety strategies to state-of-the-art machine learning techniques in order to enhance dependability and safety of machine learning algorithms. We also discuss security limitations and user experience aspects of machine learning components in autonomous vehicles.
Tensor entropy for uniform hypergraphs
Many real world complex systems can be analyzed through a graph/ network prospective. There are two classical and well-known classes of complex networks, scale-fr ee networks and small world networks, which play a significant role in many domains such as social networks, b iology, cognitive science and signal processing [1, 4, 27, 44]. The human genome is a beautiful example of complex dynamic graph. The genome-wide chromosomal conformation (Hi-C) map represents the spatia l proximity of different parts of genome capturing the genome structure over time [40, 42]. When studying s uch dynamic graphs, one is often required to identify the pattern/couple changes including degree distributio n, path lengths, clustering coefficients, etc, in the graph topology in order to capture the dynamics [25, 33, 41]. The von Neumann entropy of a graph, first introduced by Braunst ein et al. [8], is a spectral measure used in structural pattern recognition. The intuition behind this me asure is linking the graph Laplacian to density matrices from quantum mechanics, and measuring the comp lexity of the graphs in terms of the von Neumman entropy of the corresponding density matrices [32]. In ad dition, the measure can be viewed as the information theoretic Shannon entropy, i.e., S null
Features or Shape? Tackling the False Dichotomy of Time Series Classification
Alaee, Sara, Abdoli, Alireza, Shelton, Christian, Murillo, Amy C., Gerry, Alec C., Keogh, Eamonn
Time series classification is an important task in its own right, and it is often a precursor to further downstream analytics. To date, virtually all works in the literature have used either shape - based classification using a distance measure or feature - based classification after finding some suitable features for the do main . I t seems to be underappreciated that in many datasets it is the case that some classes are best discriminated with fea tures, while others are best discriminated with shape. Thus, making the shape vs. feature choice will condemn us to poor results, at least for some classes. In this work, we propose a new model for classifying time series that allows the use of both shape and feature - based measures, when warranted . Our algorithm automatically decides which approach is best for which class, and at query time chooses which classifier to trust the most. We evaluate our idea on real world datasets and demonstrate that our ideas produce statistically significant improvement in classification accuracy .
Group-Connected Multilayer Perceptron Networks
Kachuee, Mohammad, Darabi, Sajad, Fazeli, Shayan, Sarrafzadeh, Majid
Despite the success of deep learning in domains such as image, voice, and graphs, there has been little progress in deep representation learning for domains without a known structure between features. For instance, a tabular dataset of different demographic and clinical factors where the feature interactions are not given as a prior. In this paper, we propose Group-Connected Multilayer Perceptron (GMLP) networks to enable deep representation learning in these domains. GMLP is based on the idea of learning expressive feature combinations (groups) and exploiting them to reduce the network complexity by defining local group-wise operations. During the training phase, GMLP learns a sparse feature grouping matrix using temperature annealing softmax with an added entropy loss term to encourage the sparsity. Furthermore, an architecture is suggested which resembles binary trees, where group-wise operations are followed by pooling operations to combine information; reducing the number of groups as the network grows in depth. To evaluate the proposed method, we conducted experiments on five different real-world datasets covering various application areas. Additionally, we provide visualizations on MNIST and synthesized data. According to the results, GMLP is able to successfully learn and exploit expressive feature combinations and achieve state-of-the-art classification performance on different datasets.
Exploiting the potential of deep reinforcement learning for classification tasks in high-dimensional and unstructured data
Obando-Ceron, Johan S., Cano, Victor Romero, Toro, Walter Mayor
This paper presents a framework for efficiently learning feature selection policies which use less features to reach a high classification precision on large unstructured data. It uses a Deep Convolutional Autoencoder (DCAE) for learning compact feature spaces, in combination with recently-proposed Reinforcement Learning (RL) algorithms as Double DQN and Retrace.
Gaussian Process Latent Variable Model Factorization for Context-aware Recommender Systems
Context-aware recommender systems (CARS) have gained increasing attention due to their ability to utilize contextual information. Compared to traditional recommender systems, CARS are, in general, able to generate more accurate recommendations. Latent factors approach accounts for a large proportion of CARS. Recently, a nonlinear Gaussian Process (GP) based factorization method was proven to outperform the state-of-the-art methods in CARS. Despite its effectiveness, GP model-based methods can suffer from over-fitting and may not be able to determine the impact of each context automatically. In order to address such shortcomings, we propose a Gaussian Process Latent V ariable Model Factorization (GPL VMF) method, where we apply an appropriate prior to the original GP model. Our work is primarily inspired by the Gaussian Process Latent V ariable Model (GPL VM), which was a nonlinear dimensionality reduction method. As a result, we improve the performance on the real datasets significantly as well as capturing the importance of each context. In addition to the general advantages, our method provides two main contributions regarding recommender system settings: (1) addressing the influence of bias by setting a nonzero mean function, and (2) utilizing real-valued contexts by fixing the latent space with real values.
Graph Convolutional Networks: analysis, improvements and results
Ullah, Ihsan, Manzo, Mario, Shah, Mitul, Madden, Michael
In the current era of neural networks and big data, higher dimensional data is processed for automation of different application areas. Graphs represent a complex data organization in which dependencies between more than one object or activity occur. Due to the high dimensionality, this data creates challenges for machine learning algorithms. Graph convolutional networks were introduced to utilize the convolutional models concepts that shows good results. In this context, we enhanced two of the existing Graph convolutional network models by proposing four enhancements. These changes includes: hyper parameters optimization, convex combination of activation functions, topological information enrichment through clustering coefficients measure, and structural redesign of the network through addition of dense layers. We present extensive results on four state-of-art benchmark datasets. The performance is notable not only in terms of lesser computational cost compared to competitors, but also achieved competitive results for three of the datasets and state-of-the-art for the fourth dataset.