Information Extraction
Sentiment Analysis Exposed
I made it with Max last night! OMG! Welcome to womanhood!! How was it/he? And right about now, Mary's mom gets a'notification' on her cell phone that her daughter is texting sexual references, then displays Mary's texts with Shelly upon mom's request. Mom spends the rest of the day at work fuming, conjuring dialog with her daughter for later that evening when they'll be home together. Never did, and she'd told Mary not to see him.
Facebook Data Breach: API Security Risks
In the year 2018 Facebook disclosed a massive data breach due to which the company had to face a lawsuit along with allegations of not properly securing its user data. The breach directly affected the authentication tokens of nearly 30 million of its users which led to the filing of several class-action complaints in a San Francisco appeals court. In the wake of the incident, Facebook pledged to strengthen its security. A feature, known as "View As" which was employed by developers to render user pages was exploited by hackers to get access to user tokens. The theft of these tokens is associated with the advancement of a major API security risk, it also indicates how API risks can go unnoticed for such a long time frame.
Annotation of Emotion Carriers in Personal Narratives
Tammewar, Aniruddha, Cervone, Alessandra, Messner, Eva-Maria, Riccardi, Giuseppe
We are interested in the problem of understanding personal narratives (PN) - spoken or written - recollections of facts, events, and thoughts. In PN, emotion carriers are the speech or text segments that best explain the emotional state of the user. Such segments may include entities, verb or noun phrases. Advanced automatic understanding of PNs requires not only the prediction of the user emotional state but also to identify which events (e.g. "the loss of relative" or "the visit of grandpa") or people ( e.g. "the old group of high school mates") carry the emotion manifested during the personal recollection. This work proposes and evaluates an annotation model for identifying emotion carriers in spoken personal narratives. Compared to other text genres such as news and microblogs, spoken PNs are particularly challenging because a narrative is usually unstructured, involving multiple sub-events and characters as well as thoughts and associated emotions perceived by the narrator. In this work, we experiment with annotating emotion carriers from speech transcriptions in the Ulm State-of-Mind in Speech (USoMS) corpus, a dataset of German PNs. We believe this resource could be used for experiments in the automatic extraction of emotion carriers from PN, a task that could provide further advancements in narrative understanding.
Information Extraction from Receipts with Graph Convolutional Networks
As we have seen before, the Information Extraction step consists mainly of classifying words (tagging), the output can be stored as key-value pairs in a computer-friendly file format (e.g.: JSON). The data extracted can then be efficiently archived, indexed and used for analytics. If we compare OCR to young children training themselves to recognize characters and words, then Information Extraction would be like children learning to make sense of the words. An example of IE would be when you stare at your credit card bill trying to find the amount due and the due date. Suppose you want to build an AI application to do it automatically; OCR could be applied to extract the text from the image, converting pixels into bytes or Unicode characters, and the output would be every single character printed in the bill.
Gated Mechanism for Attention Based Multimodal Sentiment Analysis
ABSTRACT different granularities [3, 9] or use a cross interaction block that couple the features from different modalities [10, 6]. It is imperative that all modalities in multimodal interactions and 3. Fusion of unimodal and cross Therefore, to learn better cross modal information, we introduce 1.6% and 1.34% absolute improvement over current state-ofthe-art. Furthermore, to capture long term dependencies across 1. INTRODUCTION These are categorised into three types, 1. Methods that learn the modalities independently and fuse the In our proposed model, we aim to learn the interaction between [3, 4], and 3. Methods that explicitly learn contributions Personal use of this material is permitted. Multimodal sentiment analysis provides an opportunity to 2.1. M T V H T W H T V; W R d d (3) (U 1, U 2,..., U u) for a Text modality can be defined as: Cross attentive representations of Text (C V T R u d) and H T Bi-GRU(U 1, U 2,..., U u) (1) Video (C T V R u d) can be represented as: Subscript T denotes Text modality, A and V represent Audio As much as there is an opportunity to leverage cross modal interactions, representations is employed.
Analyzing Customer Support on Social Media - Qualetics Data Machines
The goal of this study is to analyze the queries raised by customers on a particular social media platform by analyzing their interactions with the customer support and provide incisive insights to perform sentiment analysis. We performed exploratory data analysis to extract insights from the data. With Deep Learning tools like NLTK, sentiment analysis was performed to understand the positive, negative, and neutral sentiments of the customers of a brand. Machine Learning was used to identify the frequency of similar text appearances. Deep learning algorithms were used to understand the customer queries and the average time taken by the respective company's social customer support team in addressing the queries.
Sentiment Analysis with the bag-of-words
As a precursor to research about Sentiment Analysis with Text Classifiers (Naive Bayes, Maximum Entropy, SVM), Sentiment Analysis with bag-of-words was done and Positive / Negative Sentiment was detected with an accuracy of 60%. This is when only unigrams are used. This percentage will be much when bigrams or trigrams are used (in a next blog-post). See the results at: part 1: http://tinyurl.com/gnlfqqm
Co-regularization Based Semi-supervised Domain Adaptation
Kumar, Abhishek, Saha, Avishek, Daume, Hal
This paper presents a co-regularization based approach to semi-supervised domain adaptation. Our proposed approach (EA) builds on the notion of augmented space (introduced in EASYADAPT (EA) [1]) and harnesses unlabeled data in target domain to further enable the transfer of information from source to target. This semi-supervised approach to domain adaptation is extremely simple to implement and can be applied as a pre-processing step to any supervised learner. Our theoretical analysis (in terms of Rademacher complexity) of EA and EA show that the hypothesis class of EA has lower complexity (compared to EA) and hence results in tighter generalization bounds. Experimental results on sentiment analysis tasks reinforce our theoretical findings and demonstrate the efficacy of the proposed method when compared to EA as well as a few other baseline approaches.
Adversarial Multiple Source Domain Adaptation
Zhao, Han, Zhang, Shanghang, Wu, Guanhang, Moura, José M. F., Costeira, Joao P., Gordon, Geoffrey J.
While domain adaptation has been actively researched, most algorithms focus on the single-source-single-target adaptation setting. In this paper we propose new generalization bounds and algorithms under both classification and regression settings for unsupervised multiple source domain adaptation. Our theoretical analysis naturally leads to an efficient learning strategy using adversarial neural networks: we show how to interpret it as learning feature representations that are invariant to the multiple domain shifts while still being discriminative for the learning task. To this end, we propose multisource domain adversarial networks (MDAN) that approach domain adaptation by optimizing task-adaptive generalization bounds. To demonstrate the effectiveness of MDAN, we conduct extensive experiments showing superior adaptation performance on both classification and regression problems: sentiment analysis, digit classification, and vehicle counting.