Goto

Collaborating Authors

 Information Extraction


Day 123 of #NLP365: NLP Papers Summary -- Context-aware Embedding for Targeted Aspect-based…

#artificialintelligence

Proposed context-aware embeddings to refine the embeddings of targets and aspects using highly correlative words. A lot of previous work uses context-independent vectors to construct targets and aspects embeddings, which lead to loss of semantic information and failed to capture the interconnection between the specific target, its aspect, and its context. This approach has led to SOTA results in targeted aspect-based sentiment analysis (TABSA). The goal of TABSA is that given an input sentence, we want to extract the sentiment of the aspect that belongs to a target. The refined target embeddings can be computed by multiplying the sentence word embeddings X with the sparse coefficient vector u'.


Machine Knowledge: Creation and Curation of Comprehensive Knowledge Bases

arXiv.org Artificial Intelligence

Equipping machines with comprehensive knowledge of the world's entities and their relationships has been a long-standing goal of AI. Over the last decade, large-scale knowledge bases, also known as knowledge graphs, have been automatically constructed from web contents and text sources, and have become a key asset for search engines. This machine knowledge can be harnessed to semantically interpret textual phrases in news, social media and web tables, and contributes to question answering, natural language processing and data analytics. This article surveys fundamental concepts and practical methods for creating and curating large knowledge bases. It covers models and methods for discovering and canonicalizing entities and their semantic types and organizing them into clean taxonomies. On top of this, the article discusses the automatic extraction of entity-centric properties. To support the long-term life-cycle and the quality assurance of machine knowledge, the article presents methods for constructing open schemas and for knowledge curation. Case studies on academic projects and industrial knowledge graphs complement the survey of concepts and methods.


Sentiment Analysis with NLP using Python and Flask

#artificialintelligence

There are two main features in SA. Sentiment Analysis is widely used in the area of Machine Learning under Natural Language Processing. In this course, you will know how to use sentiment analysis on reviews with the help of a NLP library called TextBlob. You will learn and develop a Flask based WebApp that takes reviews from the user and perform sentiment analysis on the same.


Financial Evolution: AI, Machine Learning & Sentiment Analysis

#artificialintelligence

Artificial Intelligence and Machine Learning (AI & ML) and Sentiment Analysis are said to "predict the future through analysing the past" – the Holy Grail of the finance sector. They can replicate cognitive decisions made by humans yet avoid the behavioural biases inherent in humans. Processing news data and social media data and classifying (market) sentiment and how it impacts Financial Markets is a growing area of research. The field has recently progressed further with many new "alternative" data sources, such as email receipts, credit/debit card transactions, weather, geo-location, satellite data, Twitter, Micro-blogs and search engine results. AI & ML are gaining adoption in the financial services industry especially in the context of compliance, investment decisions and risk management.


WESSA at SemEval-2020 Task 9: Code-Mixed Sentiment Analysis using Transformers

arXiv.org Artificial Intelligence

In this paper, we describe our system submitted for SemEval 2020 Task 9, Sentiment Analysis for Code-Mixed Social Media Text alongside other experiments. Our best performing system is a Transfer Learning-based model that fine-tunes "XLM-RoBERTa", a transformer-based multilingual masked language model, on monolingual English and Spanish data and Spanish-English code-mixed data. Our system outperforms the official task baseline by achieving a 70.1% average F1-Score on the official leaderboard using the test set. For later submissions, our system manages to achieve a 75.9% average F1-Score on the test set using CodaLab username "ahmed0sultan".


Facebook, Twitter and YouTube have your data. Why not China-owned ByteDance's TikTok?

USATODAY - Tech Top Stories

Facebook, Twitter and YouTube have your data. The Trump administration said Friday that it would bar two popular Chinese-owned mobile apps WeChat and TikTok from U.S. app stores as of midnight Sunday, escalating the U.S. standoff with China. "Today's actions prove once again that President Trump will do everything in his power to guarantee our national security and protect Americans from the threats of the Chinese Communist Party," Commerce Secretary Wilbur Ross said in a statement. The Trump administration contends the data collected from American users by TikTok and WeChat could be accessed by the Chinese government. "The Trump administration is looking to make sure U.S. TikTok consumer data stays out of Beijing," said Wedbush Securities analyst Daniel Ives.


Large-Scale Intelligent Microservices

arXiv.org Artificial Intelligence

Deploying Machine Learning (ML) algorithms within databases is a challenge due to the varied computational footprints of modern ML algorithms and the myriad of database technologies each with their own restrictive syntax. We introduce an Apache Spark-based micro-service orchestration framework that extends database operations to include web service primitives. Our system can orchestrate web services across hundreds of machines and takes full advantage of cluster, thread, and asynchronous parallelism. Using this framework, we provide large scale clients for intelligent services such as speech, vision, search, anomaly detection, and text analysis. This allows users to integrate ready-to-use intelligence into any datastore with an Apache Spark connector. To eliminate the majority of overhead from network communication, we also introduce a low-latency containerized version of our architecture. Finally, we demonstrate that the services we investigate are competitive on a variety of benchmarks, and present two applications of this framework to create intelligent search engines, and real time auto race analytics systems.


A Beginner's Guide to Sentiment Analysis with Python

#artificialintelligence

Sentiment analysis is a technique that detects the underlying sentiment in a piece of text. It is the process of classifying text as either positive, negative, or neutral. Machine learning techniques are used to evaluate a piece of text and determine the sentiment behind it. Sentiment analysis is essential for businesses to gauge customer response. Picture this: Your company has just released a new product that is being advertised on a number of different channels.


Arabic Opinion Mining Using a Hybrid Recommender System Approach

arXiv.org Artificial Intelligence

One of these textual information is the customer comments or reviews. People usually prefer to read the reviews before buying or using a service to make the right decision. This behavior is also common before the existence of the Internet. From this amount of available data, researches attempt to handle and use these data to have a specific and useful knowledge. Sentiment analysis (SA) is the process of determining the opinion or feeling of a piece of text. Sentiment means feelings, attitudes, emotions and opinions. The applications of sentiment analysis are numerous such as politics or political science, law, e-commerce, sociology and psychology. In e-commerce, the sentiment analysis is super useful for gaining insight into customer opinions; once they understand how the customer feels after analyzing their comments or reviews, they can identify what they like and dislike and build things like recommendation systems, or enhance the product or the service.


Tag and Correct: Question aware Open Information Extraction with Two-stage Decoding

arXiv.org Artificial Intelligence

Question Aware Open Information Extraction (Question aware Open IE) takes question and passage as inputs, outputting an answer tuple which contains a subject, a predicate, and one or more arguments. Each field of answer is a natural language word sequence and is extracted from the passage. The semi-structured answer has two advantages which are more readable and falsifiable compared to span answer. There are two approaches to solve this problem. One is an extractive method which extracts candidate answers from the passage with the Open IE model, and ranks them by matching with questions. It fully uses the passage information at the extraction step, but the extraction is independent to the question. The other one is the generative method which uses a sequence to sequence model to generate answers directly. It combines the question and passage as input at the same time, but it generates the answer from scratch, which does not use the facts that most of the answer words come from in the passage. To guide the generation by passage, we present a two-stage decoding model which contains a tagging decoder and a correction decoder. At the first stage, the tagging decoder will tag keywords from the passage. At the second stage, the correction decoder will generate answers based on tagged keywords. Our model could be trained end-to-end although it has two stages. Compared to previous generative models, we generate better answers by generating coarse to fine. We evaluate our model on WebAssertions (Yan et al., 2018) which is a Question aware Open IE dataset. Our model achieves a BLEU score of 59.32, which is better than previous generative methods.