Goto

Collaborating Authors

 Information Extraction


Customer Insights 2021 Predictions: Evolution And Collaboration

#artificialintelligence

CI leaders will shift 10% of their budgets to emotion analytics. Emotions are a more important driver of consumer decisions than rational thought and thus are the largest factor in brand energy, customer experience, and marketing effectiveness. But for the past decade, CI professionals have leaned into the precision of big data analytics instead of the traditionally unquantifiable territory of emotion. New techniques change this dynamic: AI-based text analytics tools such as Clarabridge and IBM Watson improve the precision of cruder sentiment analysis tools, while firms such as Nielsen and Realeyes bring biometric and facial analysis methodologies from the lab to the business world. As data analytics becomes commoditized, firms will shift 10% of the insights budget to emotion analytics to pilot new techniques in search of competitive advantage in the "why" behind consumer behavior, not just the "what" that data analytics addresses. Companies will reorganize to ensure CX and CI collaboration.


Iterative Network Pruning with Uncertainty Regularization for Lifelong Sentiment Classification

arXiv.org Artificial Intelligence

Lifelong learning capabilities are crucial for sentiment classifiers to process continuous streams of opinioned information on the Web. However, performing lifelong learning is non-trivial for deep neural networks as continually training of incrementally available information inevitably results in catastrophic forgetting or interference. In this paper, we propose a novel iterative network pruning with uncertainty regularization method for lifelong sentiment classification (IPRLS), which leverages the principles of network pruning and weight regularization. By performing network pruning with uncertainty regularization in an iterative manner, IPRLS can adapta single BERT model to work with continuously arriving data from multiple domains while avoiding catastrophic forgetting and interference. Specifically, we leverage an iterative pruning method to remove redundant parameters in large deep networks so that the freed-up space can then be employed to learn new tasks, tackling the catastrophic forgetting problem. Instead of keeping the old-tasks fixed when learning new tasks, we also use an uncertainty regularization based on the Bayesian online learning framework to constrain the update of old tasks weights in BERT, which enables positive backward transfer, i.e. learning new tasks improves performance on past tasks while protecting old knowledge from being lost. In addition, we propose a task-specific low-dimensional residual function in parallel to each layer of BERT, which makes IPRLS less prone to losing the knowledge saved in the base BERT network when learning a new task. Extensive experiments on 16 popular review corpora demonstrate that the proposed IPRLS method sig-nificantly outperforms the strong baselines for lifelong sentiment classification. For reproducibility, we submit the code and data at:https://github.com/siat-nlp/IPRLS.


Tag, Copy or Predict: A Unified Weakly-Supervised Learning Framework for Visual Information Extraction using Sequences

arXiv.org Artificial Intelligence

Visual information extraction (VIE) has attracted increasing attention in recent years. The existing methods usually first organized optical character recognition (OCR) results into plain texts and then utilized token-level entity annotations as supervision to train a sequence tagging model. However, it expends great annotation costs and may be exposed to label confusion, and the OCR errors will also significantly affect the final performance. In this paper, we propose a unified weakly-supervised learning framework called TCPN (Tag, Copy or Predict Network), which introduces 1) an efficient encoder to simultaneously model the semantic and layout information in 2D OCR results; 2) a weakly-supervised training strategy that utilizes only key information sequences as supervision; and 3) a flexible and switchable decoder which contains two inference modes: one (Copy or Predict Mode) is to output key information sequences of different categories by copying a token from the input or predicting one in each time step, and the other (Tag Mode) is to directly tag the input sequence in a single forward pass. Our method shows new state-of-the-art performance on several public benchmarks, which fully proves its effectiveness.


Add Machine Learning to Your Apps using TensorFlow.js

#artificialintelligence

All around us, developers now leverage machine learning capabilities in their applications to amplify human effort. Tensorflow enables developers to make mind blowing capabilities like tracking your pose with a web cam, object detection using images, sentiment analysis in text, and computer generated art/music. In this session, we'll explore the opportunities for web developers to create "plug and play" machine learning experiences using TensorFlow.JS and related JavaScript libraries. We'll explore ways that you can make an impact using pre-trained models from tfhub.dev. To learn more, check out https://www.tensorflow.org/js/ .


David Horton on LinkedIn: ElligencIA Teaser

#artificialintelligence

The annual production of data follows an exponential curve, the assimilation by a person or even by a group of persons of this data is no longer possible. To get the most out of it, it is necessary to be helped by computers. But as this data is mostly unstructured, classical algorithms are unable to do this job. Only Artificial Intelligence and in particular NLP with Sentiment Analysis can do it. We created ElligencIA with the aim of giving meaning to this ocean of data and taking advantage of this collective intelligence. ElligencIA, operational since January 1st 2021, is an AI consulting and solutions company for the BFSI.


Use advanced natural language processing and tone analysis to extract meaningful insights

#artificialintelligence

Learn how to extract insights from natural language text, such as category, concepts, emotion, entities, keywords, sentiment, top positive sentences, and word clouds by using IBM Watson Natural Language Understanding and Watson Tone Analyzer. Watson Natural Language Understanding includes a set of text analytics features that can be used to extract meanings from unstructured data such as a text file. Watson Tone Analyzer understands emotions and communication styles in a text. By combining the capabilities of both services, you can extract meaningful insights in the form of a natural language understanding analysis report from a natural language transcript. The transcript used in this code pattern is generated from a video recording of the IBM Q1 2019 earnings meeting.


SaH Analytics International on LinkedIn: Data at the core of Analytics

#artificialintelligence

Why analytics are vital for prosperous businesses? Descriptive analytics: review data with stats to tell you what happened in the past. It helps a business understand better how it is performing by providing context to help stakeholders interpret information. This can be in the form of data visualizations like graphs, charts, reports, and dashboards. Predictive analytics takes past data and feeds it into a machine learning model that considers key trends and patterns.


Sentiment Analysis

#artificialintelligence

Sentiment Analysis, as the name suggests, it means to identify the view or emotion behind a situation. It basically means to analyze and find the emotion or intent behind a piece of text or speech or any mode of communication. In this article, we will focus on the sentiment analysis of text data. We, humans, communicate with each other in a variety of languages, and any language is just a mediator or a way in which we try to express ourselves. And, whatever we say has a sentiment associated with it.


Natural Language Processing With Transformers in Python

#artificialintelligence

Transformer models are the de-facto standard in modern NLP. They have proven themselves as the most expressive, powerful models for language by a large margin, beating all major language-based benchmarks time and time again. In this course, we learn all you need to know to get started with building cutting-edge performance NLP applications using transformer models like Google AI's BERT, or Facebook AI's DPR. Throughout each of these use-cases we work through a variety of examples to ensure that what, how, and why transformers are so important. Alongside these sections we also work through two full-size NLP projects, one for sentiment analysis of financial Reddit data, and another covering a fully-fledged open domain question-answering application.


Bangla Natural Language Processing: A Comprehensive Review of Classical, Machine Learning, and Deep Learning Based Methods

arXiv.org Artificial Intelligence

The Bangla language is the seventh most spoken language, with 265 million native and non-native speakers worldwide. However, English is the predominant language for online resources and technical knowledge, journals, and documentation. Consequently, many Bangla-speaking people, who have limited command of English, face hurdles to utilize English resources. To bridge the gap between limited support and increasing demand, researchers conducted many experiments and developed valuable tools and techniques to create and process Bangla language materials. Many efforts are also ongoing to make it easy to use the Bangla language in the online and technical domains. There are some review papers to understand the past, previous, and future Bangla Natural Language Processing (BNLP) trends. The studies are mainly concentrated on the specific domains of BNLP, such as sentiment analysis, speech recognition, optical character recognition, and text summarization. There is an apparent scarcity of resources that contain a comprehensive study of the recent BNLP tools and methods. Therefore, in this paper, we present a thorough review of 71 BNLP research papers and categorize them into 11 categories, namely Information Extraction, Machine Translation, Named Entity Recognition, Parsing, Parts of Speech Tagging, Question Answering System, Sentiment Analysis, Spam and Fake Detection, Text Summarization, Word Sense Disambiguation, and Speech Processing and Recognition. We study articles published between 1999 to 2021, and 50% of the papers were published after 2015. We discuss Classical, Machine Learning and Deep Learning approaches with different datasets while addressing the limitations and current and future trends of the BNLP.