AITopics | Information Extraction

Collaborating Authors

Information Extraction

News Overviews Instructional Materials AI-Alerts Classics

Azure Cognitive Services Sentiment Analysis V3-- Using PySpark

#artificialintelligenceSep-6-2020, 02:30:12 GMT

Azure Cognitive Services Text Analytics is a great tool you can use to quickly evaluate a text data set for positive or negative sentiment. For example, a service provider can quickly and easily evaluate reviews as positive or negative and rank them based on the sentiment score detected. Today I'm going to go through how to use Azure Cognitive Services Text Analytics using Databricks PySpark Notebook to analyze the sentiment of COVID-19 Tweets and return sentiment scores and indicators as to whether it is a positive or negative tweet. Cognitive Services are a set of machine learning algorithms that Microsoft has developed to solve problems in the field of Artificial Intelligence (AI). Developers can consume these algorithms through standard REST calls over the Internet to the Cognitive Services APIs in their Apps, Websites, or Workflows.

artificial intelligence, machine learning, natural language, (6 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Beyond Social Media Analytics: Understanding Human Behaviour and Deep Emotion using Self Structuring Incremental Machine Learning

Bandaragoda, Tharindu

arXiv.org Machine LearningSep-5-2020

This thesis develops a conceptual framework considering social data as representing the surface layer of a hierarchy of human social behaviours, needs and cognition which is employed to transform social data into representations that preserve social behaviours and their causalities. Based on this framework two platforms were built to capture insights from fast-paced and slow-paced social data. For fast-paced, a self-structuring and incremental learning technique was developed to automatically capture salient topics and corresponding dynamics over time. An event detection technique was developed to automatically monitor those identified topic pathways for significant fluctuations in social behaviours using multiple indicators such as volume and sentiment. This platform is demonstrated using two large datasets with over 1 million tweets. The separated topic pathways were representative of the key topics of each entity and coherent against topic coherence measures. Identified events were validated against contemporary events reported in news. Secondly for the slow-paced social data, a suite of new machine learning and natural language processing techniques were developed to automatically capture self-disclosed information of the individuals such as demographics, emotions and timeline of personal events. This platform was trialled on a large text corpus of over 4 million posts collected from online support groups. This was further extended to transform prostate cancer related online support group discussions into a multidimensional representation and investigated the self-disclosed quality of life of patients (and partners) against time, demographics and clinical factors. The capabilities of this extended platform have been demonstrated using a text corpus collected from 10 prostate cancer online support groups comprising of 609,960 prostate cancer discussions and 22,233 patients.

information retrieval, machine learning, patient-reported information multidimensional exploration, (20 more...)

arXiv.org Machine Learning

2009.09078

Country:

Asia > Russia (0.45)
North America > United States > New York > New York County > New York City (0.14)
Asia > Middle East > Iran (0.14)
(38 more...)

Genre:

Research Report > Strength High (1.00)
Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
(3 more...)

Industry:

Health & Medicine > Therapeutic Area > Urology (1.00)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology > Mental Health (1.00)
Health & Medicine > Therapeutic Area > Neurology (1.00)
(3 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
(9 more...)

Add feedback

Forrester names SAS a Leader in AI-Based text analytics platforms (document focused).

#artificialintelligenceAug-30-2020, 23:20:27 GMT

According to Forrester, document capture options and mining text from images and cursive writing in multiple languages are key differentiators for document-focused enterprise text analytics platforms, which focus on longer documents, such as contracts, insurance claims, invoices, and purchase orders. The software helps users with sentiment analysis, trend analysis, data preparation and visualization and hybrid modeling approaches. The report states, "SAS Visual Text Analytics bolsters SAS' family of formidable analytics products. SAS Visual Text Analytics is one of [SAS'] several applications built on the SAS Viya platform, where all applications share data and model management, BI and analytics GUI and other microservices, resulting in consistent UX."

data mining, natural language, text analytic platform, (3 more...)

#artificialintelligence

Industry: Media > News (0.40)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)

Add feedback

A Blast From the Past: Personalizing Predictions of Video-Induced Emotions using Personal Memories as Context

Dudzik, Bernd, Broekens, Joost, Neerincx, Mark, Hung, Hayley

arXiv.org Artificial IntelligenceAug-27-2020

A key challenge in the accurate prediction of viewers' emotional responses to video stimuli in real-world applications is accounting for person- and situation-specific variation. An important contextual influence shaping individuals' subjective experience of a video is the personal memories that it triggers in them. Prior research has found that this memory influence explains more variation in video-induced emotions than other contextual variables commonly used for personalizing predictions, such as viewers' demographics or personality. In this article, we show that (1) automatic analysis of text describing their video-triggered memories can account for variation in viewers' emotional responses, and (2) that combining such an analysis with that of a video's audiovisual content enhances the accuracy of automatic predictions. We discuss the relevance of these findings for improving on state of the art approaches to automated affective video analysis in personalized contexts.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2008.12096

Country:

North America > United States > New York > New York County > New York City (0.14)
North America > United States > California > Los Angeles County > Los Angeles (0.14)
Europe > Netherlands > South Holland > Delft (0.05)
(2 more...)

Genre: Research Report > New Finding (0.93)

Industry: Health & Medicine > Therapeutic Area (0.69)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Data Science (0.94)
(3 more...)

Add feedback

Cross-language sentiment analysis of European Twitter messages duringthe COVID-19 pandemic

Kruspe, Anna, Häberle, Matthias, Kuhn, Iona, Zhu, Xiao Xiang

arXiv.org Machine LearningAug-27-2020

Social media data can be a very salient source of information during crises. User-generated messages provide a window into people's minds during such times, allowing us insights about their moods and opinions. Due to the vast amounts of such messages, a large-scale analysis of population-wide developments becomes possible. In this paper, we analyze Twitter messages (tweets) collected during the first months of the COVID-19 pandemic in Europe with regard to their sentiment. This is implemented with a neural network for sentiment analysis using multilingual sentence embeddings. We separate the results by country of origin, and correlate their temporal development with events in those countries. This allows us to study the effect of the situation on people's moods. We see, for example, that lockdown announcements correlate with a deterioration of mood in almost all surveyed countries, which recovers within a short time span.

keyword, sentiment, tweet, (12 more...)

arXiv.org Machine Learning

2008.12172

Country:

Europe > United Kingdom (0.14)
Europe > Spain (0.06)
Europe > Italy (0.06)
(33 more...)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.92)
Health & Medicine > Therapeutic Area > Immunology (0.92)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.86)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.86)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)

Add feedback

Multi-Label Sentiment Analysis on 100 Languages with Dynamic Weighting for Label Imbalance

Yilmaz, Selim F., Kaynak, E. Batuhan, Koç, Aykut, Dibeklioğlu, Hamdi, Kozat, Suleyman S.

arXiv.org Machine LearningAug-26-2020

We investigate cross-lingual sentiment analysis, which has attracted significant attention due to its applications in various areas including market research, politics and social sciences. In particular, we introduce a sentiment analysis framework in multi-label setting as it obeys Plutchik wheel of emotions. We introduce a novel dynamic weighting method that balances the contribution from each class during training, unlike previous static weighting methods that assign non-changing weights based on their class frequency. Moreover, we adapt the focal loss that favors harder instances from single-label object recognition literature to our multi-label setting. Furthermore, we derive a method to choose optimal class-specific thresholds that maximize the macro-f1 score in linear time complexity. Through an extensive set of experiments, we show that our method obtains the state-of-the-art performance in 7 of 9 metrics in 3 different languages using a single model compared to the common baselines and the best-performing methods in the SemEval competition. We publicly share our code for our model, which can perform sentiment analysis in 100 languages, to facilitate further research.

classification, machine learning, natural language, (20 more...)

arXiv.org Machine Learning

2008.11573

Country:

Asia > Middle East > Republic of Türkiye > Ankara Province > Ankara (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
Europe > Portugal > Braga > Braga (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

Add feedback

Learning from students' perception on professors through opinion mining

Vargas-Calderón, Vladimir, Flórez, Juan S., Ardila, Leonel F., Parra-A., Nicolas, Camargo, Jorge E., Vargas, Nelson

arXiv.org Artificial IntelligenceAug-25-2020

Students' perception of classes measured through their opinions on teaching surveys allows to identify deficiencies and problems, both in the environment and in the learning methodologies. The purpose of this paper is to study, through sentiment analysis using natural language processing (NLP) and machine learning (ML) techniques, those opinions in order to identify topics that are relevant for students, as well as predicting the associated sentiment via polarity analysis. As a result, it is implemented, trained and tested two algorithms to predict the associated sentiment as well as the relevant topics of such opinions. The combination of both approaches then becomes useful to identify specific properties of the students' opinions associated with each sentiment label (positive, negative or neutral opinions) and topic. Furthermore, we explore the possibility that students' perception surveys are carried out without closed questions, relying on the information that students can provide through open questions where they express their opinions about their classes.

artificial intelligence, natural language, student, (18 more...)

arXiv.org Artificial Intelligence

doi: 10.1007/978-3-030-61702-8_23

2008.11183

Country:

South America > Colombia > Bogotá D.C. > Bogotá (0.04)
Oceania > Australia (0.04)
North America > United States > Nevada > Clark County > Las Vegas (0.04)
(5 more...)

Genre:

Questionnaire & Opinion Survey (0.91)
Research Report > Strength High (0.46)
Research Report > Experimental Study (0.46)
Instructional Material > Course Syllabus & Notes (0.46)

Industry:

Health & Medicine (1.00)
Education > Educational Setting (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)

Add feedback

Portfolio

#artificialintelligenceAug-22-2020, 08:30:20 GMT

The Linguistic Universe of Hungarian Poet Endre Ady Gender Stereotypes of Hungarian Online Media Named Entities in Hungarian Online Media Growth Hacking with NLP and Sentiment Analysis - our 5-week course at Manning Publications Metaphor and National Identity Alternative conceptualization of the Treaty of Trianon - 2019, John Benjamins Publishing Company We helped the future…

artificial intelligence, portfolio, social media, (1 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (0.39)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.39)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.39)

Add feedback

Twitter Data Case Sparks Dispute, Delay Among EU Privacy Regulators

WSJ.com: WSJD - TechnologyAug-20-2020, 11:00:00 GMT

European Union privacy regulators are clashing over how much--if anything--to fine Twitter Inc. for its handling of a data breach disclosed last year, delaying progress of the most advanced cross-border privacy case involving a U.S. tech company under the EU's strict new privacy law. The dispute, disclosed in a statement Thursday from Ireland's Data Protection Commission, is one of the first major tests for enforcement of the EU's privacy law, known as GDPR, which took effect in 2018. It raises the specter of disagreements and...

artificial intelligence, natural language, twitter data case spark dispute, (3 more...)

WSJ.com: WSJD - Technology

Country: Europe > Ireland (0.35)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.40)

Add feedback

SentiQ: A Probabilistic Logic Approach to Enhance Sentiment Analysis Tool Quality

Kouadri, Wissam Maamar, Benbernou, Salima, Ouziri, Mourad, Palpanas, Themis, Amor, Iheb Ben

arXiv.org Artificial IntelligenceAug-19-2020

The opinion expressed in various Web sites and social-media is an essential contributor to the decision making process of several organizations. Existing sentiment analysis tools aim to extract the polarity (i.e., positive, negative, neutral) from these opinionated contents. Despite the advance of the research in the field, sentiment analysis tools give \textit{inconsistent} polarities, which is harmful to business decisions. In this paper, we propose SentiQ, an unsupervised Markov logic Network-based approach that injects the semantic dimension in the tools through rules. It allows to detect and solve inconsistencies and then improves the overall accuracy of the tools. Preliminary experimental results demonstrate the usefulness of SentiQ.

artificial intelligence, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2008.08919

Country:

North America > United States > California > San Diego County > San Diego (0.05)
Asia > China (0.05)
North America > United States > New York > New York County > New York City (0.04)
Europe > France > Auvergne-Rhône-Alpes > Lyon > Lyon (0.04)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.70)

Add feedback