AITopics | Information Extraction

Collaborating Authors

Information Extraction

News Overviews Instructional Materials AI-Alerts Classics

Ashish Patel 🇮🇳 on LinkedIn: Data Management Challenges in Production Machine learning

#artificialintelligenceMar-2-2023, 12:50:37 GMT

Data Management Challenges in Production Machine Learning By Google #Data #Management #MLOps #MachineLearning

data management challenge, linkedin, production machine

#artificialintelligence

Industry: Information Technology > Services (0.40)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.89)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.40)

Add feedback

SESAMm Raises €35 Million in Series B2 to Grow its ESG and Sentiment Analysis Business

#artificialintelligenceMar-2-2023, 09:25:39 GMT

SESAMm, a leader in natural language processing (NLP), a field of artificial intelligence, announced the close of a Series B2 funding round of €35 million (USD 37 million) to accelerate its ambitious growth and global expansion plans. "Since we started working with SESAMm as investors and clients over two years ago, we've been impressed with both the company's growth and the advanced analytics that have supported our deal sourcing, diligence, and portfolio company value creation efforts" Securing this funding will enable SESAMm to further expand into U.S. and Asian markets, support technology development to generate AI-powered ESG and sentiment analytics, and hire key talent across sustainability, technology, sales, and marketing. The Series B2 round was co-led by Elaia, a deep tech VC firm, and Opera Tech Ventures, the venture capital arm of BNP Paribas (BNP). Other participating companies include asset manager Unigestion, Raiffeisen Bank International's (RBI) venture capital entity Elevator Ventures, AFG Partners, CEGEE Capital, and historical backers, including Carlyle (CG) and New Alpha Asset Management, who participated in the previous Series B1 round. This latest round brings the total funding raised to €50 million.

aithority interview, esg and sentiment analysis business, sesamm, (10 more...)

#artificialintelligence

Country: Asia > Singapore (0.05)

Industry: Banking & Finance > Trading (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.40)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.40)
Information Technology > Data Science > Data Mining > Big Data (0.38)

Add feedback

Soft Prompt Guided Joint Learning for Cross-Domain Sentiment Analysis

Shi, Jingli, Li, Weihua, Bai, Quan, Yang, Yi, Jiang, Jianhua

arXiv.org Artificial IntelligenceMar-1-2023

Aspect term extraction is a fundamental task in fine-grained sentiment analysis, which aims at detecting customer's opinion targets from reviews on product or service. The traditional supervised models can achieve promising results with annotated datasets, however, the performance dramatically decreases when they are applied to the task of cross-domain aspect term extraction. Existing cross-domain transfer learning methods either directly inject linguistic features into Language models, making it difficult to transfer linguistic knowledge to target domain, or rely on the fixed predefined prompts, which is time-consuming to construct the prompts over all potential aspect term spans. To resolve the limitations, we propose a soft prompt-based joint learning method for cross domain aspect term extraction in this paper. Specifically, by incorporating external linguistic features, the proposed method learn domain-invariant representations between source and target domains via multiple objectives, which bridges the gap between domains with varied distributions of aspect terms. Further, the proposed method interpolates a set of transferable soft prompts consisted of multiple learnable vectors that are beneficial to detect aspect terms in target domain. Extensive experiments are conducted on the benchmark datasets and the experimental results demonstrate the effectiveness of the proposed method for cross-domain aspect terms extraction.

artificial intelligence, machine learning, natural language, (14 more...)

arXiv.org Artificial Intelligence

2303.00815

Country:

Oceania > New Zealand > North Island > Auckland Region > Auckland (0.04)
Oceania > Australia > Tasmania (0.04)
Asia > China > Anhui Province > Hefei (0.04)
(2 more...)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.90)

Add feedback

SynGen: A Syntactic Plug-and-play Module for Generative Aspect-based Sentiment Analysis

Yu, Chengze, Wu, Taiqiang, Li, Jiayi, Bai, Xingyu, Yang, Yujiu

arXiv.org Artificial IntelligenceFeb-25-2023

Aspect-based Sentiment Analysis (ABSA) is a sentiment analysis task at fine-grained level. Recently, generative frameworks have attracted increasing attention in ABSA due to their ability to unify subtasks and their continuity to upstream pre-training tasks. However, these generative models suffer from the neighboring dependency problem that induces neighboring words to get higher attention. In this paper, we propose SynGen, a plug-and-play syntactic information aware module. As a plug-in module, our SynGen can be easily applied to any generative framework backbones. The key insight of our module is to add syntactic inductive bias to attention assignment and thus direct attention to the correct target words. To the best of our knowledge, we are the first one to introduce syntactic information to generative ABSA frameworks. Our module design is based on two main principles: (1) maintaining the structural integrity of backbone PLMs and (2) disentangling the added syntactic information and original semantic information. Empirical results on four popular ABSA datasets demonstrate that SynGen enhanced model achieves a comparable performance to the state-of-the-art model with relaxed labeling specification and less training consumption.

artificial intelligence, information, natural language, (17 more...)

arXiv.org Artificial Intelligence

2302.13032

Country:

Europe > Italy > Tuscany > Florence (0.04)
Asia > China > Guangdong Province > Shenzhen (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.94)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.94)

Add feedback

Resources for Turkish Natural Language Processing: A critical survey

Çöltekin, Çağrı, Doğruöz, A. Seza, Çetinoğlu, Özlem

arXiv.org Artificial IntelligenceFeb-25-2023

The recent (re)popularization of deep learning methods increased the importance and need for the data even further. Similarly, the other subfields of theoretical and applied linguistics have also seen a shift towards more data-driven methods. As a result, availability of large and high-quality language data is essential for both linguistic research and practical NLP applications. In this paper, we present a comprehensive and critical survey of linguistic resources for Turkish.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

doi: 10.1007/s10579-022-09605-4

2204.05042

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Germany > Baden-Württemberg > Tübingen Region > Tübingen (0.14)
Europe > France > Provence-Alpes-Côte d'Azur > Bouches-du-Rhône > Marseille (0.04)
(49 more...)

Genre:

Overview (1.00)
Research Report > Experimental Study (0.67)
Research Report > New Finding (0.45)

Industry:

Media > News (1.00)
Education (1.00)
Government (0.67)
Law (0.67)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
(5 more...)

Add feedback

Emotion Prediction Oriented method with Multiple Supervisions for Emotion-Cause Pair Extraction

Hu, Guimin, Zhao, Yi, Lu, Guangming

arXiv.org Artificial IntelligenceFeb-23-2023

Emotion-cause pair extraction (ECPE) task aims to extract all the pairs of emotions and their causes from an unannotated emotion text. The previous works usually extract the emotion-cause pairs from two perspectives of emotion and cause. However, emotion extraction is more crucial to the ECPE task than cause extraction. Motivated by this analysis, we propose an end-to-end emotion-cause extraction approach oriented toward emotion prediction (EPO-ECPE), aiming to fully exploit the potential of emotion prediction to enhance emotion-cause pair extraction. Considering the strong dependence between emotion prediction and emotion-cause pair extraction, we propose a synchronization mechanism to share their improvement in the training process. That is, the improvement of emotion prediction can facilitate the emotion-cause pair extraction, and then the results of emotion-cause pair extraction can also be used to improve the accuracy of emotion prediction simultaneously. For the emotion-cause pair extraction, we divide it into genuine pair supervision and fake pair supervision, where the genuine pair supervision learns from the pairs with more possibility to be emotion-cause pairs. In contrast, fake pair supervision learns from other pairs. In this way, the emotion-cause pairs can be extracted directly from the genuine pair, thereby reducing the difficulty of extraction. Experimental results show that our approach outperforms the 13 compared systems and achieves new state-of-the-art performance.

extraction, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2302.12417

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.14)
(17 more...)

Genre: Research Report > New Finding (0.66)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
(3 more...)

Add feedback

Exploring celebrity influence on public attitude towards the COVID-19 pandemic: social media shared sentiment analysis

White, Brianna M, Melton, Chad A, Zareie, Parya, Davis, Robert L, Bednarczyk, Robert A, Shaban-Nejad, Arash

arXiv.org Artificial IntelligenceFeb-23-2023

The COVID-19 pandemic has introduced new opportunities for health communication, including an increase in the public use of online outlets for health-related emotions. People have turned to social media networks to share sentiments related to the impacts of the COVID-19 pandemic. In this paper we examine the role of social messaging shared by Persons in the Public Eye (i.e. athletes, politicians, news personnel) in determining overall public discourse direction. We harvested approximately 13 million tweets ranging from 1 January 2020 to 1 March 2022. The sentiment was calculated for each tweet using a fine-tuned DistilRoBERTa model, which was used to compare COVID-19 vaccine-related Twitter posts (tweets) that co-occurred with mentions of People in the Public Eye. Our findings suggest the presence of consistent patterns of emotional content co-occurring with messaging shared by Persons in the Public Eye for the first two years of the COVID-19 pandemic influenced public opinion and largely stimulated online public discourse. We demonstrate that as the pandemic progressed, public sentiment shared on social networks was shaped by risk perceptions, political ideologies and health-protective behaviours shared by Persons in the Public Eye, often in a negative light.

artificial intelligence, natural language, sentiment, (13 more...)

arXiv.org Artificial Intelligence

doi: 10.1136/bmjhci-2022-100665

2303.16759

Country:

North America > United States > Tennessee > Knox County > Knoxville (0.04)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
Asia > Middle East > Iran (0.04)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.51)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.51)

Add feedback

Automated Extraction of Fine-Grained Standardized Product Information from Unstructured Multilingual Web Data

Flick, Alexander, Jäger, Sebastian, Trajanovska, Ivana, Biessmann, Felix

arXiv.org Artificial IntelligenceFeb-23-2023

Extracting structured information from unstructured data is one of the key challenges in modern information retrieval applications, including e-commerce. Here, we demonstrate how recent advances in machine learning, combined with a recently published multilingual data set with standardized fine-grained product category information, enable robust product attribute extraction in challenging transfer learning settings. Our models can reliably predict product attributes across online shops, languages, or both. Furthermore, we show that our models can be used to match product taxonomies between online retailers.

category, information retrieval, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2302.12139

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
North America > United States > New York > New York County > New York City (0.05)
North America > United States > Michigan > Washtenaw County > Ann Arbor (0.04)
(2 more...)

Genre: Research Report (0.64)

Industry: Retail (0.54)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.37)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.30)

Add feedback

Generalizing through Forgetting -- Domain Generalization for Symptom Event Extraction in Clinical Notes

Zhou, Sitong, Lybarger, Kevin, Yetisgen, Meliha, Ostendorf, Mari

arXiv.org Artificial IntelligenceFeb-23-2023

Symptom information is primarily documented in free-text clinical notes and is not directly accessible for downstream applications. To address this challenge, information extraction approaches that can handle clinical language variation across different institutions and specialties are needed. In this paper, we present domain generalization for symptom extraction using pretraining and fine-tuning data that differs from the target domain in terms of institution and/or specialty and patient population. We extract symptom events using a transformer-based joint entity and relation extraction method. To reduce reliance on domain-specific features, we propose a domain generalization method that dynamically masks frequent symptoms words in the source domain. Additionally, we pretrain the transformer language model (LM) on task-related unlabeled texts for better representation. Our experiments indicate that masking and adaptive pretraining methods can significantly improve performance when the source domain is more distant from the target domain.

data mining, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2209.09485

Country:

North America > United States > Washington > King County > Seattle (0.14)
North America > United States > Virginia > Fairfax County > Fairfax (0.04)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Health Care Providers & Services (1.00)
Health & Medicine > Health Care Technology > Medical Record (0.87)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)
Information Technology > Data Science > Data Mining > Text Mining (0.34)

Add feedback

A Semantic Approach to Negation Detection and Word Disambiguation with Natural Language Processing

Okpala, Izunna, Rodriguez, Guillermo Romera, Tapia, Andrea, Halse, Shane, Kropczynski, Jess

arXiv.org Artificial IntelligenceFeb-22-2023

This study aims to demonstrate the methods for detecting negations in a sentence by uniquely evaluating the lexical structure of the text via word-sense disambiguation. The proposed framework examines all the unique features in the various expressions within a text to resolve the contextual usage of all tokens and decipher the effect of negation on sentiment analysis. The application of popular expression detectors skips this important step, thereby neglecting the root words caught in the web of negation and making text classification difficult for machine learning and sentiment analysis. This study adopts the Natural Language Processing (NLP) approach to discover and antonimize words that were negated for better accuracy in text classification using a knowledge base provided by an NLP library called WordHoard. Early results show that our initial analysis improved on traditional sentiment analysis, which sometimes neglects negations or assigns an inverse polarity score. The SentiWordNet analyzer was improved by 35%, the Vader analyzer by 20% and the TextBlob by 6%.

machine learning, natural language, negation, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3582768.3582789

2302.02291

Country:

Asia > Thailand > Bangkok > Bangkok (0.05)
North America > United States > Ohio > Hamilton County > Cincinnati (0.04)
North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.04)
(3 more...)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.92)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.92)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback