AITopics | Information Extraction

Collaborating Authors

Information Extraction

News Overviews Instructional Materials AI-Alerts Classics

Urdu Speech and Text Based Sentiment Analyzer

arXiv.org Artificial IntelligenceJul-19-2022

Discovering what other people think has always been a key aspect of our information-gathering strategy. People can now actively utilize information technology to seek out and comprehend the ideas of others, thanks to the increased availability and popularity of opinion-rich resources such as online review sites and personal blogs. Because of its crucial function in understanding people's opinions, sentiment analysis (SA) is a crucial task. Existing research, on the other hand, is primarily focused on the English language, with just a small amount of study devoted to low-resource languages. For sentiment analysis, this work presented a new multi-class Urdu dataset based on user evaluations. The tweeter website was used to get Urdu dataset. Our proposed dataset includes 10,000 reviews that have been carefully classified into two categories by human experts: positive, negative. The primary purpose of this research is to construct a manually annotated dataset for Urdu sentiment analysis and to establish the baseline result. Five different lexicon- and rule-based algorithms including Naivebayes, Stanza, Textblob, Vader, and Flair are employed and the experimental results show that Flair with an accuracy of 70% outperforms other tested algorithms.

classification, dataset, sentiment analysis, (16 more...)

arXiv.org Artificial Intelligence

2207.09163

Country:

Asia > Pakistan (0.14)
North America > United States > New York > New York County > New York City (0.04)
Europe > Norway (0.04)
(4 more...)

Genre: Research Report > New Finding (0.66)

Industry:

Health & Medicine (0.69)
Education > Educational Technology (0.68)
Information Technology (0.68)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.68)

Add feedback

Enhancing Collaborative Filtering Recommender with Prompt-Based Sentiment Analysis

Dang, Elliot, Hu, Zheyuan, Li, Tong

arXiv.org Artificial IntelligenceJul-19-2022

Collaborative Filtering(CF) recommender is a crucial application in the online market and ecommerce. However, CF recommender has been proven to suffer from persistent problems related to sparsity of the user rating that will further lead to a cold-start issue. Existing methods address the data sparsity issue by applying token-level sentiment analysis that translate text review into sentiment scores as a complement of the user rating. In this paper, we attempt to optimize the sentiment analysis with advanced NLP models including BERT and RoBERTa, and experiment on whether the CF recommender has been further enhanced. We build the recommenders on the Amazon US Reviews dataset, and tune the pretrained BERT and RoBERTa with the traditional fine-tuned paradigm as well as the new prompt-based learning paradigm. Experimental result shows that the recommender enhanced with the sentiment ratings predicted by the fine-tuned RoBERTa has the best performance, and achieved 30.7% overall gain by comparing MAP, NDCG and precision at K to the baseline recommender. Prompt-based learning paradigm, although superior to traditional fine-tune paradigm in pure sentiment analysis, fail to further improve the CF recommender.

dataset, recommender, sentiment analysis, (14 more...)

arXiv.org Artificial Intelligence

2207.12883

Country:

North America > United States > New York (0.05)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)

Genre: Research Report > New Finding (0.34)

Industry: Information Technology > Services (0.34)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
(2 more...)

Add feedback

Tools to Use When Building Sentiment Analyzer

#artificialintelligenceJul-18-2022, 23:40:30 GMT

Originally published on Towards AI the World's Leading AI and Technology News and Media Company. If you are building an AI-related product or service, we invite you to consider becoming an AI sponsor. At Towards AI, we help scale AI and technology startups. Let us help you unleash your technology to the masses. Sentiment Analysis is a powerful tool to use when trying to understand how to test your website.

application, building sentiment analyzer, sentiment analysis, (8 more...)

#artificialintelligence

Industry: Media (0.71)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.97)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.82)

Add feedback

STT: Soft Template Tuning for Few-Shot Adaptation

Yu, Ping, Wang, Wei, Li, Chunyuan, Zhang, Ruiyi, Jin, Zhanpeng, Chen, Changyou

arXiv.org Artificial IntelligenceJul-18-2022

Prompt tuning has been an extremely effective tool to adapt a pre-trained model to downstream tasks. However, standard prompt-based methods mainly consider the case of sufficient data of downstream tasks. It is still unclear whether the advantage can be transferred to the few-shot regime, where only limited data are available for each downstream task. Although some works have demonstrated the potential of prompt-tuning under the few-shot setting, the main stream methods via searching discrete prompts or tuning soft prompts with limited data are still very challenging. Through extensive empirical studies, we find that there is still a gap between prompt tuning and fully fine-tuning for few-shot learning. To bridge the gap, we propose a new prompt-tuning framework, called Soft Template Tuning (STT). STT combines manual and auto prompts, and treats downstream classification tasks as a masked language modeling task. Comprehensive evaluation on different settings suggests STT can close the gap between fine-tuning and prompt-based methods without introducing additional parameters. Significantly, it can even outperform the time- and resource-consuming fine-tuning method on sentiment classification tasks.

arxiv preprint arxiv, fine-tuning, language model, (13 more...)

arXiv.org Artificial Intelligence

2207.08408

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.34)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.34)
(2 more...)

Add feedback

Aspect-specific Context Modeling for Aspect-based Sentiment Analysis

Ma, Fang, Zhang, Chen, Zhang, Bo, Song, Dawei

arXiv.org Artificial IntelligenceJul-17-2022

Aspect-based sentiment analysis (ABSA) aims at predicting sentiment polarity (SC) or extracting opinion span (OE) expressed towards a given aspect. Previous work in ABSA mostly relies on rather complicated aspect-specific feature induction. Recently, pretrained language models (PLMs), e.g., BERT, have been used as context modeling layers to simplify the feature induction structures and achieve state-of-the-art performance. However, such PLM-based context modeling can be not that aspect-specific. Therefore, a key question is left under-explored: how the aspect-specific context can be better modeled through PLMs? To answer the question, we attempt to enhance aspect-specific context modeling with PLM in a non-intrusive manner. We propose three aspect-specific input transformations, namely aspect companion, aspect prompt, and aspect marker. Informed by these transformations, non-intrusive aspect-specific PLMs can be achieved to promote the PLM to pay more attention to the aspect-specific context in a sentence. Additionally, we craft an adversarial benchmark for ABSA (advABSA) to see how aspect-specific modeling can impact model robustness. Extensive experimental results on standard and adversarial benchmarks for SC and OE demonstrate the effectiveness and robustness of the proposed method, yielding new state-of-the-art performance on OE and competitive performance on SC.

context modeling, proceedings, transformation, (12 more...)

arXiv.org Artificial Intelligence

2207.08099

Country:

Asia > Myanmar > Tanintharyi Region > Dawei (0.05)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.87)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)

Add feedback

BiSyn-GAT+: Bi-Syntax Aware Graph Attention Network for Aspect-based Sentiment Analysis

Liang, Shuo, Wei, Wei, Mao, Xian-Ling, Wang, Fei, He, Zhiyong

arXiv.org Artificial IntelligenceJul-15-2022

Aspect-based sentiment analysis (ABSA) is a fine-grained sentiment analysis task that aims to align aspects and corresponding sentiments for aspect-specific sentiment polarity inference. It is challenging because a sentence may contain multiple aspects or complicated (e.g., conditional, coordinating, or adversative) relations. Recently, exploiting dependency syntax information with graph neural networks has been the most popular trend. Despite its success, methods that heavily rely on the dependency tree pose challenges in accurately modeling the alignment of the aspects and their words indicative of sentiment, since the dependency tree may provide noisy signals of unrelated associations (e.g., the "conj" relation between "great" and "dreadful" in Figure 2). In this paper, to alleviate this problem, we propose a Bi-Syntax aware Graph Attention Network (BiSyn-GAT+). Specifically, BiSyn-GAT+ fully exploits the syntax information (e.g., phrase segmentation and hierarchical structure) of the constituent tree of a sentence to model the sentiment-aware context of every single aspect (called intra-context) and the sentiment relations across aspects (called inter-context) for learning. Experiments on four benchmark datasets demonstrate that BiSyn-GAT+ outperforms the state-of-the-art methods consistently.

computational linguistic, information, relation, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.18653/v1/2022.findings-acl.144

2204.03117

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)
Asia > China > Hong Kong (0.04)
(14 more...)

Genre: Research Report (0.69)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Layout-Aware Information Extraction for Document-Grounded Dialogue: Dataset, Method and Demonstration

Zhang, Zhenyu, Yu, Bowen, Yu, Haiyang, Liu, Tingwen, Fu, Cheng, Li, Jingyang, Tang, Chengguang, Sun, Jian, Li, Yongbin

arXiv.org Artificial IntelligenceJul-14-2022

Building document-grounded dialogue systems have received growing interest as documents convey a wealth of human knowledge and commonly exist in enterprises. Wherein, how to comprehend and retrieve information from documents is a challenging research problem. Previous work ignores the visual property of documents and treats them as plain text, resulting in incomplete modality. In this paper, we propose a Layout-aware document-level Information Extraction dataset, LIE, to facilitate the study of extracting both structural and semantic knowledge from visually rich documents (VRDs), so as to generate accurate responses in dialogue systems. LIE contains 62k annotations of three extraction tasks from 4,061 pages in product and official documents, becoming the largest VRD-based information extraction dataset to the best of our knowledge. We also develop benchmark methods that extend the token-based language model to consider layout features like humans. Empirical results show that layout is critical for VRD-based extraction, and system demonstration also verifies that the extracted knowledge can help locate the answers that users care about.

artificial intelligence, extraction, natural language, (13 more...)

arXiv.org Artificial Intelligence

2207.06717

Country:

Europe > Portugal > Lisbon > Lisbon (0.05)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
North America > United States > New York > New York County > New York City (0.04)
Asia > China (0.04)

Genre: Research Report > New Finding (0.34)

Industry: Banking & Finance > Insurance (0.46)

Technology: Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)

Add feedback

MDEAW: A Multimodal Dataset for Emotion Analysis through EDA and PPG signals from wireless wearable low-cost off-the-shelf Devices

Nandi, Arijit, Xhafa, Fatos, Subirats, Laia, Fort, Santi

arXiv.org Artificial IntelligenceJul-14-2022

We present MDEAW, a multimodal database consisting of Electrodermal Activity (EDA) and Photoplethysmography (PPG) signals recorded during the exams for the course taught by the teacher at Eurecat Academy, Sabadell, Barcelona in order to elicit the emotional reactions to the students in a classroom scenario. Signals from 10 students were recorded along with the students' self-assessment of their affective state after each stimulus, in terms of 6 basic emotion states. All the signals were captured using portable, wearable, wireless, low-cost, and off-the-shelf equipment that has the potential to allow the use of affective computing methods in everyday applications. A baseline for student-wise affect recognition using EDA and PPG-based features, as well as their fusion, was established through ReMECS, Fed-ReMECS, and Fed-ReMECS-U. These results indicate the prospects of using low-cost devices for affective state recognition applications. The proposed database will be made publicly available in order to allow researchers to achieve a more thorough evaluation of the suitability of these capturing devices for emotion state recognition applications.

experiment, mdeaw dataset, student, (14 more...)

arXiv.org Artificial Intelligence

2207.0641

Country: Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.07)

Genre:

Research Report (0.64)
Instructional Material > Course Syllabus & Notes (0.49)

Industry:

Health & Medicine > Therapeutic Area > Hematology (0.92)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.92)
Health & Medicine > Diagnostic Medicine > Imaging (0.92)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology > Mental Health (0.35)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.41)
Information Technology > Artificial Intelligence > Cognitive Science > Emotion (0.36)

Add feedback

Developing a Component Comment Extractor from Product Reviews on E-Commerce Sites

Anda, Shogo, Kikuchi, Masato, Ozono, Tadachika

arXiv.org Artificial IntelligenceJul-13-2022

Consumers often read product reviews to inform their buying decision, as some consumers want to know a specific component of a product. However, because typical sentences on product reviews contain various details, users must identify sentences about components they want to know amongst the many reviews. Therefore, we aimed to develop a system that identifies and collects component and aspect information of products in sentences. Our BERT-based classifiers assign labels referring to components and aspects to sentences in reviews and extract sentences with comments on specific components and aspects. We determined proper labels based for the words identified through pattern matching from product reviews to create the training data. Because we could not use the words as labels, we carefully created labels covering the meanings of the words. However, the training data was imbalanced on component and aspect pairs. We introduced a data augmentation method using WordNet to reduce the bias. Our evaluation demonstrates that the system can determine labels for road bikes using pattern matching, covering more than 88\% of the indicators of components and aspects on e-commerce sites. Moreover, our data augmentation method can improve the-F1-measure on insufficient data from 0.66 to 0.76.

augmentation, data augmentation, training data, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/IIAI-AAI55812.2022.00026

2207.05979

Country: Asia > Japan > Honshū > Tōhoku (0.04)

Genre: Research Report (0.83)

Industry: Information Technology > Services > e-Commerce Services (0.62)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.55)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.46)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.35)

Add feedback

16 Open Source NLP Models for Sentiment Analysis; One Rises on Top

#artificialintelligenceJul-11-2022, 13:05:25 GMT

Sentiment analysis is a process of quantifying and categorizing opinions expressed in the written text; over the many years, I have also heard it labeled as opinion mining, emotion AI, and text analytics.

artificial intelligence, natural language, open source nlp model, (1 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)

Add feedback