Information Extraction
Enhance Multi-domain Sentiment Analysis of Review Texts through Prompting Strategies
Large Language Models (LLMs) have made significant strides in both scientific research and practical applications. Existing studies have demonstrated the state-of-the-art (SOTA) performance of LLMs in various natural language processing tasks. However, the question of how to further enhance LLMs' performance in specific task using prompting strategies remains a pivotal concern. This paper explores the enhancement of LLMs' performance in sentiment analysis through the application of prompting strategies. We formulate the process of prompting for sentiment analysis tasks and introduce two novel strategies tailored for sentiment analysis: RolePlaying (RP) prompting and Chain-of-thought (CoT) prompting. Specifically, we also propose the RP-CoT prompting strategy which is a combination of RP prompting and CoT prompting. We conduct comparative experiments on three distinct domain datasets to evaluate the effectiveness of the proposed sentiment analysis strategies. The results demonstrate that the adoption of the proposed prompting strategies leads to a increasing enhancement in sentiment analysis accuracy. Further, the CoT prompting strategy exhibits a notable impact on implicit sentiment analysis, with the RP-CoT prompting strategy delivering the most superior performance among all strategies.
UMIE: Unified Multimodal Information Extraction with Instruction Tuning
Sun, Lin, Zhang, Kai, Li, Qingyuan, Lou, Renze
Multimodal information extraction (MIE) gains significant attention as the popularity of multimedia content increases. However, current MIE methods often resort to using task-specific model structures, which results in limited generalizability across tasks and underutilizes shared knowledge across MIE tasks. To address these issues, we propose UMIE, a unified multimodal information extractor to unify three MIE tasks as a generation problem using instruction tuning, being able to effectively extract both textual and visual mentions. Extensive experiments show that our single UMIE outperforms various state-of-the-art (SoTA) methods across six MIE datasets on three tasks. Furthermore, in-depth analysis demonstrates UMIE's strong generalization in the zero-shot setting, robustness to instruction variants, and interpretability. Our research serves as an initial step towards a unified MIE model and initiates the exploration into both instruction tuning and large language models within the MIE domain. Our code, data, and model are available at https://github.com/ZUCC-AI/UMIE
DocGraphLM: Documental Graph Language Model for Information Extraction
Wang, Dongsheng, Ma, Zhiqiang, Nourbakhsh, Armineh, Gu, Kang, Shah, Sameena
Advances in Visually Rich Document Understanding (VrDU) have Information extraction from visually-rich documents (VrDs), such enabled information extraction and question answering over documents as business forms, receipts, and invoices in the format of PDF or with complex layouts. Two tropes of architectures have image has gained recent traction. Tasks such as field identification emerged--transformer-based models inspired by LLMs, and Graph and extraction and entity linkage are crucial to digitizing VrDs Neural Networks. In this paper, we introduce DocGraphLM, a novel and building information retrieval systems on the data. Tasks that framework that combines pre-trained language models with graph require complex reasoning such as Visual Question Answering semantics. To achieve this, we propose 1) a joint encoder architecture over documents require modeling the spatial, visual, and semantic to represent documents, and 2) a novel link prediction approach signals in VrDs. Therefore, VrD Understanding is concerned with to reconstruct document graphs. DocGraphLM predicts both directions modeling the multi-modal content in image documents. Previous and distances between nodes using a convergent joint loss research has explored the use of encoding text, layout, and image function that prioritizes neighborhood restoration and downweighs features in a layout language model or multi-modal setting to improve distant node detection.
Survey on Publicly Available Sinhala Natural Language Processing Tools and Research
Sinhala is the native language of the Sinhalese people who make up the largest ethnic group of Sri Lanka. The language belongs to the globe-spanning language tree, Indo-European. However, due to poverty in both linguistic and economic capital, Sinhala, in the perspective of Natural Language Processing tools and research, remains a resource-poor language which has neither the economic drive its cousin English has nor the sheer push of the law of numbers a language such as Chinese has. A number of research groups from Sri Lanka have noticed this dearth and the resultant dire need for proper tools and research for Sinhala natural language processing. However, due to various reasons, these attempts seem to lack coordination and awareness of each other. The objective of this paper is to fill that gap of a comprehensive literature survey of the publicly available Sinhala natural language tools and research so that the researchers working in this field can better utilize contributions of their peers. As such, we shall be uploading this paper to arXiv and perpetually update it periodically to reflect the advances made in the field.
Unveiling Comparative Sentiments in Vietnamese Product Reviews: A Sequential Classification Framework
Le, Ha, Tran, Bao, Le, Phuong, Nguyen, Tan, Nguyen, Dac, Pham, Ngoan, Huynh, Dang
Comparative opinion mining is a specialized field of sentiment analysis that aims to identify and extract sentiments expressed comparatively. To address this task, we propose an approach that consists of solving three sequential sub-tasks: (i) identifying comparative sentence, i.e., if a sentence has a comparative meaning, (ii) extracting comparative elements, i.e., what are comparison subjects, objects, aspects, predicates, and (iii) classifying comparison types which contribute to a deeper comprehension of user sentiments in Vietnamese product reviews. Our method is ranked fifth at the Vietnamese Language and Speech Processing (VLSP) 2023 challenge on Comparative Opinion Mining (ComOM) from Vietnamese Product Reviews.
Real-Time Online Stock Forecasting Utilizing Integrated Quantitative and Qualitative Analysis
Bathini, Sai Akash, Cihan, Dagli
The application of Machine learning to finance has become a familiar approach, even more so in stock market forecasting. The stock market is highly volatile, and huge amounts of data are generated every minute globally. The extraction of effective intelligence from this data is of critical importance. However, a collaboration of numerical stock data with qualitative text data can be a challenging task. In this work, we accomplish this by providing an unprecedented, publicly available dataset with technical and fundamental data and sentiment that we gathered from news archives, TV news captions, radio transcripts, tweets, daily financial newspapers, etc. The text data entries used for sentiment extraction total more than 1.4 Million. The dataset consists of daily entries from January 2018 to December 2022 for eight companies representing diverse industrial sectors and the Dow Jones Industrial Average (DJIA) as a whole. Holistic Fundamental and Technical data is provided training ready for Model learning and deployment. Most importantly, the data generated could be used for incremental online learning with real-time data points retrieved daily since no stagnant data was utilized. All the data was retired from APIs or self-designed robust information retrieval technologies with extremely low latency and zero monetary cost. These adaptable technologies facilitate data extraction for any stock. Moreover, the utilization of Spearman's rank correlation over real-time data, linking stock returns with sentiment analysis has produced noteworthy results for the DJIA and the eight other stocks, achieving accuracy levels surpassing 60%. The dataset is made available at https://github.com/batking24/Huge-Stock-Dataset.
Large language model for Bible sentiment analysis: Sermon on the Mount
Vora, Mahek, Blau, Tom, Kachhwal, Vansh, Solo, Ashu M. G., Chandra, Rohitash
The revolution of natural language processing via large language models has motivated its use in multidisciplinary areas that include social sciences and humanities and more specifically, comparative religion. Sentiment analysis provides a mechanism to study the emotions expressed in text. Recently, sentiment analysis has been used to study and compare translations of the Bhagavad Gita, which is a fundamental and sacred Hindu text. In this study, we use sentiment analysis for studying selected chapters of the Bible. These chapters are known as the Sermon on the Mount. We utilize a pre-trained language model for sentiment analysis by reviewing five translations of the Sermon on the Mount, which include the King James version, the New International Version, the New Revised Standard Version, the Lamsa Version, and the Basic English Version. We provide a chapter-by-chapter and verse-by-verse comparison using sentiment and semantic analysis and review the major sentiments expressed. Our results highlight the varying sentiments across the chapters and verses. We found that the vocabulary of the respective translations is significantly different. We detected different levels of humour, optimism, and empathy in the respective chapters that were used by Jesus to deliver his message.
TikTok's data collection being scrutinised by Australia's privacy watchdog
Australia's privacy watchdog has launched an inquiry into how TikTok harvests personal data and whether it is being done with consent. The Office of the Australian Information Commissioner (OAIC) will examine whether the social media platform has breached the online privacy of Australians through the use of marketing pixels, which track people's online habits. This can include where they shop, how long they stay on websites and personal information, such as email addresses and mobile phone numbers. Liberal senator James Paterson, who has been campaigning against TikTok and its parent company, ByteDance, has alleged the social media platform is using pixels to collect information of non-TikTok users. "This conduct would be unacceptable from any company but is particularly alarming given TikTok is beholden to the Chinese Communist party and is required under China's intelligence laws to share information with Chinese government intelligence agencies," Paterson said.
Multimodal Sentiment Analysis with Missing Modality: A Knowledge-Transfer Approach
Liu, Weide, Zhan, Huijing, Chen, Hao, Lv, Fengmao
Previous research studies [11, 12] have attempted to address the issue of missing modalities in multimodal sentiment Multimodal sentiment analysis aims to identify the emotions analysis. In particular, Tsai et al. [12] proposed a joint expressed by individuals through visual, language, and generative-discriminative objective to obtain a robust multimodal acoustic cues. However, most of the existing research efforts representation and a surrogate inference model for assume that all modalities are available during both missing modalities. Pham et al. [11] developed a multimodal training and testing, making their algorithms susceptible to translation network with a cyclic translation loss for forward the missing modality scenario. In this paper, we propose a adaptation between source and target modalities. However, novel knowledge-transfer network to translate between different the performances of their approaches degrade when complete modalities to reconstruct the missing audio modalities.
Hiding in Plain Sight: Towards the Science of Linguistic Steganography
Raj-Sankar, Leela, Rajagopalan, S. Raj
Covert communication (also known as steganography) is the practice of concealing a secret inside an innocuous-looking public object (cover) so that the modified public object (covert code) makes sense to everyone but only someone who knows the code can extract the secret (message). Linguistic steganography is the practice of encoding a secret message in natural language text such as spoken conversation or short public communications such as tweets.. While ad hoc methods for covert communications in specific domains exist ( JPEG images, Chinese poetry, etc), there is no general model for linguistic steganography specifically. We present a novel mathematical formalism for creating linguistic steganographic codes, with three parameters: Decodability (probability that the receiver of the coded message will decode the cover correctly), density (frequency of code words in a cover code), and detectability (probability that an attacker can tell the difference between an untampered cover compared to its steganized version). Verbal or linguistic steganography is most challenging because of its lack of artifacts to hide the secret message in. We detail a practical construction in Python of a steganographic code for Tweets using inserted words to encode hidden digits while using n-gram frequency distortion as the measure of detectability of the insertions. Using the publicly accessible Stanford Sentiment Analysis dataset we implemented the tweet steganization scheme -- a codeword (an existing word in the data set) inserted in random positions in random existing tweets to find the tweet that has the least possible n-gram distortion. We argue that this approximates KL distance in a localized manner at low cost and thus we get a linguistic steganography scheme that is both formal and practical and permits a tradeoff between codeword density and detectability of the covert message.