Discourse & Dialogue
Non-negative matrix factorization algorithms greatly improve topic model fits
Carbonetto, Peter, Sarkar, Abhishek, Wang, Zihao, Stephens, Matthew
We report on the potential for using algorithms for non-negative matrix factorization (NMF) to improve parameter estimation in topic models. While several papers have studied connections between NMF and topic models, none have suggested leveraging these connections to develop new algorithms for fitting topic models. Importantly, NMF avoids the "sum-to-one" constraints on the topic model parameters, resulting in an optimization problem with simpler structure and more efficient computations. Building on recent advances in optimization algorithms for NMF, we show that first solving the NMF problem then recovering the topic model fit can produce remarkably better fits, and in less time, than standard algorithms for topic models. While we focus primarily on maximum likelihood estimation, we show that this approach also has the potential to improve variational inference for topic models. Our methods are implemented in the R package fastTopics.
Multi-turn Dialog System on Single-turn Data in Medical Domain
Sorathiya, Nazib, Lin, Chuan-An, Xiong, Daniel Chen Daniel, Zin, Scott, Zhang, Yi, Yang, He Sarina, Huang, Sharon Xiaolei
Recently there has been a huge interest in dialog systems. This interest has also been developed in the field of the medical domain where researchers are focusing on building a dialog system in the medical domain. This research is focused on the multi-turn dialog system trained on the multi-turn dialog data. It is difficult to gather a huge amount of multi-turn conversational data in the medical domain that is verified by professionals and can be trusted. However, there are several frequently asked questions (FAQs) or single-turn QA pairs that have information that is verified by the experts and can be used to build a multi-turn dialog system.
VISITRON: Visual Semantics-Aligned Interactively Trained Object-Navigator
Shrivastava, Ayush, Gopalakrishnan, Karthik, Liu, Yang, Piramuthu, Robinson, Tür, Gokhan, Parikh, Devi, Hakkani-Tür, Dilek
Interactive robots navigating photo-realistic environments face challenges underlying vision-and-language navigation (VLN), but in addition, they need to be trained to handle the dynamic nature of dialogue. However, research in Cooperative Vision-and-Dialog Navigation (CVDN), where a navigator interacts with a guide in natural language in order to reach a goal, treats the dialogue history as a VLN-style static instruction. In this paper, we present VISITRON, a navigator better suited to the interactive regime inherent to CVDN by being trained to: i) identify and associate object-level concepts and semantics between the environment and dialogue history, ii) identify when to interact vs. navigate via imitation learning of a binary classification head. We perform extensive ablations with VISITRON to gain empirical insights and improve performance on CVDN. VISITRON is competitive with models on the static CVDN leaderboard. We also propose a generalized interactive regime to fine-tune and evaluate VISITRON and future such models with pre-trained guides for adaptability.
Augment Your Small Dataset Using Transformers and Synonym Replacement for Sentiment Analysis-- Part…
Its uniqueness lies in its'self-supervised', pre-training objective architecture. Unlike other models that infer on the meaning of a sentence by extracting small parts of it, Pegasus completely'masks' the sentence and tries to find it by reading the text before and after it. Pegasus is really good at data summarization, but it is also great at paraphrasing sentences. The model is extremely easy to use, doesn't require many dependencies and with just a few lines of code we'll have our augmented dataset ready for training. To be able to leverage our small dataset efficiently, we will be performing text Paraphrasing along with Synonym Replacement to come up with a dataset large and unique enough to train our Sentiment Analysis model with.
Variational Gaussian Topic Model with Invertible Neural Projections
Wang, Rui, Zhou, Deyu, Xiong, Yuxuan, Huang, Haiping
Neural topic models have triggered a surge of interest in extracting topics from text automatically since they avoid the sophisticated derivations in conventional topic models. However, scarce neural topic models incorporate the word relatedness information captured in word embedding into the modeling process. To address this issue, we propose a novel topic modeling approach, called Variational Gaussian Topic Model (VaGTM). Based on the variational auto-encoder, the proposed VaGTM models each topic with a multivariate Gaussian in decoder to incorporate word relatedness. Furthermore, to address the limitation that pre-trained word embeddings of topic-associated words do not follow a multivariate Gaussian, Variational Gaussian Topic Model with Invertible neural Projections (VaGTM-IP) is extended from VaGTM. Three benchmark text corpora are used in experiments to verify the effectiveness of VaGTM and VaGTM-IP. The experimental results show that VaGTM and VaGTM-IP outperform several competitive baselines and obtain more coherent topics.
Researchers develop artificial intelligence that can detect sarcasm in social media
Social media has become a dominant form of communication for individuals, and for companies looking to market and sell their products and services. Properly understanding and responding to customer feedback on Twitter, Facebook and other social media platforms is critical for success, but it is incredibly labor intensive. That's where sentiment analysis comes in. The term refers to the automated process of identifying the emotion -- either positive, negative or neutral -- associated with text. While artificial intelligence refers to logical data analysis and response, sentiment analysis is akin to correctly identifying emotional communication.
Top 10 Machine Learning Applications 2021
Artificial Intelligence and Machine Learning is now considered to be one of the biggest innovations . AI and ML used to be a fanciful concept from science fiction, but now it's becoming a daily reality. The growth of less expensive and more powerful processing, The nearly limitless quantity of available data and affordable data storage has propelled the growth of Machine Learning. Now, before we get into the applications, Let's start with the basic intro to Machine Learning. Machine learning is a branch of AI focused on building applications that improve automatically through experience and by the use of data.
Artificial intelligence with a knack for sarcasm! - TechStory
Although sentiment analysis is an effective process that helps in a proper understanding of a text, a major roadblock to this was the presence of sarcasm in the text. Sarcasm is a hard nut to crack even in normal human communication, thus it can only be imagined what a predicament it might pose to a computer program to do the same. Since it poses a hurdle for the accuracy of sentiment analysis, experts began working on a suitable solution that could address this problem. One of the major challenges that come with the identification of sarcasm in the text is the lack of vocal tones and facial expressions. Thus, identifying sarcasm in the text becomes a task that is performed with a blindfold, making it quite hard.
Retrieval-Free Knowledge-Grounded Dialogue Response Generation with Adapters
Xu, Yan, Ishii, Etsuko, Liu, Zihan, Winata, Genta Indra, Su, Dan, Madotto, Andrea, Fung, Pascale
To diversify and enrich generated dialogue responses, knowledge-grounded dialogue has been investigated in recent years. Despite the success of the existing methods, they mainly follow the paradigm of retrieving the relevant sentences over a large corpus and augment the dialogues with explicit extra information, which is time- and resource-consuming. In this paper, we propose KnowExpert, an end-to-end framework to bypass the retrieval process by injecting prior knowledge into the pre-trained language models with lightweight adapters. To the best of our knowledge, this is the first attempt to tackle this task relying solely on a generation-based approach. Experimental results show that KnowExpert performs comparably with the retrieval-based baselines, demonstrating the potential of our proposed direction.
NLP for Climate Policy: Creating a Knowledge Platform for Holistic and Effective Climate Action
Swarnakar, Pradip, Modi, Ashutosh
Climate change is a burning issue of our time, with the Sustainable Development Goal (SDG) 13 of the United Nations demanding global climate action. Realizing the urgency, in 2015 in Paris, world leaders signed an agreement committing to taking voluntary action to reduce carbon emissions. However, the scale, magnitude, and climate action processes vary globally, especially between developed and developing countries. Therefore, from parliament to social media, the debates and discussions on climate change gather data from wide-ranging sources essential to the policy design and implementation. The downside is that we do not currently have the mechanisms to pool the worldwide dispersed knowledge emerging from the structured and unstructured data sources. The paper thematically discusses how NLP techniques could be employed in climate policy research and contribute to society's good at large. In particular, we exemplify symbiosis of NLP and Climate Policy Research via four methodologies. The first one deals with the major topics related to climate policy using automated content analysis. We investigate the opinions (sentiments) of major actors' narratives towards climate policy in the second methodology. The third technique explores the climate actors' beliefs towards pro or anti-climate orientation. Finally, we discuss developing a Climate Knowledge Graph. The present theme paper further argues that creating a knowledge platform would help in the formulation of a holistic climate policy and effective climate action. Such a knowledge platform would integrate the policy actors' varied opinions from different social sectors like government, business, civil society, and the scientific community. The research outcome will add value to effective climate action because policymakers can make informed decisions by looking at the diverse public opinion on a comprehensive platform.