Jurgens, David
SemEval 2023 Task 9: Multilingual Tweet Intimacy Analysis
Pei, Jiaxin, Silva, Vítor, Bos, Maarten, Liu, Yozon, Neves, Leonardo, Jurgens, David, Barbieri, Francesco
R model trained over the twitter dataset (XLM-T) performs the best on 7 languages. While the Intimacy has long been viewed as a primary dimension pre-trained language models are able to achieve of human relationships and interpersonal promising performance, zero-shot prediction of unseen interactions (Maslow, 1981; Sullivan, 2013; Prager, languages remains challenging especially for 1995). Existing studies suggest that intimacy is an Korean and Hindi.
Audrey: A Personalized Open-Domain Conversational Bot
Hong, Chung Hoon, Liang, Yuan, Roy, Sagnik Sinha, Jain, Arushi, Agarwal, Vihang, Draves, Ryan, Zhou, Zhizhuo, Chen, William, Liu, Yujian, Miracky, Martha, Ge, Lily, Banovic, Nikola, Jurgens, David
Conversational Intelligence requires that a person engage on informational, personal and relational levels. Advances in Natural Language Understanding have helped recent chatbots succeed at dialog on the informational level. However, current techniques still lag for conversing with humans on a personal level and fully relating to them. The University of Michigan's submission to the Alexa Prize Grand Challenge 3, Audrey, is an open-domain conversational chat-bot that aims to engage customers on these levels through interest driven conversations guided by customers' personalities and emotions. Audrey is built from socially-aware models such as Emotion Detection and a Personal Understanding Module to grasp a deeper understanding of users' interests and desires. Our architecture interacts with customers using a hybrid approach balanced between knowledge-driven response generators and context-driven neural response generators to cater to all three levels of conversations. During the semi-finals period, we achieved an average cumulative rating of 3.25 on a 1-5 Likert scale.
Everyone's Invited: A New Paradigm for Evaluation on Non-Transferable Datasets
Jurgens, David (McGill University) | Finethy, Tyler (McGill University) | Armstrong, Caitrin (McGill University) | Ruths, Derek (McGill University)
Social media data mining and analytics has stimulated a wide array of computational research. Traditionally, individual researchers are responsible for acquiring and managing their own datasets. However, the temporal nature of social data, the challenges involved in correctly preparing a dataset, the sheer scale of many datasets, and the proprietary nature of many data sources can make extending and comparing computational methods difficult and often impossible. In light of this, because replicability is a fundamental pillar of the scientific process and because method comparison is essential to characterizing computational advancements, we require an alternative to the traditional model of researcher-owned datasets. In this paper we propose FREESR, a framework that gives researchers the ability to develop and test method performance without requiring direct access to “shared” datasets. As a case study and first community resource, we have implemented the FREESR paradigm around the task of Tweet geolocation. The implementation showcases the clear suitability of this framework for the social media research context. Beyond the implementation, we see the FREESR paradigm as being an important step towards making study reproducibility and method comparison more principled and ubiquitous in the social media research community.
Temporal Motifs Reveal the Dynamics of Editor Interactions in Wikipedia
Jurgens, David (University of California, Los Angeles and HRL Laboratories, LLC) | Lu, Tsai-Ching (HRL Laboratories, LLC)
Wikipedia is a collaborative setting with both combative and cooperative editing. We propose a new method for investigating the types of editor interactions using a novel representation of Wikipedia's revision history as a temporal, bipartite network with multiple node and edge types for users and revisions. From this representation we identify significant author interactions as network motifs and show how the motif types capture important, diverse editing behaviors. Two experiments demonstrate the further benefit of motifs. First, we demonstrate significant performance improvement over a purely revision-based analysis in classifying pages as combative or cooperative page by using motifs; and second we use motifs as a basis for analyzing trends in the dynamics of editor behavior to explain Wikipedia's content growth.