Goto

Collaborating Authors

 wizard


A Customer Journey in the Land of Oz: Leveraging the Wizard of Oz Technique to Model Emotions in Customer Service Interactions

Labat, Sofie, Demeester, Thomas, Hoste, Véronique

arXiv.org Artificial Intelligence

Emotion-aware customer service needs in-domain conversational data, rich annotations, and predictive capabilities, but existing resources for emotion recognition are often out-of-domain, narrowly labeled, and focused on post-hoc detection. To address this, we conducted a controlled Wizard of Oz (WOZ) experiment to elicit interactions with targeted affective trajectories. The resulting corpus, EmoWOZ-CS, contains 2,148 bilingual (Dutch-English) written dialogues from 179 participants across commercial aviation, e-commerce, online travel agencies, and telecommunication scenarios. Our contributions are threefold: (1) Evaluate WOZ-based operator-steered valence trajectories as a design for emotion research; (2) Quantify human annotation performance and variation, including divergences between self-reports and third-party judgments; (3) Benchmark detection and forward-looking emotion inference in real-time support. Findings show neutral dominates participant messages; desire and gratitude are the most frequent non-neutral emotions. Agreement is moderate for multilabel emotions and valence, lower for arousal and dominance; self-reports diverge notably from third-party labels, aligning most for neutral, gratitude, and anger. Objective strategies often elicit neutrality or gratitude, while suboptimal strategies increase anger, annoyance, disappointment, desire, and confusion. Some affective strategies (cheerfulness, gratitude) foster positive reciprocity, whereas others (apology, empathy) can also leave desire, anger, or annoyance. Temporal analysis confirms successful conversation-level steering toward prescribed trajectories, most distinctly for negative targets; positive and neutral targets yield similar final valence distributions. Benchmarks highlight the difficulty of forward-looking emotion inference from prior turns, underscoring the complexity of proactive emotion-aware support.



Zero-Resource Knowledge-Grounded Dialogue Generation Linxiao Li Peking University

Neural Information Processing Systems

To this end, we propose representing the knowledge that bridges a context and a response and the way that the knowledge is expressed as latent variables, and devise a variational approach that can effectively estimate a generation model from a dialogue corpus and a knowledge corpus that are independent with each other.


Efficient Tuning of Large Language Models for Knowledge-Grounded Dialogue Generation

Zhang, Bo, Ma, Hui, Li, Dailin, Ding, Jian, Wang, Jian, Xu, Bo, Lin, HongFei

arXiv.org Artificial Intelligence

Large language models (LLMs) demonstrate remarkable text comprehension and generation capabilities but often lack the ability to utilize up-to-date or domain-specific knowledge not included in their training data. To address this gap, we introduce KEDiT, an efficient method for fine-tuning LLMs for knowledge-grounded dialogue generation. KEDiT operates in two main phases: first, it employs an information bottleneck to compress retrieved knowledge into learnable parameters, retaining essential information while minimizing computational overhead. Second, a lightweight knowledge-aware adapter integrates these compressed knowledge vectors into the LLM during fine-tuning, updating less than 2\% of the model parameters. The experimental results on the Wizard of Wikipedia and a newly constructed PubMed-Dialog dataset demonstrate that KEDiT excels in generating contextually relevant and informative responses, outperforming competitive baselines in automatic, LLM-based, and human evaluations. This approach effectively combines the strengths of pretrained LLMs with the adaptability needed for incorporating dynamic knowledge, presenting a scalable solution for fields such as medicine.


SRWToolkit: An Open Source Wizard of Oz Toolkit to Create Social Robotic Avatars

Nilgar, Atikkhan Faridkhan, Van Laerhoven, Kristof, Kinoti, Ayub

arXiv.org Artificial Intelligence

We present SR WToolkit, an open-source Wizard of Oz toolkit designed to facilitate the rapid prototyping of social robotic avatars powered by local large language models (LLMs). Our web-based toolkit enables multimodal interaction through text input, button-activated speech, and wake-word command. The toolkit offers real-time configuration of avatar appearance, behavior, language, and voice via an intuitive control panel. In contrast to prior works that rely on cloud-based LLMs services, SRWToolkit emphasizes modularity and ensures on-device functionality through local LLM inference. In our small-scale user study, [n = 11] participants created and interacted with diverse robotic roles (hospital receptionist, mathematics teacher, and driving assistant), which demonstrated positive outcomes in the toolkit's usability, trust, and user experience. The toolkit enables rapid and efficient development of robot characters customized to researchers' needs, supporting scalable research in human-robot interaction.


CARIS: A Context-Adaptable Robot Interface System for Personalized and Scalable Human-Robot Interaction

Arias-Russi, Felipe, Bai, Yuanchen, Taylor, Angelique

arXiv.org Artificial Intelligence

-- The human-robot interaction (HRI) field has traditionally used Wizard-of-Oz (WoZ) controlled robots to explore navigation, conversational dynamics, human-in-the-loop interactions, and more to explore appropriate robot behaviors in everyday settings. However, existing WoZ tools are often limited to one context, making them less adaptable across different settings, users, and robotic platforms. T o mitigate these issues, we introduce a Context-Adaptable Robot Interface System (CARIS) that combines advanced robotic capabilities such teleoperation, human perception, human-robot dialogue, and multimodal data recording. Through pilot studies, we demonstrate the potential of CARIS to WoZ control a robot in two contexts: 1) mental health companion and as a 2) tour guide. Furthermore, we identified areas of improvement for CARIS, including smoother integration between movement and communication, clearer functionality separation, recommended prompts, and one-click communication options to enhance the usability wizard control of CARIS. This project offers a publicly available, context-adaptable tool for the HRI community, enabling researchers to streamline data-driven approaches to intelligent robot behavior . The human-robot interaction (HRI) field has long explored intelligent systems that complement human skills in both social and task-oriented contexts [1]. Social robots must seamlessly coordinate perception, conversation, navigation, and other high-level functions to engage naturally with humans. This is a demanding task that requires specialized frameworks, diverse toolsets, and depends on robust communication protocols to enable efficient, real-time exchanges between the robot control system and people.


CyberLink Photo Director 365 review: Smarter photo editing with AI

PCWorld

The new AI in CyberLink Photo Director 365 helps achieve results faster, with tools and wizards supporting design drafts. CyberLink's photo editing software has already impressed users in the past with its ease of use and strong results. The manufacturer offers many wizards, straightforward functions, and the option to edit images manually in an editor. Various special functions and templates support the user in their work. The development team has now added extensive artificial intelligence (AI) capabilities to the feature list.


The Much-Hyped New em Wizard of Oz /em Is an Atrocity

Slate

Although it is, at least according to the Library of Congress, the most-watched movie of all time, The Wizard of Oz was a costly failure at the box office, and only became a perennial favorite thanks to the regular TV airings that began in the 1950s. But in the decades since it's become a metonym for the wonder of the big screen, a movie even people who prefer their content streaming will make the effort to see in a movie theater. Beginning on Labor Day weekend, audiences will get to experience the movie on perhaps the largest screen ever created. But it won't be The Wizard of Oz as we've come to know it for the better part of a century. The version of the movie that will fill Las Vegas' Sphere starting Aug. 28 has been retooled to fit the venue's curved shell, its images enhanced and expanded to fill four football fields' worth of 16K LED screens--the foundation of an immersive presentation that also includes flames, gusts of wind, and inflatable flying monkeys piloted by drone. It is, to quote the title of a CBS news report, "The Wizard of Oz as you've never seen it before."


Set the Stage: Enabling Storytelling with Multiple Robots through Roleplaying Metaphors

Maria, Tyrone Justin Sta, Griffin, Faith, Deja, Jordan Aiko

arXiv.org Artificial Intelligence

Gestures are an expressive input modality for controlling multiple robots, but their use is often limited by rigid mappings and recognition constraints. To move beyond these limitations, we propose roleplaying metaphors as a scaffold for designing richer interactions. By introducing three roles: Director, Puppeteer, and Wizard, we demonstrate how narrative framing can guide the creation of diverse gesture sets and interaction styles. These roles enable a variety of scenarios, showing how roleplay can unlock new possibilities for multi-robot systems. Our approach emphasizes creativity, expressiveness, and intuitiveness as key elements for future human-robot interaction design.


How Stylistic Similarity Shapes Preferences in Dialogue Dataset with User and Third Party Evaluations

Numaya, Ikumi, Moriya, Shoji, Sato, Shiki, Akama, Reina, Suzuki, Jun

arXiv.org Artificial Intelligence

Recent advancements in dialogue generation have broadened the scope of human-bot interactions, enabling not only contextually appropriate responses but also the analysis of human affect and sensitivity. While prior work has suggested that stylistic similarity between user and system may enhance user impressions, the distinction between subjective and objective similarity is often overlooked. To investigate this issue, we introduce a novel dataset that includes users' preferences, subjective stylistic similarity based on users' own perceptions, and objective stylistic similarity annotated by third party evaluators in open-domain dialogue settings. Analysis using the constructed dataset reveals a strong positive correlation between subjective stylistic similarity and user preference. Furthermore, our analysis suggests an important finding: users' subjective stylistic similarity differs from third party objective similarity. This underscores the importance of distinguishing between subjective and objective evaluations and understanding the distinct aspects each captures when analyzing the relationship between stylistic similarity and user preferences. The dataset presented in this paper is available online.