Personal
Threefold model for AI Readiness: A Case Study with Finnish Healthcare SMEs
Alnajjar, Mohammed, Alnajjar, Khalid, Hämäläinen, Mika
This study examines AI adoption among Finnish healthcare SMEs through semi-structured interviews with six health-tech companies. We identify three AI engagement categories: AI-curious (exploring AI), AI-embracing (integrating AI), and AI-catering (providing AI solutions). Our proposed threefold model highlights key adoption barriers, including regulatory complexities, technical expertise gaps, and financial constraints. While SMEs recognize AI's potential, most remain in early adoption stages. We provide actionable recommendations to accelerate AI integration, focusing on regulatory reforms, talent development, and inter-company collaboration, offering valuable insights for healthcare organizations, policymakers, and researchers.
AI Rivalry as a Craft: How Resisting and Embracing Generative AI Reshape Writing Professions
Varanasi, Rama Adithya, Wiesenfeld, Batia Mishan, Nov, Oded
Generative AI (GAI) technologies are disrupting professional writing, challenging traditional practices. Recent studies explore GAI adoption experiences of creative practitioners, but we know little about how these experiences evolve into established practices and how GAI resistance alters these practices. To address this gap, we conducted 25 semi-structured interviews with writing professionals who adopted and/or resisted GAI. Using the theoretical lens of Job Crafting, we identify four strategies professionals employ to reshape their roles. Writing professionals employed GAI resisting strategies to maximize human potential, reinforce professional identity, carve out a professional niche, and preserve credibility within their networks. In contrast, GAI-enabled strategies allowed writers who embraced GAI to enhance desirable workflows, minimize mundane tasks, and engage in new AI-managerial labor. These strategies amplified their collaborations with GAI while reducing their reliance on other people. We conclude by discussing implications of GAI practices on writers' identity and practices as well as crafting theory.
PromptMap: An Alternative Interaction Style for AI-Based Image Generation
Adamkiewicz, Krzysztof, Woźniak, Paweł W., Dominiak, Julia, Romanowski, Andrzej, Karolus, Jakob, Frolov, Stanislav
Recent technological advances popularized the use of image generation among the general public. Crafting effective prompts can, however, be difficult for novice users. To tackle this challenge, we developed PromptMap, a new interaction style for text-to-image AI that allows users to freely explore a vast collection of synthetic prompts through a map-like view with semantic zoom. PromptMap groups images visually by their semantic similarity, allowing users to discover relevant examples. We evaluated PromptMap in a between-subject online study ($n=60$) and a qualitative within-subject study ($n=12$). We found that PromptMap supported users in crafting prompts by providing them with examples. We also demonstrated the feasibility of using LLMs to create vast example collections. Our work contributes a new interaction style that supports users unfamiliar with prompting in achieving a satisfactory image output.
Interview with Tunazzina Islam: Understand microtargeting and activity patterns on social media
In this interview series, we're meeting some of the AAAI/SIGAI Doctoral Consortium participants to find out more about their research. The Doctoral Consortium provides an opportunity for a group of PhD students to discuss and explore their research interests and career objectives in an interdisciplinary workshop together with a panel of established researchers. In the third of our interviews with the 2025 cohort, we heard from Tunazzina Islam who has recently completed her PhD in Computer Science at Purdue University, advised by Dr Dan Goldwasser. Her primary research interests lie in computational social science (CSS), natural language processing (NLP), and social media mining and analysis. We now live in a world where we can reach people directly through social media, without relying on traditional media such as television and radio.
Stakeholder Perspectives on Whether and How Social Robots Can Support Mediation and Advocacy for Higher Education Students with Disabilities
Markelius, Alva, Bailey, Julie, Gibson, Jenny L., Gunes, Hatice
Existing power dynamics, social injustices and structural barriers may exacerbate challenges related to support and advocacy, limiting some students' ability to articulate their needs effectively [59]. This disparity highlights an increasing need for alternative approaches to student advocacy that may empower students with disabilities in ways that current practices may not. While human disability support practitioners can play a crucial role in bridging gaps between students and institutions, these efforts are resource-intensive, relying on trained personnel, availability, and sustained institutional commitment. This study explores the feasibility and ethical implications of employing artificial intelligence (AI) and in particular social robots as tools for mediation and advocacy for disabled students in higher education. While the overarching focus regards social robots and LLMs, the study adopts a broader perspective of understanding the use of technology and AI in general for disabled students, to draw insights and identify patterns that can inform the design, implementation, and ethical considerations of AI-driven assistive technologies.
Telephone Surveys Meet Conversational AI: Evaluating a LLM-Based Telephone Survey System at Scale
Telephone surveys remain a valuable tool for gathering insights but typically require substantial resources in training and coordinating human interviewers. This work presents an AI-driven telephone survey system integrating text-to-speech (TTS), a large language model (LLM), and speech-to-text (STT) that mimics the versatility of human-led interviews (full-duplex dialogues) at scale. We tested the system across two populations, a pilot study in the United States (n = 75) and a large-scale deployment in Peru (n = 2,739), inviting participants via web-based links and contacting them via direct phone calls. The AI agent successfully administered open-ended and closed-ended questions, handled basic clarifications, and dynamically navigated branching logic, allowing fast large-scale survey deployment without interviewer recruitment or training. Our findings demonstrate that while the AI system's probing for qualitative depth was more limited than human interviewers, overall data quality approached human-led standards for structured items. This study represents one of the first successful large-scale deployments of an LLM-based telephone interviewer in a real-world survey context. The AI-powered telephone survey system has the potential for expanding scalable, consistent data collecting across market research, social science, and public opinion studies, thus improving operational efficiency while maintaining appropriate data quality for research.
HalluVerse25: Fine-grained Multilingual Benchmark Dataset for LLM Hallucinations
Abdaljalil, Samir, Kurban, Hasan, Serpedin, Erchin
Large Language Models (LLMs) are increasingly used in various contexts, yet remain prone to generating non-factual content, commonly referred to as "hallucinations". The literature categorizes hallucinations into several types, including entity-level, relation-level, and sentence-level hallucinations. However, existing hallucination datasets often fail to capture fine-grained hallucinations in multilingual settings. In this work, we introduce HalluVerse25, a multilingual LLM hallucination dataset that categorizes fine-grained hallucinations in English, Arabic, and Turkish. Our dataset construction pipeline uses an LLM to inject hallucinations into factual biographical sentences, followed by a rigorous human annotation process to ensure data quality. We evaluate several LLMs on HalluVerse25, providing valuable insights into how proprietary models perform in detecting LLM-generated hallucinations across different contexts.
Revisiting Noise in Natural Language Processing for Computational Social Science
Computational Social Science (CSS) is an emerging field driven by the unprecedented availability of human-generated content for researchers. This field, however, presents a unique set of challenges due to the nature of the theories and datasets it explores, including highly subjective tasks and complex, unstructured textual corpora. Among these challenges, one of the less well-studied topics is the pervasive presence of noise. This thesis aims to address this gap in the literature by presenting a series of interconnected case studies that examine different manifestations of noise in CSS. These include character-level errors following the OCR processing of historical records, archaic language, inconsistencies in annotations for subjective and ambiguous tasks, and even noise and biases introduced by large language models during content generation. This thesis challenges the conventional notion that noise in CSS is inherently harmful or useless. Rather, it argues that certain forms of noise can encode meaningful information that is invaluable for advancing CSS research, such as the unique communication styles of individuals or the culture-dependent nature of datasets and tasks. Further, this thesis highlights the importance of nuance in dealing with noise and the considerations CSS researchers must address when encountering it, demonstrating that different types of noise require distinct strategies.
The potential role of AI agents in transforming nuclear medicine research and cancer management in India
Vashistha, Rajat, Gulzar, Arif, Kundu, Parveen, Sharma, Punit, Brunstein, Mark, Vegh, Viktor
India faces a significant cancer burden, with an incidence - to - mortality ratio indicating that nearly three out of five individuals diagnosed with cancer succumb to the disease. While the limitations of physical healthcare infrastructure are widely acknowledged as a primary challenge, concerted efforts by government and healthcare agencies are underway to mitigate these constraints. However, given the country's vast geography and high population density, it is imperative to explore alternative soft infrastructure solutions to complement existing frameworks . Artificial Intelligence agents are increasingly transforming problem - solving approaches across various domains, with their application in medicine proving particularly transformative. In this perspective, we examine the potential role of AI agents in advancing nuclear medicine fo r cancer research, diagnosis, and management in India. We begin with a brief overview of AI agents and their capabilities, followed by a proposed agent - based ecosystem that can address prevailing sustainability challenges in India's nuclear medicine. Keywords: AI Agents; cancer; nuclear medicine ecosystem; sustainability challenges 1. Introduction India's with population of 1.4 billion faces a significant cancer burden, with ~1.5 million new cases and ~850,000 deaths annually [1] [2] . With an i ncidence - to - m ortality p ercentage of approximately 64.8%, nearly three out of five individuals diagnosed with cancer are expected to succumb to the disease [2] . Projections indicate that mortality rates will rise significantly, increasing from 64.7% to 109.6% between 2022 and 2050, largely due to demographic shifts as the reproductive - age population transitions into middle and old age. This growing cancer burden will place even more pressure on the already overburdened healthcare system, making it essential to address the gaps in both infrastructure and indigenous research and innovations to ensure timely and effective patient treatment [3] . This trend underscores the urgent need for a resilient, patient - centred framework that integrates medical advancements, early detection through diagnostics, timely therapeutic interventions, and equitable access to care. Nuclear medicine uses a small amount of targeted radioactive material to diagnose and treat cancer [4] .
Dr Genre: Reinforcement Learning from Decoupled LLM Feedback for Generic Text Rewriting
Li, Yufei, Nham, John, Jawahar, Ganesh, Shu, Lei, Uthus, David, Sung, Yun-Hsuan, Yang, Chengrun, Rolnick, Itai, Qiao, Yi, Liu, Cong
Generic text rewriting is a prevalent large language model (LLM) application that covers diverse real-world tasks, such as style transfer, fact correction, and email editing. These tasks vary in rewriting objectives (e.g., factual consistency vs. semantic preservation), making it challenging to develop a unified model that excels across all dimensions. Existing methods often specialize in either a single task or a specific objective, limiting their generalizability. In this work, we introduce a generic model proficient in factuality, stylistic, and conversational rewriting tasks. To simulate real-world user rewrite requests, we construct a conversational rewrite dataset, ChatRewrite, that presents ``natural''-sounding instructions, from raw emails using LLMs. Combined with other popular rewrite datasets, including LongFact for the factuality rewrite task and RewriteLM for the stylistic rewrite task, this forms a broad benchmark for training and evaluating generic rewrite models. To align with task-specific objectives, we propose Dr Genre, a Decoupled-reward learning framework for Generic rewriting, that utilizes objective-oriented reward models with a task-specific weighting. Evaluation shows that \approach delivers higher-quality rewrites across all targeted tasks, improving objectives including instruction following (agreement), internal consistency (coherence), and minimal unnecessary edits (conciseness).