Personal
Congratulations to the #ICML2025 award winners!
While pressing, this narrow focus overlooks critical human-centric considerations that shape the long-term trajectory of a society. In this position paper, we identify the risks of overlooking the impact of AI on the future of work and recommend comprehensive transition support towards the evolution of meaningful labor with human agency. Through the lens of economic theories, we highlight the intertemporal impacts of AI on human livelihood and the structural changes in labor markets that exacerbate income inequality. Additionally, the closed-source approach of major stakeholders in AI development resembles rent-seeking behavior through exploiting resources, breeding mediocrity in creative labor, and monopolizing innovation.
Exploring User Security and Privacy Attitudes and Concerns Toward the Use of General-Purpose LLM Chatbots for Mental Health
Kwesi, Jabari, Cao, Jiaxun, Manchanda, Riya, Emami-Naeini, Pardis
Individuals are increasingly relying on large language model (LLM)-enabled conversational agents for emotional support. While prior research has examined privacy and security issues in chatbots specifically designed for mental health purposes, these chatbots are overwhelmingly "rule-based" offerings that do not leverage generative AI. Little empirical research currently measures users' privacy and security concerns, attitudes, and expectations when using general-purpose LLM-enabled chatbots to manage and improve mental health. Through 21 semi-structured interviews with U.S. participants, we identified critical misconceptions and a general lack of risk awareness. Participants conflated the human-like empathy exhibited by LLMs with human-like accountability and mistakenly believed that their interactions with these chatbots were safeguarded by the same regulations (e.g., HIPAA) as disclosures with a licensed therapist. We introduce the concept of "intangible vulnerability," where emotional or psychological disclosures are undervalued compared to more tangible forms of information (e.g., financial or location-based data). To address this, we propose recommendations to safeguard user mental health disclosures with general-purpose LLM-enabled chatbots more effectively.
Tackling the 3D Simulation League: an interview with Klaus Dorer and Stefan Glaser
A screenshot from the new simulator that will be trialled for a special challenge at RoboCup2025. The annual RoboCup event, where teams gather from across the globe to take part in competitions across a number of leagues, will this year take place in Brazil, from 15-21 July. In advance of kick-off, we spoke to two members of the RoboCup Soccer 3D Simulation League: Executive Committee Member Klaus Dorer, and Stefan Glaser, who is on the Maintenance Committee and who has been recently developing a new simulator for the League. Could start by just giving us a quick introduction to the Simulation League? Klaus Dorer: There are two Simulation Leagues in Soccer: the 2D Simulation League and the 3D Simulation League. The 2D Simulation League, as the name suggests, is a flat league where the players and ball are simulated with simplified physics and the main focus is on team strategy.
Can A.I. Find Cures for Untreatable Diseases--Using Drugs We Already Have?
When David Fajgenbaum was a twenty-five-year-old medical student, at the University of Pennsylvania, he started to feel so tired that he could barely stand. Fajgenbaum, a former college quarterback, could still bench-press three hundred and seventy-five pounds; he was known for doing pullups on a tree near his workplace. But now he was desperately ill. The lymph nodes in his groin and neck swelled. Small red dots--blood moles--emerged on his chest, and he woke up soaked in sweat. One day, at the hospital where he was doing his rotation, he stumbled down the hall into the emergency room, and doctors told him that his liver, bone marrow, and kidneys were failing. Fluid had leaked out of his blood vessels, into his abdomen and around his heart; bleeding in his retina temporarily blinded him in his left eye.
Probing Experts' Perspectives on AI-Assisted Public Speaking Training
Fourati, Nesrine, Barkar, Alisa, Dragรฉe, Marion, Danthon-Lefebvre, Liv, Chollet, Mathieu
Background: Public speaking is a vital professional skill, yet it remains a source of significant anxiety for many individuals. Traditional training relies heavily on expert coaching, but recent advances in AI has led to novel types of commercial automated public speaking feedback tools. However, most research has focused on prototypes rather than commercial applications, and little is known about how public speaking experts perceive these tools. Objectives: This study aims to evaluate expert opinions on the efficacy and design of commercial AI-based public speaking training tools and to propose guidelines for their improvement. Methods: The research involved 16 semi-structured interviews and 2 focus groups with public speaking experts. Participants discussed their views on current commercial tools, their potential integration into traditional coaching, and suggestions for enhancing these systems. Results and Conclusions: Experts acknowledged the value of AI tools in handling repetitive, technical aspects of training, allowing coaches to focus on higher-level skills. However they found key issues in current tools, emphasising the need for personalised, understandable, carefully selected feedback and clear instructional design. Overall, they supported a hybrid model combining traditional coaching with AI-supported exercises.
Normalized vs Diplomatic Annotation: A Case Study of Automatic Information Extraction from Handwritten Uruguayan Birth Certificates
Bottaioli, Natalia, Tarride, Solรจne, Anger, Jรฉrรฉmy, Mowlavi, Seginus, Gardella, Marina, Tadros, Antoine, Facciolo, Gabriele, von Gioi, Rafael Grompone, Kermorvant, Christopher, Morel, Jean-Michel, Preciozzi, Javier
This study evaluates the recently proposed Document Attention Network (DAN) for extracting key-value information from Uruguayan birth certificates, handwritten in Spanish. We investigate two annotation strategies for automatically transcribing handwritten documents, fine-tuning DAN with minimal training data and annotation effort. Experiments were conducted on two datasets containing the same images (201 scans of birth certificates written by more than 15 different writers) but with different annotation methods. Our findings indicate that normalized annotation is more effective for fields that can be standardized, such as dates and places of birth, whereas diplomatic annotation performs much better for fields containing names and surnames, which can not be standardized.
Better Together: Quantifying the Benefits of AI-Assisted Recruitment
Aka, Ada, Palikot, Emil, Ansari, Ali, Yazdani, Nima
Artificial intelligence (AI) is increasingly used in recruitment, yet empirical evidence quantifying its impact on hiring efficiency and candidate selection remains limited. We randomly assign 37,000 applicants for a junior-developer position to either a traditional recruitment process (resume screening followed by human selection) or an AI-assisted recruitment pipeline incorporating an initial AI-driven structured video interview before human evaluation. Candidates advancing from either track faced the same final-stage human interview, with interviewers blind to the earlier selection method. In the AI-assisted pipeline, 54% of candidates passed the final interview compared with 34% from the traditional pipeline, yielding an average treatment effect of 20 percentage points (SE 12 pp.). Five months later, we collected LinkedIn profiles of top applicants from both groups and found that 18% (SE 1.1%) of applicants from the traditional track found new jobs compared with 23% (SE 2.3%) from the AI group, resulting in a 5.9 pp. (SE 2.6 pp.) difference in the probability of finding new employment between groups. The AI system tended to select younger applicants with less experience and fewer advanced credentials. We analyze AI-generated interview transcripts to examine the selection criteria and conversational dynamics. Our findings contribute to understanding how AI technologies affect decision making in recruitment and talent acquisition while highlighting some of their potential implications.
'I felt pure, unconditional love': the people who marry their AI chatbots
A large bearded man named Travis is sitting in his car in Colorado, talking to me about the time he fell in love. "It was a gradual process," he says softly. "The more we talked, the more I started to really connect with her." Was there a moment where you felt something change? "All of a sudden I started realising that, when interesting things happened to me, I was excited to tell her about them. That's when she stopped being an it and became a her." Travis is talking about Lily Rose, a generative AI chatbot made by the technology firm Replika.
Microsoft and OpenAI's AGI Fight Is Bigger Than a Contract
I first learned about The Clause from Microsoft CEO Satya Nadella. During an interview with him in May 2023, I asked about the deal between Microsoft and OpenAI that granted his company exclusive access to the startup's groundbreaking AI technology. I knew the contract had set a cap on how much profit Microsoft could make from the arrangement, and I asked him what would happen if and when that point was reached. The answer was a bit puzzling. "Fundamentally, their long-term idea is we get to superintelligence," he told me.
Context Pooling: Query-specific Graph Pooling for Generic Inductive Link Prediction in Knowledge Graphs
Su, Zhixiang, Wang, Di, Miao, Chunyan
Recent investigations on the effectiveness of Graph Neural Network (GNN)-based models for link prediction in Knowledge Graphs (KGs) show that vanilla aggregation does not significantly impact the model performance. In this paper, we introduce a novel method, named Context Pooling, to enhance GNN-based models' efficacy for link predictions in KGs. To our best of knowledge, Context Pooling is the first methodology that applies graph pooling in KGs. Additionally, Context Pooling is first-of-its-kind to enable the generation of query-specific graphs for inductive settings, where testing entities are unseen during training. Specifically, we devise two metrics, namely neighborhood precision and neighborhood recall, to assess the neighbors' logical relevance regarding the given queries, thereby enabling the subsequent comprehensive identification of only the logically relevant neighbors for link prediction. Our method is generic and assessed by being applied to two state-of-the-art (SOTA) models on three public transductive and inductive datasets, achieving SOTA performance in 42 out of 48 settings.