Interacting


RoleAgent: Building, Interacting, and Benchmarking High-quality Role-Playing Agents from Scripts

Neural Information Processing Systems

Believable agents can empower interactive applications ranging from immersive environments to rehearsal spaces for interpersonal communication. Recently, generative agents have been proposed to simulate believable human behavior using Large Language Models. However, existing methods rely heavily on human-annotated agent profiles (e.g., name, age, personality, relationships with others) to initialize each agent, which cannot be scaled up easily. In this paper, we propose a scalable RoleAgent framework that generates high-quality role-playing agents from raw scripts and comprises building and interacting stages. Specifically, in the building stage, we use a hierarchical memory system to extract and summarize the structure and high-level information of each agent from the raw script.
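The building stage described above can be pictured as a two-level store: raw script excerpts accumulate at the bottom and are periodically condensed into high-level traits. The sketch below is a hypothetical illustration of that idea, not the paper's implementation; the class and method names (`HierarchicalMemory`, `observe`, `summarize`) are invented, and the summarizer callback stands in for an LLM call.

```python
from dataclasses import dataclass, field


@dataclass
class HierarchicalMemory:
    """Toy two-level memory: raw script excerpts at the bottom,
    summarized high-level traits at the top (names are hypothetical)."""
    raw_events: list[str] = field(default_factory=list)
    summaries: list[str] = field(default_factory=list)

    def observe(self, excerpt: str) -> None:
        # Bottom level: store a raw script line verbatim.
        self.raw_events.append(excerpt)

    def summarize(self, summarizer) -> None:
        # Top level: condense accumulated lines into one summary,
        # then clear the buffer. 'summarizer' stands in for an LLM call.
        self.summaries.append(summarizer(self.raw_events))
        self.raw_events.clear()


# Usage: build a profile from script lines without hand-written annotation.
mem = HierarchicalMemory()
mem.observe("JULIET: O Romeo, Romeo! wherefore art thou Romeo?")
mem.observe("JULIET: Deny thy father and refuse thy name.")
mem.summarize(lambda events: f"{len(events)} lines suggest devotion and conflict with family.")
print(mem.summaries[0])
```

The point of the hierarchy is that downstream role-play only ever reads the compact top level, so the raw script can be arbitrarily long.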


Interacting with AI Reasoning Models: Harnessing "Thoughts" for AI-Driven Software Engineering

Treude, Christoph, Kula, Raula Gaikovina

arXiv.org Artificial Intelligence

Recent advances in AI reasoning models provide unprecedented transparency into their decision-making processes, transforming them from traditional black-box systems into models that articulate step-by-step chains of thought rather than producing opaque outputs. This shift has the potential to improve software quality, explainability, and trust in AI-augmented development. However, software engineers rarely have the time or cognitive bandwidth to analyze, verify, and interpret every AI-generated thought in detail. Without an effective interface, this transparency could become a burden rather than a benefit. In this paper, we propose a vision for structuring the interaction between AI reasoning models and software engineers to maximize trust, efficiency, and decision-making power. We argue that simply exposing AI's reasoning is not enough -- software engineers need tools and frameworks that selectively highlight critical insights, filter out noise, and facilitate rapid validation of key assumptions. To illustrate this challenge, we present motivating examples in which AI reasoning models state their assumptions when deciding which external library to use and produce divergent reasoning paths and recommendations about security vulnerabilities, highlighting the need for an interface that prioritizes actionable insights while managing uncertainty and resolving conflicts. We then outline a research roadmap for integrating automated summarization, assumption validation, and multi-model conflict resolution into software engineering workflows. Achieving this vision will unlock the full potential of AI reasoning models to enable software engineers to make faster, more informed decisions without being overwhelmed by unnecessary detail.


A Paragraph is All It Takes: Rich Robot Behaviors from Interacting, Trusted LLMs

OpenMind, Zhong, Shaohong, Zhou, Adam, Chen, Boyuan, Luo, Homin, Liphardt, Jan

arXiv.org Artificial Intelligence

Large Language Models (LLMs) are compact representations of all public knowledge of our physical environment and animal and human behaviors. The application of LLMs to robotics may offer a path to highly capable robots that perform well across most human tasks with limited or even zero tuning. Aside from increasingly sophisticated reasoning and task planning, networks of (suitably designed) LLMs offer ease of upgrading capabilities and allow humans to directly observe the robot's thinking. Here we explore the advantages, limitations, and particularities of using LLMs to control physical robots. The basic system consists of four LLMs communicating via a human-language data bus implemented via web sockets and ROS2 message passing. Surprisingly, rich robot behaviors and good performance across different tasks could be achieved despite the robot's data fusion cycle running at only 1 Hz and the central data bus running at the extremely limited rate of the human brain, around 40 bits/s. The use of natural language for inter-LLM communication allowed the robot's reasoning and decision making to be directly observed by humans and made it trivial to bias the system's behavior with sets of rules written in plain English. These rules were immutably written into Ethereum, a global, public, and censorship-resistant Turing-complete computer. We suggest that by using natural language as the data bus among interacting AIs, and immutable public ledgers to store behavior constraints, it is possible to build robots that combine unexpectedly rich performance, upgradability, and durable alignment with humans.
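The human-language data bus described in the abstract can be sketched minimally as a shared queue of plain-English messages. This is an in-process stand-in for illustration only; the real system uses web sockets and ROS2, and the role names (`vision`, `planner`) and class name `LanguageBus` are assumptions, not taken from the paper.

```python
import queue


class LanguageBus:
    """Minimal in-process stand-in for the human-language data bus the
    abstract describes (the real system uses web sockets and ROS2)."""

    def __init__(self):
        self.topic = queue.Queue()

    def publish(self, sender: str, text: str) -> None:
        # Every message on the bus is a plain-English sentence.
        self.topic.put(f"{sender}: {text}")

    def drain(self) -> list[str]:
        # Collect all pending messages in arrival order.
        msgs = []
        while not self.topic.empty():
            msgs.append(self.topic.get())
        return msgs


# Two of the four hypothetical roles mirroring the multi-LLM setup.
bus = LanguageBus()
bus.publish("vision", "A person is waving near the doorway.")
bus.publish("planner", "Approach the doorway and greet the person.")
messages = bus.drain()
for line in messages:
    print(line)  # plain English, so humans can audit every exchange
```

Because the bus traffic is natural language, observing the robot's reasoning is as simple as reading the log, and biasing behavior means publishing an extra English rule onto the same channel.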


The Conversation is the Command: Interacting with Real-World Autonomous Robot Through Natural Language

Nwankwo, Linus, Rueckert, Elmar

arXiv.org Artificial Intelligence

In recent years, autonomous agents have proliferated in real-world environments such as our homes, offices, and public spaces. However, natural human-robot interaction remains a key challenge. In this paper, we introduce an approach that synergistically exploits the capabilities of large language models (LLMs) and multimodal vision-language models (VLMs) to enable humans to interact naturally with autonomous robots through conversational dialogue. We leveraged the LLMs to decode high-level natural language instructions from humans and abstract them into precise, robot-actionable commands or queries. Further, we utilised the VLMs to provide a visual and semantic understanding of the robot's task environment. Our results, with 99.13% command recognition accuracy and a 97.96% command execution success rate, show that our approach can enhance human-robot interaction in real-world applications. The video demonstrations of this paper can be found at https://osf.io/wzyf6 and the code is available at our GitHub repository (https://github.com/LinusNEP/TCC_IRoNL.git).
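The decode-and-abstract step above maps free-form speech to a structured command. As a rough sketch of that mapping, the toy function below uses keyword rules where the paper uses an LLM; the function name, command schema, and actions (`navigate`, `halt`, `clarify`) are all hypothetical.

```python
def decode_instruction(utterance: str) -> dict:
    """Hypothetical stand-in for the LLM step that maps free-form speech
    to a structured robot command (the paper uses an LLM, not rules)."""
    text = utterance.lower()
    if "go to" in text:
        # Extract the navigation target that follows "go to".
        target = text.split("go to", 1)[1].strip().rstrip(".")
        return {"action": "navigate", "target": target}
    if "stop" in text:
        return {"action": "halt"}
    # Fall back to asking the human to rephrase.
    return {"action": "clarify", "prompt": "Could you rephrase that?"}


print(decode_instruction("Please go to the kitchen."))
```

The structured dictionary is the kind of precise, robot-actionable representation that a downstream planner or ROS node can consume directly.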


AIxArtist: A First-Person Tale of Interacting with Artificial Intelligence to Escape Creative Block

Lewis, Makayla

arXiv.org Artificial Intelligence

The future of the arts and artificial intelligence (AI) is promising as technology advances. As the use of AI in design becomes more widespread, art practice may not remain a human-only art form and could instead become a digitally integrated experience. With enhanced creativity and collaboration, arts and AI could work together towards creating artistic outputs that are visually appealing and meet the needs of the artist and viewer. While it is uncertain how far the integration will go, arts and AI will likely influence one another. This workshop pictorial puts forward first-person research that shares interactions between an HCI researcher and AI as they try to escape a creative block. The pictorial explores two questions: How can AI support artists' creativity, and what does it mean to be explainable in this context? Two AIs, ChatGPT and Midjourney, were engaged; the result was a series of reflections that require further discussion and exploration in the XAIxArts community: transparency of attribution, the creation process, the ethics of asking, and inspiration versus copying.


Interacting with next-phrase suggestions: How suggestion systems aid and influence the cognitive processes of writing

Bhat, Advait, Agashe, Saaket, Mohile, Niharika, Oberoi, Parth, Jangir, Ravi, Joshi, Anirudha

arXiv.org Artificial Intelligence

Writing with next-phrase suggestions powered by large language models is becoming more pervasive by the day. However, research to understand writers' interaction and decision-making processes while engaging with such systems is still emerging. We conducted a qualitative study to shed light on writers' cognitive processes while writing with next-phrase suggestion systems. To do so, we recruited 14 amateur writers to write two reviews each, one without suggestions and one with suggestions. Additionally, we positively and negatively biased the suggestion system to obtain a diverse range of instances where writers' opinions and the bias in the language model align or misalign to varying degrees. We found that writers interact with next-phrase suggestions in complex ways: they abstracted and extracted multiple parts of the suggestions and incorporated them into their writing, even when they disagreed with the suggestion as a whole, while also evaluating the suggestions against various criteria. The suggestion system also affected the writing process in several ways, such as altering writers' usual plans and leading to higher levels of distraction. Based on our qualitative analysis using Hayes's cognitive process model of writing as a lens, we propose a theoretical model of 'writer-suggestion interaction' for writing with GPT-2 (and causal language models in general) on a movie review writing task, followed by directions for future research and design.


LAIDEN - Interacting with Robots and AI

#artificialintelligence

The perception of agency and the way we explain an agent's behavior play a role in the interaction with agentic things, like robotic objects and intelligent playthings. Children easily imbue robotic objects and playthings with agency and explain their behavior in psychological terms, attributing intelligence based on their perception of agency. However, human-robot interaction researchers and practitioners often do not address in the design process how children make sense of a robot and its agency, or how children explain a robot's behavior and intelligence. In this talk, I will shed light on these phenomena and their implications for child-robot interactions. I will argue that responsible human-centred design should play a more prominent role in the human-robot interaction design cycle, so that we understand how children perceive a robot's agency and explain a robot's behavior.


Learning to Prove Theorems via Interacting with Proof Assistants

Yang, Kaiyu, Deng, Jia

arXiv.org Machine Learning

Humans prove theorems by relying on substantial high-level reasoning and problem-specific insights. Proof assistants offer a formalism that resembles human mathematical reasoning, representing theorems in higher-order logic and proofs as high-level tactics. However, human experts have to construct proofs manually by entering tactics into the proof assistant. In this paper, we study the problem of using machine learning to automate the interaction with proof assistants. We construct CoqGym, a large-scale dataset and learning environment containing 71K human-written proofs from 123 projects developed with the Coq proof assistant. We develop ASTactic, a deep learning-based model that generates tactics as programs in the form of abstract syntax trees (ASTs). Experiments show that ASTactic trained on CoqGym can generate effective tactics and can be used to prove new theorems not previously provable by automated methods. Code is available at https://github.com/princeton-vl/CoqGym.
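The abstract notes that ASTactic generates tactics as programs in the form of abstract syntax trees. To make that representation concrete, here is a toy AST node type that renders a small tree into tactic text; this is an illustration only, not CoqGym or ASTactic code, and the `Tactic`/`render` names are invented.

```python
from dataclasses import dataclass, field


@dataclass
class Tactic:
    """Toy abstract-syntax-tree node for a proof tactic, echoing how
    ASTactic emits tactics as ASTs (an illustration, not CoqGym code)."""
    name: str
    args: list["Tactic"] = field(default_factory=list)

    def render(self) -> str:
        # Leaf nodes render as their name; internal nodes render
        # as the head symbol followed by rendered arguments.
        if not self.args:
            return self.name
        return f"{self.name} {' '.join(a.render() for a in self.args)}"


# A generated tree that renders to the Coq tactic text "apply conj".
t = Tactic("apply", [Tactic("conj")])
print(t.render())
```

Generating trees rather than raw strings lets a model guarantee syntactic well-formedness by construction, which is one reason tactic generation is framed over ASTs.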


Interacting with this therapy bot can help children with autism perfect their social skills

#artificialintelligence

A child with autism spectrum disorder (ASD) might have trouble communicating verbally, paying attention to others, or controlling their stress and anxiety. These difficulties can affect the child's social life and their success in school. Now, a team of researchers from robotics startup LuxAI have created QTrobot, a bot designed to help children with autism learn valuable social skills. They plan to present the results of a QTrobot study at RO-MAN 2018, a symposium on robot and human interactive communication, on August 28. QTrobot is just over two feet tall, with a humanoid body and a screen where a person's face would be.


What Interacting With Robots Might Reveal About Human Nature

The Atlantic - Technology

Robot panic seems to move in cycles, as new innovations in technology drive fear about machines that will take over our jobs, our lives, and our society, only to collapse as it becomes clear just how far away such omnipotent robots are. Today's robots can barely walk effectively, much less conquer civilization. But that doesn't mean there aren't good reasons to be nervous. The more pressing problem today is not what robots can do to our bodies and livelihoods, but what they will do to our brains. "The problem is not that if we teach robots to kick they'll kick our ass," Kate Darling, an MIT robot ethicist, said Thursday at the Aspen Ideas Festival, which is co-hosted by the Aspen Institute and The Atlantic.