AITopics

2509.0145

Country: North America > United States > Massachusetts (0.28)

Genre:

Questionnaire & Opinion Survey (1.00)
Research Report > New Finding (0.93)

Industry: Health & Medicine > Therapeutic Area > Psychiatry/Psychology > Mental Health (0.48)

Technology:

Information Technology > Artificial Intelligence > Robots > Humanoid Robots (0.36)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.34)

arXiv.org Artificial IntelligenceAug-28-2025

InquireMobile: Teaching VLM-based Mobile Agent to Request Human Assistance via Reinforcement Fine-Tuning

Ai, Qihang, Bu, Pi, Cao, Yue, Wang, Yingyao, Gu, Jihao, Xing, Jingxuan, Zhu, Zekun, Jiang, Wei, Zheng, Zhicheng, Song, Jun, Jiang, Yuning, Zheng, Bo

Recent advances in Vision-Language Models (VLMs) have enabled mobile agents to perceive and interact with real-world mobile environments based on human instructions. However, the current fully autonomous paradigm poses potential safety risks when model understanding or reasoning capabilities are insufficient. To address this challenge, we first introduce \textbf{InquireBench}, a comprehensive benchmark specifically designed to evaluate mobile agents' capabilities in safe interaction and proactive inquiry with users, encompassing 5 categories and 22 sub-categories, where most existing VLM-based agents demonstrate near-zero performance. In this paper, we aim to develop an interactive system that actively seeks human confirmation at critical decision points. To achieve this, we propose \textbf{InquireMobile}, a novel model inspired by reinforcement learning, featuring a two-stage training strategy and an interactive pre-action reasoning mechanism. Finally, our model achieves an 46.8% improvement in inquiry success rate and the best overall success rate among existing baselines on InquireBench. We will open-source all datasets, models, and evaluation codes to facilitate development in both academia and industry.

arxiv preprint arxiv, large language model, natural language, (18 more...)

2508.19679

Genre: Research Report (1.00)

Industry: Information Technology (0.93)

Technology:

Information Technology > Communications > Mobile (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.94)

He, Zhanpeng, Cao, Yifeng, Ciocarlie, Matei

Uncertainty Comes for Free: Human-in-the-Loop Policies with Diffusion Models

arXiv.org Artificial IntelligenceFeb-26-2025

Human-in-the-loop (HitL) robot deployment has gained significant attention in both academia and industry as a semi-autonomous paradigm that enables human operators to intervene and adjust robot behaviors at deployment time, improving success rates. However, continuous human monitoring and intervention can be highly labor-intensive and impractical when deploying a large number of robots. To address this limitation, we propose a method that allows diffusion policies to actively seek human assistance only when necessary, reducing reliance on constant human oversight. To achieve this, we leverage the generative process of diffusion policies to compute an uncertainty-based metric based on which the autonomous agent can decide to request operator assistance at deployment time, without requiring any operator interaction during training. Additionally, we show that the same method can be used for efficient data collection for fine-tuning diffusion policies in order to improve their autonomous performance. Experimental results from simulated and real-world environments demonstrate that our approach enhances policy performance during deployment for a variety of scenarios.

assistance, deployment, diffusion policy, (13 more...)

2503.01876

Country: Europe > Netherlands > South Holland > Delft (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.95)

arXiv.org Artificial IntelligenceSep-18-2024

HARP: Human-Assisted Regrouping with Permutation Invariant Critic for Multi-Agent Reinforcement Learning

Hu, Huawen, Shi, Enze, Yue, Chenxi, Yang, Shuocun, Wu, Zihao, Li, Yiwei, Zhong, Tianyang, Zhang, Tuo, Liu, Tianming, Zhang, Shu

Human-in-the-loop reinforcement learning integrates human expertise to accelerate agent learning and provide critical guidance and feedback in complex fields. However, many existing approaches focus on single-agent tasks and require continuous human involvement during the training process, significantly increasing the human workload and limiting scalability. In this paper, we propose HARP (Human-Assisted Regrouping with Permutation Invariant Critic), a multi-agent reinforcement learning framework designed for group-oriented tasks. HARP integrates automatic agent regrouping with strategic human assistance during deployment, enabling and allowing non-experts to offer effective guidance with minimal intervention. During training, agents dynamically adjust their groupings to optimize collaborative task completion. When deployed, they actively seek human assistance and utilize the Permutation Invariant Group Critic to evaluate and refine human-proposed groupings, allowing non-expert users to contribute valuable suggestions. In multiple collaboration scenarios, our approach is able to leverage limited guidance from non-experts and enhance performance. The project can be found at https://github.com/huawen-hu/HARP.

artificial intelligence, machine learning, reinforcement learning, (14 more...)

2409.11741

Country:

North America > United States > Georgia > Clarke County > Athens (0.14)
Asia > Middle East > Republic of Türkiye > Karaman Province > Karaman (0.04)
Asia > China > Shaanxi Province > Xi'an (0.04)

Genre: Research Report (0.82)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Wakabayashi, Shumpei, Kawaharazuka, Kento, Okada, Kei, Inaba, Masayuki

Behavioral Learning of Dish Rinsing and Scrubbing based on Interruptive Direct Teaching Considering Assistance Rate

arXiv.org Artificial IntelligenceSep-3-2024

Robots are expected to manipulate objects in a safe and dexterous way. For example, washing dishes is a dexterous operation that involves scrubbing the dishes with a sponge and rinsing them with water. It is necessary to learn it safely without splashing water and without dropping the dishes. In this study, we propose a safe and dexterous manipulation system. The robot learns a dynamics model of the object by estimating the state of the object and the robot itself, the control input, and the amount of human assistance required (assistance rate) after the human corrects the initial trajectory of the robot's hands by interruptive direct teaching. By backpropagating the error between the estimated and the reference value using the acquired dynamics model, the robot can generate a control input that approaches the reference value, for example, so that human assistance is not required and the dish does not move excessively. This allows for adaptive rinsing and scrubbing of dishes with unknown shapes and properties. As a result, it is possible to generate safe actions that require less human assistance.

assistance, human assistance, robot, (17 more...)

doi: 10.1080/01691864.2024.2379393

2408.0936

Country: Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)

Genre: Research Report (0.85)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

#artificialintelligenceMar-29-2023, 05:10:51 GMT

End to End Chatbot using Python

A chatbot is a computer program that understands the intent of your query to answer with a solution. Chatbots are the most popular applications of Natural Language Processing in the industry. So, if you want to build an end-to-end chatbot, this article is for you. In this article, I will take you through how to create an end-to-end chatbot using Python. An end-to-end chatbot refers to a chatbot that can handle a complete conversation from start to finish without requiring human assistance.

chatbot, end-to-end chatbot, python, (9 more...)

Technology: Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)

#artificialintelligenceNov-5-2022, 07:55:40 GMT

Will cars ever be fully autonomous?

Self-driving cars or autonomous vehicles are classified into various levels based on the level of automation built into them. Instead of a self-driving car, why not take the bus, you might ask. As you likely know, automated connected systems are no longer restricted to factories. They continue to percolate and expand in the daily thoroughfare of our lives. Gone are the days when owning and driving a car was a matter of privilege afforded by a select few.

automation, autonomous driving, vehicle, (16 more...)

Country: Asia > Japan > Kyūshū & Okinawa > Kyūshū > Kagoshima Prefecture > Kagoshima (0.05)

Industry:

Transportation > Ground > Road (1.00)
Automobiles & Trucks (1.00)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)

#artificialintelligenceMay-24-2022, 23:43:32 GMT

CNN for Autonomous Driving

We set an early stopping rule so that the model stops training when validation loss stops improving, and we specify a batch size of 16 to determine how many image are passed through the network at a time. We use the flow_from_directory() function to pull the training and validation images that were generated in the preprocessing step by the ImageDataGenerator function from the training and validation file paths respectively.

cnn, pedestrian, vehicle, (16 more...)

Industry: Transportation > Ground > Road (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.32)

#artificialintelligenceJan-14-2022, 15:04:54 GMT

Top 20 Must-Know Vital Chatbot Statistics 2022

The chatbot revolution is upon us, and it's changing the way we interact with companies and brands. In fact, Servion predicts that, by 2025, AI will power 95% of all customer interactions. It's only a matter of time before the majority of people use chatbots for their business and social needs. The statistics show that it's not just a fad, but a serious business opportunity that's already being adopted by the biggest names in the industry. "I think chatbots and voicebots may become the future of commerce, as it relates to Gen Z." -- Tiffany Zhong, Founder & CEO of Zebra Intelligence Are you aware of these crucial chatbot statistics to have a better understanding of their future?

chatbot, must-know vital chatbot statistics 2022, stat, (11 more...)

Country: North America > United States (0.05)

Genre: Research Report > New Finding (0.49)

Industry:

Retail (0.73)
Banking & Finance (0.49)

Technology: Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)

De, Abir, Okati, Nastaran, Zarezade, Ali, Gomez-Rodriguez, Manuel

Classification Under Human Assistance

arXiv.org Machine LearningJun-21-2020

Most supervised learning models are trained for full automation. However, their predictions are sometimes worse than those by human experts on some specific instances. Motivated by this empirical observation, our goal is to design classifiers that are optimized to operate under different automation levels. More specifically, we focus on convex margin-based classifiers and first show that the problem is NP-hard. Then, we further show that, for support vector machines, the corresponding objective function can be expressed as the difference of two functions f = g - c, where g is monotone, non-negative and {\gamma}-weakly submodular, and c is non-negative and modular. This representation allows a recently introduced deterministic greedy algorithm, as well as a more efficient randomized variant of the algorithm, to enjoy approximation guarantees at solving the problem. Experiments on synthetic and real-world data from several applications in medical diagnosis illustrate our theoretical findings and demonstrate that, under human assistance, supervised learning models trained to operate under different automation levels can outperform those trained for full automation as well as humans operating alone.

artificial intelligence, machine learning, support vector machine, (15 more...)

arXiv.org Machine Learning

2006.11845

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.64)

Industry:

Health & Medicine > Therapeutic Area (0.67)
Health & Medicine > Diagnostic Medicine (0.66)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.89)