AITopics | Manipulation

Collaborating Authors

Manipulation

News Overviews Instructional Materials AI-Alerts Classics

Wristband enables wearers to control a robotic hand with their own movements

RobohubJul-13-2026, 09:38:45 GMT

The next time you're scrolling your phone, take a moment to appreciate the feat: The seemingly mundane act is possible thanks to the coordination of 34 muscles, 27 joints, and over 100 tendons and ligaments in your hand. Indeed, our hands are the most nimble parts of our bodies. Mimicking their many nuanced gestures has been a longstanding challenge in robotics and virtual reality. Now, MIT engineers have designed an ultrasound wristband that precisely tracks a wearer's hand movements in real-time. The wristband produces ultrasound images of the wrist's muscles, tendons, and ligaments as the hand moves, and is paired with an artificial intelligence algorithm that continuously translates the images into the corresponding positions of the five fingers and palm.

artificial intelligence, human computer interaction, wristband, (18 more...)

Robohub

Country: North America > United States > California (0.15)

Industry:

Government > Regional Government > North America Government > United States Government (0.48)
Leisure & Entertainment > Sports (0.48)

Technology:

Information Technology > Artificial Intelligence > Robots > Robots in the Workplace (0.42)
Information Technology > Artificial Intelligence > Robots > Manipulation (0.42)
Information Technology > Human Computer Interaction > Interfaces > Virtual Reality (0.35)

Add feedback

China wants to solve the hardest problem in robotics – making hands

The GuardianJul-6-2026, 00:31:18 GMT

Race to develop'embodied AI' focuses on creating dextrous hands to transform humanoid robots from gimmicks into useful products Human hands - nimble, nerve-filled appendages that are the most flexible part of the human skeleton - are exceptionally complex. Many tasks that most people can do largely without thinking, from tying a pair of shoelaces to buttoning up a shirt, in fact require a complex set of neurological instructions and precise choreography. In thousands of years of human history, no machine has been able to truly replicate human's greatest tool. But now, as artificial intelligence (AI) races forwards, some companies think they are close to surpassing this final but most difficult hurdle in robotics. Most of them are in China . A new suite of Chinese start-ups are leveraging China's advantages in manufacturing and enthusiasm for what the government calls "embodied AI" to build the fully dextrous robotic hands that are needed to transform humanoid robots from dancing gimmicks into useful products.

artificial intelligence, china, robot, (13 more...)

The Guardian

Country: Asia > China (1.00)

Industry:

Leisure & Entertainment > Sports (1.00)
Government > Regional Government (0.70)

Technology:

Information Technology > Artificial Intelligence > Robots > Robots in the Workplace (0.75)
Information Technology > Artificial Intelligence > Robots > Manipulation (0.65)

Add feedback

Robot Talk Episode 162 – The robot doctor will see you now

RobohubJun-26-2026, 11:22:29 GMT

Since the first robot-assisted surgery was performed, over 40 years ago, major advances in robotics, computer vision and artificial intelligence have fundamentally changed medicine and healthcare. Innovative new technologies are already aiding skilled medical professionals in diagnosis, surgery, rehabilitation and beyond. But many questions remain: What ethical issues arise as medical tools become increasingly autonomous? How do we regulate technologies that can learn and change over time? And how can we ensure that cutting-edge medical devices are accessible to all?

artificial intelligence, claire chatted, college london, (12 more...)

Robohub

Genre: Personal (0.50)

Industry:

Health & Medicine > Health Care Technology (0.90)
Health & Medicine > Health Care Equipment & Supplies (0.56)

Technology: Information Technology > Artificial Intelligence > Robots > Manipulation (0.31)

Add feedback

Object-centric 3DMotion Field for Robot Learning from Human Videos

Neural Information Processing SystemsJun-23-2026, 07:25:04 GMT

Learning robot control policies from human videos is a promising direction for scaling up robot learning. However, how to extract action knowledge (or action representations) from videos for policy learning remains a key challenge. Existing action representations such as video frames, pixelflow, and pointcloud flow have inherent limitations such as modeling complexity or loss of information. In this paper, we propose to use object-centric 3D motion field to represent actions for robot learning from human videos, and present a novel framework for extracting this representation from videos for zero-shot control. We introduce two novel components in its implementation.

large language model, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Genre: Research Report > Experimental Study (1.00)

Industry:

Information Technology (0.46)
Media > Photography (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.67)
Information Technology > Artificial Intelligence > Robots > Manipulation (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

HumanoidGen: Data Generation for Bimanual Dexterous Manipulation via LLMReasoning

Neural Information Processing SystemsJun-23-2026, 00:25:44 GMT

For robotic manipulation, existing robotics datasets and simulation benchmarks predominantly cater to robot-arm platforms. However, for humanoid robots equipped with dual arms and dexterous hands, simulation tasks and high-quality demonstrations are notably lacking. Bimanual dexterous manipulation is inherently more complex, as it requires coordinated arm movements and hand operations, making autonomous data collection challenging. This paper presents HumanoidGen, an automated task creation and demonstration collection framework that leverages atomic dexterous operations and LLM reasoning to generate relational constraints. Specifically, we provide spatial annotations for both assets and dexterous hands based on the atomic operations, and perform an LLM planner to generate a chain of actionable spatial constraints for arm movements based on object affordances and scenes. To further improve planning ability, we employ a variant of Monte Carlo tree search to enhance LLM reasoning for long-horizon tasks and insufficient annotation. In experiments, we create a novel benchmark with augmented scenarios to evaluate the quality of the collected data. The results show that the performance of the 2D and 3D diffusion policies can scale with the generated dataset.

large language model, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Genre:

Research Report > Experimental Study (1.00)
Workflow (0.93)
Research Report > New Finding (0.87)

Technology:

Information Technology > Artificial Intelligence > Robots > Manipulation (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (0.87)

Add feedback

Touch in the wild

Neural Information Processing SystemsJun-22-2026, 23:48:16 GMT

Handheld grippers are increasingly used to collect human demonstrations due to their ease of deployment and versatility. However, most existing designs lack tactile lation.

arxiv preprint arxiv, machine learning, natural language, (16 more...)

Neural Information Processing Systems

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.67)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Robots > Manipulation (0.68)
(2 more...)

Add feedback

DexGarmentLab: Dexterous Garment Manipulation Environment with Generalizable Policy

Neural Information Processing SystemsJun-22-2026, 16:13:36 GMT

Garment manipulation is a critical challenge due to the diversity in garment categories, geometries, and deformations. Despite this, humans can effortlessly handle garments, thanks to the dexterity of our hands. However, existing research in the field has struggled to replicate this level of dexterity, primarily hindered by the lack of realistic simulations of dexterous garment manipulation. Therefore, we propose DexGarmentLab, the first environment specifically designed for dexterous (especially bimanual) garment manipulation, which features large-scale high-quality 3D assets for 15 task scenarios, and refines simulation techniques tailored for garment modeling to reduce the sim-to-real gap. Previous data collection typically relies on teleoperation or training expert reinforcement learning (RL) policies, which are labor-intensive and inefficient. In this paper, we leverage garment structural correspondence to automatically generate a dataset with diverse trajectories using only a single expert demonstration, significantly reducing manual intervention.

artificial intelligence, garment, machine learning, (18 more...)

Neural Information Processing Systems

Genre: Research Report > Experimental Study (1.00)

Industry: Leisure & Entertainment > Games > Computer Games (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)
Information Technology > Artificial Intelligence > Robots > Manipulation (0.46)

Add feedback

9ecafb09de180aaad7b7205be7eb24a4-Paper-Datasets_and_Benchmarks_Track.pdf

Neural Information Processing SystemsJun-20-2026, 18:40:00 GMT

Vision-Language Models (VLMs) are increasingly pivotal for generalist robot manipulation, enabling tasks such as physical reasoning, policy generation, and failure detection. However, their proficiency in these high-level applications often assumes a deep understanding of low-level physical prerequisites, a capability that is largely unverified. To perform actions reliably, robots must comprehend intrinsic object properties (e.g., material, weight), action affordances (e.g., graspable, stackable), and physical constraints (e.g., stability, reachability, or an object's state like being closed). Despite their ubiquitous use in manipulation, we argue that off-the-shelf VLMs may lack this granular, physically-grounded understanding, as these specific prerequisites are often overlooked during training. Addressing this critical gap, we introduce PACBench, a comprehensive benchmark designed to systematically evaluate VLMs on their understanding of these core Properties, Affordances, and Constraints (PAC) from a task executability perspective. PAC Bench features a diverse dataset with more than 30,000 annotations, comprising 673 real-world images (115 object classes, 15 property types, 1-3 affordances defined per object class), 100 real-world humanoid view scenarios, and 120 unique simulated constraint scenarios across four tasks. Our evaluations reveal significant gaps in the ability of VLMs to grasp fundamental physical concepts, underscoring their current limitations for reliable robot manipulation and pointing to key areas that require targeted research. PACBench also serves as a standardized benchmark for rigorously evaluating the physical reasoning capabilities of VLMs guiding the development of more robust and physically grounded models for robot manipulation.

large language model, machine learning, natural language, (22 more...)

Neural Information Processing Systems

Genre: Research Report > Experimental Study (1.00)

Industry: Appliances & Durable Goods (0.67)

Technology: