screwdriver
Learning Dexterous Manipulation Skills from Imperfect Simulations
Hsieh, Elvis, Hsieh, Wen-Han, Wang, Yen-Jen, Lin, Toru, Malik, Jitendra, Sreenath, Koushil, Qi, Haozhi
Figure 1: We propose DexScrew, a sim-to-real framework for learning dexterous manipulation skills when the environment cannot be accurately simulated. In simulation, we use simplified objects to learn transferable rotational skills, which are then used to collect data and train tactile policies in the real world. We demonstrate the framework on contact-rich screwdriving (top row) and nut-bolt fastening (middle row). We also show generalization across different objects (bottom row). More videos and code are available on https://dexscrew.github.io. Abstract-- Reinforcement learning and sim-to-real transfer have made significant progress in dexterous manipulation. However, progress remains limited by the difficulty of simulating complex contact dynamics and multisensory signals, especially tactile feedback. In this work, we propose DexScrew, a sim-to-real framework that addresses these limitations and demonstrates its effectiveness on nut-bolt fastening and screwdriving with multi-fingered hands. The framework has three stages. First, we train reinforcement learning policies in simulation using simplified object models that lead to the emergence of correct finger gaits. We then use the learned policy as a skill primitive within a teleoperation system to collect real-world demonstrations that contain tactile and proprioceptive information. Finally, we train a behavior cloning policy that incorporates tactile sensing and show that it generalizes to nuts and screwdrivers with diverse geometries. Experiments across both tasks show high task progress ratios compared to direct sim-to-real transfer and robust performance even on unseen object shapes and under external perturbations.
Leveraging Neural Descriptor Fields for Learning Contact-Aware Dynamic Recovery
Yang, Fan, Huang, Zixuan, Kumar, Abhinav, Marinovic, Sergio Aguilera, Iba, Soshi, Zarrin, Rana Soltani, Berenson, Dmitry
Real-world dexterous manipulation often encounters unexpected errors and disturbances, which can lead to catastrophic failures, such as dropping the manipulated object. To address this challenge, we focus on the problem of catching a falling object while it remains within grasping range and, importantly, resetting the system to a configuration favorable for resuming the primary manipulation task. We propose Contact-Aware Dynamic Recovery (CADRE), a reinforcement learning framework that incorporates a Neural Descriptor Field (NDF)-inspired module to extract implicit contact features. Compared to methods that rely solely on object pose or point cloud input, NDFs can directly reason about finger-object correspondence and adapt to different object geometries. Our experiments show that incorporating contact features improves training efficiency, enhances convergence performance for RL training, and ultimately leads to more successful recoveries. Additionally, we demonstrate that CADRE can generalize zero-shot to unseen objects with different geometries.
- North America > United States > Montana (0.04)
- North America > United States > Michigan > Washtenaw County > Ann Arbor (0.04)
PLEXUS Hand: Lightweight Four-Motor Prosthetic Hand Enabling Precision-Lateral Dexterous Manipulation
Kuroda, Yuki, Takahashi, Tomoya, Beltran-Hernandez, Cristian C, Hamaya, Masashi, Tanaka, Kazutoshi
Electric prosthetic hands should be lightweight to decrease the burden on the user, shaped like human hands for cosmetic purposes, and have motors inside to protect them from damage and dirt. In addition to the ability to perform daily activities, these features are essential for everyday use of the hand. In-hand manipulation is necessary to perform daily activities such as transitioning between different postures, particularly through rotational movements, such as reorienting cards before slot insertion and operating tools such as screwdrivers. However, currently used electric prosthetic hands only achieve static grasp postures, and existing manipulation approaches require either many motors, which makes the prosthesis heavy for daily use in the hand, or complex mechanisms that demand a large internal space and force external motor placement, complicating attachment and exposing the components to damage. Alternatively, we combine a single-axis thumb and optimized thumb positioning to achieve basic posture and in-hand manipulation, that is, the reorientation between precision and lateral grasps, using only four motors in a lightweight (311 g) prosthetic hand. Experimental validation using primitive objects of various widths (5-30 mm) and shapes (cylinders and prisms) resulted in success rates of 90-100% for reorientation tasks. The hand performed seal stamping and USB device insertion, as well as rotation to operate a screwdriver.
ARRC: Advanced Reasoning Robot Control - Knowledge-Driven Autonomous Manipulation Using Retrieval-Augmented Generation
Vorobiov, Eugene, Mahmood, Ammar Jaleel, Rezvani, Salim, Chhabra, Robin
Each action is represented as a tu-ple of an action label and bounded parameters (e.g., APPROACH OBJECT {label: "bottle", hover mm: 30, timeout sec: 8}). B. Hierarchical Scanning Algorithm The manipulator system employs a two-phase scanning routine for robust object detection. In the first phase, a horizontal scan is performed across the workspace at a fixed height, where the perception module inspects each sampled position for potential targets. If no objects are detected, the system transitions to a fallback arc scan, which uses three predetermined joint-space configurations (LEFT, CENTER, and RIGHT). These configurations are hardcoded to ensure safe, repeatable coverage of the workspace and avoid kinematic singularities, providing a deterministic alternative to Cartesian waypoint-based exploration. C. Coordinate Transformation Pipeline Object coordinates obtained from AprilTags are first expressed in the eye-in-hand camera frame and then transformed into the robot base frame using calibrated transformations. These base-frame coordinates are subsequently used for inverse kinematics computations during manipulation. The transformed coordinates are stored in memory and retrieved on demand, depending on the specific objects referenced in the execution plan generated by the LLM.
A novel parameter estimation method for pneumatic soft hand control applying logarithmic decrement for pseudo rigid body modeling
Zhang, Haiyun, Heung, Kelvin HoLam, Naquila, Gabrielle J., Hingwe, Ashwin, Deshpande, Ashish D.
The rapid advancement in physical human-robot interaction (HRI) has accelerated the development of soft robot designs and controllers. Controlling soft robots, especially soft hand grasping, is challenging due to their continuous deformation, motivating the use of reduced model-based controllers for real-time dynamic performance. Most existing models, however, suffer from computational inefficiency and complex parameter identification, limiting their real-time applicability. To address this, we propose a paradigm coupling Pseudo-Rigid Body Modeling with the Logarithmic Decrement Method for parameter estimation (PRBM plus LDM). Using a soft robotic hand test bed, we validate PRBM plus LDM for predicting position and force output from pressure input and benchmark its performance. We then implement PRBM plus LDM as the basis for closed-loop position and force controllers. Compared to a simple PID controller, the PRBM plus LDM position controller achieves lower error (average maximum error across all fingers: 4.37 degrees versus 20.38 degrees). For force control, PRBM plus LDM outperforms constant pressure grasping in pinching tasks on delicate objects: potato chip 86 versus 82.5, screwdriver 74.42 versus 70, brass coin 64.75 versus 35. These results demonstrate PRBM plus LDM as a computationally efficient and accurate modeling technique for soft actuators, enabling stable and flexible grasping with precise force regulation.
- North America > United States > Texas > Travis County > Austin (0.14)
- Asia > China > Hong Kong (0.04)
Geodesic Tracing-Based Kinematic Integration of Rolling and Sliding Contact on Manifold Meshes for Dexterous In-Hand Manipulation
Wang, Sunyu, Lakshmipathy, Arjun S., Oh, Jean, Pollard, Nancy S.
Figure 1: Snapshots of a robotic hand performing dexterous in-hand manipulation tasks using the integration scheme we developed. Tasks from left to right: 1) horizontally turning a screwdriver, 2) vertically turning the screwdriver, 3) turning an M2 threaded rod, 4) pressing down a knurled hobby knife, 5) pressing down a dining knife, 6) closing tweezers. Abstract -- Reasoning about rolling and sliding contact, or roll-slide contact for short, is critical for dexterous manipulation tasks that involve intricate geometries. But existing works on roll-slide contact mostly focus on continuous shapes with differentiable parametrizations. This work extends roll-slide contact modeling to manifold meshes. Specifically, we present an integration scheme based on geodesic tracing to first-order time-integrate roll-slide contact directly on meshes, enabling dexterous manipulation to reason over high-fidelity discrete representations of an object's true geometry. Using our method, we planned dexterous motions of a multi-finger robotic hand manipulating five objects in-hand in simulation. The planning was achieved with a least-squares optimizer that strives to maintain the most stable instantaneous grasp by minimizing contact sliding and spinning. Then, we evaluated our method against a baseline using collision detection and a baseline using primitive shapes.
- North America > United States > Montana (0.05)
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
- North America > United States > California (0.04)
- (2 more...)
Modular Vacuum-Based Fixturing System for Adaptive Disassembly Workspace Integration
Pan, Haohui, Kiyokawa, Takuya, Ishikura, Tomoki, Hamada, Shingo, Matsuda, Genichiro, Harada, Kensuke
-- The disassembly of small household appliances poses significant challenges due to their complex and curved geometries, which render traditional rigid fixtures inadequate. In this paper, we propose a modular vacuum-based fixturing system that leverages commercially available balloon-type soft grippers to conform to arbitrarily shaped surfaces and provide stable support during screw-removal tasks. T o enable a reliable deployment of the system, we develop a stability-aware planning framework that samples the bottom surface of the target object, filters candidate contact points based on geometric continuity, and evaluates support configurations using convex hull-based static stability criteria. We compare the quality of object placement under different numbers and configurations of balloon hands. In addition, real-world experiments were conducted to compare the success rates of traditional rigid fixtures with our proposed system. The results demonstrate that our method consistently achieves higher success rates and superior placement stability during screw removal tasks. As demand for sustainable manufacturing and the circular economy grows, robotic disassembly is increasingly recognized as a key enabler for end-of-life product recovery.
RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping
Wu, Dongming, Fu, Yanping, Huang, Saike, Liu, Yingfei, Jia, Fan, Liu, Nian, Dai, Feng, Wang, Tiancai, Anwer, Rao Muhammad, Khan, Fahad Shahbaz, Shen, Jianbing
General robotic grasping systems require accurate object affordance perception in diverse open-world scenarios following human instructions. However, current studies suffer from the problem of lacking reasoning-based large-scale affordance prediction data, leading to considerable concern about open-world effectiveness. To address this limitation, we build a large-scale grasping-oriented affordance segmentation benchmark with human-like instructions, named RAGNet. It contains 273k images, 180 categories, and 26k reasoning instructions. The images cover diverse embodied data domains, such as wild, robot, ego-centric, and even simulation data. They are carefully annotated with an affordance map, while the difficulty of language instructions is largely increased by removing their category name and only providing functional descriptions. Furthermore, we propose a comprehensive affordance-based grasping framework, named AffordanceNet, which consists of a VLM pre-trained on our massive affordance data and a grasping network that conditions an affordance map to grasp the target. Extensive experiments on affordance segmentation benchmarks and real-robot manipulation tasks show that our model has a powerful open-world generalization ability. Our data and code is available at https://github.com/wudongming97/AffordanceNet.