Goto

Collaborating Authors

 Janabi-Sharifi, Farrokh


Deep Visual Servoing of an Aerial Robot Using Keypoint Feature Extraction

arXiv.org Artificial Intelligence

The problem of image-based visual servoing (IBVS) of an aerial robot using deep-learning-based keypoint detection is addressed in this article. A monocular RGB camera mounted on the platform is utilized to collect the visual data. A convolutional neural network (CNN) is then employed to extract the features serving as the visual data for the servoing task. This paper contributes to the field by circumventing not only the challenge stemming from the need for man-made marker detection in conventional visual servoing techniques, but also enhancing the robustness against undesirable factors including occlusion, varying illumination, clutter, and background changes, thereby broadening the applicability of perception-guided motion control tasks in aerial robots. Additionally, extensive physics-based ROS Gazebo simulations are conducted to assess the effectiveness of this method, in contrast to many existing studies that rely solely on physics-less simulations. A demonstration video is available at https://youtu.be/Dd2Her8Ly-E.


Hybrid Visual Servoing of Tendon-driven Continuum Robots

arXiv.org Artificial Intelligence

HVS outperforms DLBVS in iteration time, error reduction, and con - trol smoothness. Experimental validation confirms HVS effectiveness under occlusion s and noise. Abstract This paper introduces a novel Hybrid Visual Servoing (HVS) approa ch for controlling tendon-driven continuum robots (TDCRs). The HVS sys tem combines Image-Based Visual Servoing (IBVS) with Deep Learning-Based Visual Servoing (DLBVS) to overcome the limitations of each method and improve overall performance. IBVS offers higher accuracy and fa ster convergence in feature-rich environments, while DLBVS enhances rob ustness against disturbances and offers a larger workspace. By enabling sm ooth transitions between IBVS and DLBVS, the proposed HVS ensures e ffective control in dynamic, unstructured environments. The effectivene ss of this approach is validated through simulations and real-world experiments, demonstrating that HVS achieves reduced iteration time, faster conver gence, lower final error, and smoother performance compared to DLBVS alone, while maintaining DLBVS's robustness in challenging conditions such as occlu - sions, lighting changes, actuator noise, and physical impacts.


Backstepping Control of Tendon-Driven Continuum Robots in Large Deflections Using the Cosserat Rod Model

arXiv.org Artificial Intelligence

This paper presents a study on the backstepping control of tendon-driven continuum robots for large deflections using the Cosserat rod model. Continuum robots are known for their flexibility and adaptability, making them suitable for various applications. However, modeling and controlling them pose challenges due to their nonlinear dynamics. To model their dynamics, the Cosserat rod method is employed to account for significant deflections, and a numerical solution method is developed to solve the resulting partial differential equations. Previous studies on controlling tendon-driven continuum robots using Cosserat rod theory focused on sliding mode control and were not tested for large deflections, lacking experimental validation. In this paper, backstepping control is proposed as an alternative to sliding mode control for achieving a significant bending. The numerical results are validated through experiments in this study, demonstrating that the proposed backstepping control approach is a promising solution for achieving large deflections with smoother trajectories, reduced settling time, and lower overshoot. Furthermore, two scenarios involving external forces and disturbances were introduced to further highlight the robustness of the backstepping control approach.


Dual Agent Learning Based Aerial Trajectory Tracking

arXiv.org Artificial Intelligence

This paper presents a novel reinforcement learning framework for trajectory tracking of unmanned aerial vehicles in cluttered environments using a dual-agent architecture. Traditional optimization methods for trajectory tracking face significant computational challenges and lack robustness in dynamic environments. Our approach employs deep reinforcement learning (RL) to overcome these limitations, leveraging 3D pointcloud data to perceive the environment without relying on memory-intensive obstacle representations like occupancy grids. The proposed system features two RL agents: one for predicting UAV velocities to follow a reference trajectory and another for managing collision avoidance in the presence of obstacles. This architecture ensures real-time performance and adaptability to uncertainties. We demonstrate the efficacy of our approach through simulated and real-world experiments, highlighting improvements over state-of-the-art RL and optimization-based methods. Additionally, a curriculum learning paradigm is employed to scale the algorithms to more complex environments, ensuring robust trajectory tracking and obstacle avoidance in both static and dynamic scenarios.


Keypoint Detection Technique for Image-Based Visual Servoing of Manipulators

arXiv.org Artificial Intelligence

This paper introduces an innovative keypoint detection technique based on Convolutional Neural Networks (CNNs) to enhance the performance of existing Deep Visual Servoing (DVS) models. To validate the convergence of the Image-Based Visual Servoing (IBVS) algorithm, real-world experiments utilizing fiducial markers for feature detection are conducted before designing the CNN-based feature detector. To address the limitations of fiducial markers, the novel feature detector focuses on extracting keypoints that represent the corners of a more realistic object compared to fiducial markers. A dataset is generated from sample data captured by the camera mounted on the robot end-effector while the robot operates randomly in the task space. The samples are automatically labeled, and the dataset size is increased by flipping and rotation. The CNN model is developed by modifying the VGG-19 pre-trained on the ImageNet dataset. While the weights in the base model remain fixed, the fully connected layer's weights are updated to minimize the mean absolute error, defined based on the deviation of predictions from the real pixel coordinates of the corners. The model undergoes two modifications: replacing max-pooling with average-pooling in the base model and implementing an adaptive learning rate that decreases during epochs. These changes lead to a 50 percent reduction in validation loss. Finally, the trained model's reliability is assessed through k-fold cross-validation.


Image-to-Joint Inverse Kinematic of a Supportive Continuum Arm Using Deep Learning

arXiv.org Artificial Intelligence

In this work, a deep learning-based technique is used to study the image-to-joint inverse kinematics of a tendon-driven supportive continuum arm. An eye-off-hand configuration is considered by mounting a camera at a fixed pose with respect to the inertial frame attached at the arm base. This camera captures an image for each distinct joint variable at each sampling time to construct the training dataset. This dataset is then employed to adapt a feed-forward deep convolutional neural network, namely the modified VGG-16 model, to estimate the joint variable. One thousand images are recorded to train the deep network, and transfer learning and fine-tuning techniques are applied to the modified VGG-16 to further improve the training. Finally, training is also completed with a larger dataset of images that are affected by various types of noises, changes in illumination, and partial occlusion. The main contribution of this research is the development of an image-to-joint network that can estimate the joint variable given an image of the arm, even if the image is not captured in an ideal condition. The key benefits of this research are twofold: 1) image-to-joint mapping can offer a real-time alternative to computationally complex inverse kinematic mapping through analytical models; and 2) the proposed technique can provide robustness against noise, occlusion, and changes in illumination. The dataset is publicly available on Kaggle.