Ryll, Markus
PHODCOS: Pythagorean Hodograph-based Differentiable Coordinate System
Arrizabalaga, Jon, Vega, Fausto, ŠÍR, Zbyněk, Manchester, Zachary, Ryll, Markus
This paper presents PHODCOS, an algorithm that assigns a moving coordinate system to a given curve. The parametric functions underlying the coordinate system, i.e., the path function, the moving frame and its angular velocity, are exact -- approximation free -- differentiable, and sufficiently continuous. This allows for computing a coordinate system for highly nonlinear curves, while remaining compliant with autonomous navigation algorithms that require first and second order gradient information. In addition, the coordinate system obtained by PHODCOS is fully defined by a finite number of coefficients, which may then be used to compute additional geometric properties of the curve, such as arc-length, curvature, torsion, etc. Therefore, PHODCOS presents an appealing paradigm to enhance the geometrical awareness of existing guidance and navigation on-orbit spacecraft maneuvers. The PHODCOS algorithm is presented alongside an analysis of its error and approximation order, and thus, it is guaranteed that the obtained coordinate system matches the given curve within a desired tolerance. To demonstrate the applicability of the coordinate system resulting from PHODCOS, we present numerical examples in the Near Rectilinear Halo Orbit (NRHO) for the Lunar Gateway.
A Universal Formulation for Path-Parametric Planning and Control
Arrizabalaga, Jon, Ryll, Markus
Path-parametric methods have gained popularity in the formulation of navigation algorithms, such as high-level planners [1, 2, 3], reinforcement learning (RL) policies [4, 5, 6] or low-level model predictive controllers (MPC) [7, 8, 9]. The fundamental concept behind these parametric methods is to either introduce the path parameter as an additional degree of freedom, enabling the system to regulate its progress along the path [10, 11], or to conduct a change of coordinates that project the Euclidean states to the spatial states, i.e., the progress along the path and the orthogonal distance to it [12, 13, 14]. These parametric formulations have proven successful for three primary reasons: firstly, they inherently capture the notion of advancement along the path, secondly they allow for embedding the path's geometric features, such as the curvature and the torsion, into the system dynamics, and thirdly, spatial bounds manifest as convex constraints in the orthogonal terms of the spatial states. Given the broad range of problems encompassing path-parametric approaches, existing methods remain detached from each other and are frequently presented as independent work. This has resulted in a disjointed body of literature, where these techniques are viewed as distinct methods. Consequently, the reader is left with a fragmented view of the path-parametric problem, making it difficult to understand the interplay between the different techniques. To close this gap, in this paper we show how all these approaches are interconnected by presenting a universal formulation for path-parametric planning and control. To this end, the path-parametric problem is analyzed from three different yet interconnected perspectives (i-iii): First, in Section 2, we study the (i) interplay of existing parametric techniques and show how they can be unified under a single framework consisting of two ingredients: (ii) a path-parameterization technique and (iii) a spatial representation of the system dynamics. These are discussed in-depth across the subsequent Sections 3 and 4, respectively.
Differentiable Collision-Free Parametric Corridors
Arrizabalaga, Jon, Manchester, Zachary, Ryll, Markus
Abstract-- This paper presents a method to compute differentiable collision-free parametric corridors. In contrast to existing solutions that decompose the obstacle-free space into multiple convex sets, the continuous corridors computed by our method are smooth and differentiable, making them compatible with existing numerical techniques for learning and optimization. To achieve this, we represent the collision-free corridors as a path-parametric off-centered ellipse with a polynomial basis. We show that the problem of maximizing the volume of such corridors is convex, and can be efficiently solved. To assess the effectiveness of the proposed method, we examine its performance in a synthetic case study and subsequently evaluate its applicability in a real-world scenario from the KITTI dataset.
Dual Quaternion Control of UAVs with Cable-suspended Load
Yuan, Yuxia, Ryll, Markus
Modeling the kinematics and dynamics of robotics systems with suspended loads using dual quaternions has not been explored so far. This paper introduces a new innovative control strategy using dual quaternions for UAVs with cable-suspended loads, focusing on the sling load lifting and tracking problems. By utilizing the mathematical efficiency and compactness of dual quaternions, a unified representation of the UAV and its suspended load's dynamics and kinematics is achieved, facilitating the realization of load lifting and trajectory tracking. The simulation results have tested the proposed strategy's accuracy, efficiency, and robustness. This study makes a substantial contribution to present this novel control strategy that harnesses the benefits of dual quaternions for cargo UAVs. Our work also holds promise for inspiring future innovations in under-actuated systems control using dual quaternions.
Scene-Graph ViT: End-to-End Open-Vocabulary Visual Relationship Detection
Salzmann, Tim, Ryll, Markus, Bewley, Alex, Minderer, Matthias
Visual relationship detection aims to identify objects and their relationships in images. Prior methods approach this task by adding separate relationship modules or decoders to existing object detection architectures. This separation increases complexity and hinders end-to-end training, which limits performance. We propose a simple and highly efficient decoder-free architecture for open-vocabulary visual relationship detection. Our model consists of a Transformer-based image encoder that represents objects as tokens and models their relationships implicitly. To extract relationship information, we introduce an attention mechanism that selects object pairs likely to form a relationship. We provide a single-stage recipe to train this model on a mixture of object and relationship detection data. Our approach achieves state-of-the-art relationship detection performance on Visual Genome and on the large-vocabulary GQA benchmark at real-time inference speeds. We provide analyses of zero-shot performance, ablations, and real-world qualitative examples.
Geometric Slosh-Free Tracking for Robotic Manipulators
Arrizabalaga, Jon, Pries, Lukas, Laha, Riddhiman, Li, Runkang, Haddadin, Sami, Ryll, Markus
This work focuses on the agile transportation of liquids with robotic manipulators. In contrast to existing methods that are either computationally heavy, system/container specific or dependant on a singularity-prone pendulum model, we present a real-time slosh-free tracking technique. This method solely requires the reference trajectory and the robot's kinematic constraints to output kinematically feasible joint space commands. The crucial element underlying this approach consists on mimicking the end-effector's motion through a virtual quadrotor, which is inherently slosh-free and differentially flat, thereby allowing us to calculate a slosh-free reference orientation. Through the utilization of a cascaded proportional-derivative (PD) controller, this slosh-free reference is transformed into task space acceleration commands, which, following the resolution of a Quadratic Program (QP) based on Resolved Acceleration Control (RAC), are translated into a feasible joint configuration. The validity of the proposed approach is demonstrated by simulated and real-world experiments on a 7 DoF Franka Emika Panda robot. Code: https://github.com/jonarriza96/gsft Video: https://youtu.be/4kitqYVS9n8
Pose-Following with Dual Quaternions
Arrizabalaga, Jon, Ryll, Markus
This work focuses on pose-following, a variant of path-following in which the goal is to steer the system's position and attitude along a path with a moving frame attached to it. Full body motion control, while accounting for the additional freedom to self-regulate the progress along the path, is an appealing trade-off. Towards this end, we extend the well-established dual quaternion-based pose-tracking method into a pose-following control law. Specifically, we derive the equations of motion for the full pose error between the geometric reference and the rigid body in the form of a dual quaternion and dual twist. Subsequently, we formulate an almost globally asymptotically stable control law. The global attractivity of the presented approach is validated in a spatial example, while its benefits over pose-tracking are showcased through a planar case-study.
Learning for CasADi: Data-driven Models in Numerical Optimization
Salzmann, Tim, Arrizabalaga, Jon, Andersson, Joel, Pavone, Marco, Ryll, Markus
While real-world problems are often challenging to analyze analytically, deep learning excels in modeling complex processes from data. Existing optimization frameworks like CasADi facilitate seamless usage of solvers but face challenges when integrating learned process models into numerical optimizations. To address this gap, we present the Learning for CasADi (L4CasADi) framework, enabling the seamless integration of PyTorch-learned models with CasADi for efficient and potentially hardware-accelerated numerical optimization. The applicability of L4CasADi is demonstrated with two tutorial examples: First, we optimize a fish's trajectory in a turbulent river for energy efficiency where the turbulent flow is represented by a PyTorch model. Second, we demonstrate how an implicit Neural Radiance Field environment representation can be easily leveraged for optimal control with L4CasADi.
Robots That Can See: Leveraging Human Pose for Trajectory Prediction
Salzmann, Tim, Chiang, Lewis, Ryll, Markus, Sadigh, Dorsa, Parada, Carolina, Bewley, Alex
Anticipating the motion of all humans in dynamic environments such as homes and offices is critical to enable safe and effective robot navigation. Such spaces remain challenging as humans do not follow strict rules of motion and there are often multiple occluded entry points such as corners and doors that create opportunities for sudden encounters. In this work, we present a Transformer based architecture to predict human future trajectories in human-centric environments from input features including human positions, head orientations, and 3D skeletal keypoints from onboard in-the-wild sensory information. The resulting model captures the inherent uncertainty for future human trajectory prediction and achieves state-of-the-art performance on common prediction benchmarks and a human tracking dataset captured from a mobile robot adapted for the prediction task. Furthermore, we identify new agents with limited historical data as a major contributor to error and demonstrate the complementary nature of 3D skeletal poses in reducing prediction error in such challenging scenarios.
Real-time Neural-MPC: Deep Learning Model Predictive Control for Quadrotors and Agile Robotic Platforms
Salzmann, Tim, Kaufmann, Elia, Arrizabalaga, Jon, Pavone, Marco, Scaramuzza, Davide, Ryll, Markus
Model Predictive Control (MPC) has become a popular framework in embedded control for high-performance autonomous systems. However, to achieve good control performance using MPC, an accurate dynamics model is key. To maintain real-time operation, the dynamics models used on embedded systems have been limited to simple first-principle models, which substantially limits their representative power. In contrast to such simple models, machine learning approaches, specifically neural networks, have been shown to accurately model even complex dynamic effects, but their large computational complexity hindered combination with fast real-time iteration loops. With this work, we present Real-time Neural MPC, a framework to efficiently integrate large, complex neural network architectures as dynamics models within a model-predictive control pipeline. Our experiments, performed in simulation and the real world onboard a highly agile quadrotor platform, demonstrate the capabilities of the described system to run learned models with, previously infeasible, large modeling capacity using gradient-based online optimization MPC. Compared to prior implementations of neural networks in online optimization MPC we can leverage models of over 4000 times larger parametric capacity in a 50Hz real-time window on an embedded platform. Further, we show the feasibility of our framework on real-world problems by reducing the positional tracking error by up to 82% when compared to state-of-the-art MPC approaches without neural network dynamics.