Shih, Chia-Hsien
Rethinking VLMs and LLMs for Image Classification
Cooper, Avi, Kato, Keizo, Shih, Chia-Hsien, Yamane, Hiroaki, Vinken, Kasper, Takemoto, Kentaro, Sunagawa, Taro, Yeh, Hao-Wei, Yamanaka, Jin, Mason, Ian, Boix, Xavier
Visual Language Models (VLMs) are now increasingly being merged with Large Language Models (LLMs) to enable new capabilities, particularly in terms of improved interactivity and open-ended responsiveness. While these are remarkable capabilities, the contribution of LLMs to enhancing the longstanding key problem of classifying an image among a set of choices remains unclear. Through extensive experiments involving seven models, ten visual understanding datasets, and multiple prompt variations per dataset, we find that, for object and scene recognition, VLMs that do not leverage LLMs can achieve better performance than VLMs that do. Yet at the same time, leveraging LLMs can improve performance on tasks requiring reasoning and outside knowledge. In response to these challenges, we propose a pragmatic solution: a lightweight fix involving a relatively small LLM that efficiently routes visual tasks to the most suitable model for the task. The LLM router undergoes training using a dataset constructed from more than 2.5 million examples of pairs of visual task and model accuracy. Our results reveal that this lightweight fix surpasses or matches the accuracy of state-of-the-art alternatives, including GPT-4V and HuggingGPT, while improving cost-effectiveness.
Hierarchical control and learning of a foraging CyberOctopus
Shih, Chia-Hsien, Naughton, Noel, Halder, Udit, Chang, Heng-Sheng, Kim, Seung Hyun, Gillette, Rhanor, Mehta, Prashant G., Gazzola, Mattia
Inspired by the unique neurophysiology of the octopus, we propose a hierarchical framework that simplifies the coordination of multiple soft arms by decomposing control into high-level decision making, low-level motor activation, and local reflexive behaviors via sensory feedback. When evaluated in the illustrative problem of a model octopus foraging for food, this hierarchical decomposition results in significant improvements relative to end-to-end methods. Performance is achieved through a mixed-modes approach, whereby qualitatively different tasks are addressed via complementary control schemes. Here, model-free reinforcement learning is employed for high-level decision-making, while model-based energy shaping takes care of arm-level motor execution. To render the pairing computationally tenable, a novel neural-network energy shaping (NN-ES) controller is developed, achieving accurate motions with time-to-solutions 200 times faster than previous attempts. Our hierarchical framework is then successfully deployed in increasingly challenging foraging scenarios, including an arena littered with obstacles in 3D space, demonstrating the viability of our approach.
Energy Shaping Control of a Muscular Octopus Arm Moving in Three Dimensions
Chang, Heng-Sheng, Halder, Udit, Shih, Chia-Hsien, Naughton, Noel, Gazzola, Mattia, Mehta, Prashant G.
Interest in soft robots, specifically soft continuum arms (SCA), comes from their potential ability to perform complex tasks in unstructured environments as well as to operate safely around humans, with applications ranging from agriculture [1-3] to surgery [4-6]. An important bio-inspiration for SCAs is provided by octopus arms [7-10]. An octopus arm is hyper-flexible with nearly infinite degrees of freedom, seamlessly coordinated to generate a rich orchestra of motions such as reaching, grasping, fetching, crawling, or swimming [11,12]. How such a marvelous coordination is possible remains a source of mystery and amazement, and of inspiration to soft roboticists. Part of the challenge comes from the intricate organization and biomechanics of the three major muscle groups--transverse, longitudinal, and oblique--which add to the overall complexity of the problem [13-16]. In this paper, we develop a bio-physical model of octopus arm equipped with virtual musculature, using the formalism of the Cosserat rod theory [17,18]. In this type of modeling, a key concept is the stored energy function of nonlinear elasticity theory whereby the internal forces and couples of a hyperelastic rod are obtained as the gradients of the stored energy function. The goal of this work is to extend the energy concept for following inter-related tasks: (i) Bio-physical modeling of the internal muscles, and (ii) Model-based control design. The specific contributions on the two tasks are as follows.
Energy Shaping Control of a CyberOctopus Soft Arm
Chang, Heng-Sheng, Halder, Udit, Shih, Chia-Hsien, Tekinalp, Arman, Parthasarathy, Tejaswin, Gribkova, Ekaterina, Chowdhary, Girish, Gillette, Rhanor, Gazzola, Mattia, Mehta, Prashant G.
This paper entails application of the energy shaping methodology to control a flexible, elastic Cosserat rod model. Recent interest in such continuum models stems from applications in soft robotics, and from the growing recognition of the role of mechanics and embodiment in biological control strategies: octopuses are often regarded as iconic examples of this interplay. Here, the dynamics of the Cosserat rod, modeling a single octopus arm, are treated as a Hamiltonian system and the internal muscle actuators are modeled as distributed forces and couples. The proposed energy shaping control design procedure involves two steps: (1) a potential energy is designed such that its minimizer is the desired equilibrium configuration; (2) an energy shaping control law is implemented to reach the desired equilibrium. By interpreting the controlled Hamiltonian as a Lyapunov function, asymptotic stability of the equilibrium configuration is deduced. The energy shaping control law is shown to require only the deformations of the equilibrium configuration. A forward-backward algorithm is proposed to compute these deformations in an online iterative manner. The overall control design methodology is implemented and demonstrated in a dynamic simulation environment. Results of several bio-inspired numerical experiments involving the control of octopus arms are reported.