A comparative study on wearables and single-camera video for upper-limb out-of-thelab activity recognition with different deep learning architectures

Martínez-Zarzuela, Mario, González-Ortega, David, Antón-Rodríguez, Míriam, Díaz-Pernas, Francisco Javier, Müller, Henning, Simón-Martínez, Cristina

arXiv.org Artificial Intelligence 

Introduction: The use of a wide range of computer vision solutions, and more recently high-end Inertial Measurement Units (IMU) have become increasingly popular for assessing human physical activity in clinical and research settings [1]. Nevertheless, to increase the feasibility of patient tracking in out-of-the-lab settings, it is necessary to use a reduced number of devices for movement acquisition. Promising solutions in this context are IMU-based wearables and single camera systems [2]. Additionally, the development of machine learning systems able to recognize and digest clinically relevant data in-the-wild is needed, and therefore determining the ideal input to those is crucial [3]. Research question: For upper-limb activity recognition out-of-the-lab, do wearables or single camera offer better performance?