Goto

Collaborating Authors

 elevator


Robots in China are riding the subway to make 7-Eleven deliveries

Popular Science

Breakthroughs, discoveries, and DIY tips sent every weekday. Subway commuters in Shenzhen, China, may soon need to make room for a fleet of chunky, snack-carrying delivery robots. Earlier this week, more than three dozen autonomous, four-wheeled delivery robots boarded and exited active subway trains, and eventually delivered packages to several 7-Eleven convenience stores. Although this demonstration was only a preliminary test and took place during off-peak hours, the company behind the subway-riding robots believes they could soon help stock shelves at around 100 7-Eleven locations. The initiative is part of a broader effort in China and other countries to normalize the presence of delivery robots operating in public spaces.


Novel RL approach for efficient Elevator Group Control Systems

Vaartjes, Nathan, Francois-Lavet, Vincent

arXiv.org Artificial Intelligence

Efficient elevator traffic management in large buildings is critical for minimizing passenger travel times and energy consumption. Because heuristic- or pattern-detection-based controllers struggle with the stochastic and combinatorial nature of dispatching, we model the six-elevator, fifteen-floor system at Vrije Universiteit Amsterdam as a Markov Decision Process and train an end-to-end Reinforcement Learning (RL) Elevator Group Control System (EGCS). Key innovations include a novel action space encoding to handle the combinatorial complexity of elevator dispatching, the introduction of infra-steps to model continuous passenger arrivals, and a tailored reward signal to improve learning efficiency. In addition, we explore various ways to adapt the discounting factor to the infra-step formulation. We investigate RL architectures based on Dueling Double Deep Q-learning, showing that the proposed RL-based EGCS adapts to fluctuating traffic patterns, learns from a highly stochastic environment, and thereby outperforms a traditional rule-based algorithm.


Range-based 6-DoF Monte Carlo SLAM with Gradient-guided Particle Filter on GPU

Nakao, Takumi, Koide, Kenji, Takanose, Aoki, Oishi, Shuji, Yokozuka, Masashi, Date, Hisashi

arXiv.org Artificial Intelligence

-- This paper presents range-based 6-DoF Monte Carlo SLAM with a gradient-guided particle update strategy. While non-parametric state estimation methods, such as particle filters, are robust in situations with high ambiguity, they are known to be unsuitable for high-dimensional problems due to the curse of dimensionality. T o address this issue, we propose a particle update strategy that improves the sampling efficiency by using the gradient information of the likelihood function to guide particles toward its mode. Additionally, we introduce a keyframe-based map representation that represents the global map as a set of past frames (i.e., keyframes) to mitigate memory consumption. The keyframe poses for each particle are corrected using a simple loop closure method to maintain trajectory consistency. The combination of gradient information and keyframe-based map representation significantly enhances sampling efficiency and reduces memory usage compared to traditional RBPF approaches. T o process a large number of particles (e.g., 100,000 particles) in real-time, the proposed framework is designed to fully exploit GPU parallel processing. Experimental results demonstrate that the proposed method exhibits extreme robustness to state ambiguity and can even deal with kidnapping situations, such as when the sensor moves to different floors via an elevator, with minimal heuristics.


A New Way to Fix the Housing Crisis

Slate

Sign up for the Slatest to get the most insightful analysis, criticism, and advice out there, delivered to your inbox daily. Two decades ago, the fire marshal in Glendale, Arizona, was concerned that the elevators in a new stadium wouldn't be large enough to accommodate a 7-foot stretcher held flat. Tilting a stretcher to make it fit in the cab, the marshal worried, might jeopardize the treatment of a patient with a back injury. Maybe our elevators should be bigger, he thought. The marshal put this idea to the International Code Council, the organization that governs the construction of American buildings. After minor feedback and minimal research (the marshal measured three stretchers in the Phoenix area), the suggestion was incorporated into the ICC's model code.


Fall Detection in Passenger Elevators using Intelligent Surveillance Camera Systems: An Application with YoloV8 Nano Model

Yozgatli, Pinar, Acar, Yavuz, Tulumen, Mehmet, Minga, Selman, Selamet, Salih, Nalbant, Beytullah, Toru, Mustafa Talha, Koca, Berna, Keles, Tevfik, Selcok, Mehmet

arXiv.org Artificial Intelligence

Computer vision technology, which involves analyzing images and videos captured by cameras through deep learning algorithms, has significantly advanced the field of human fall detection. This study focuses on the application of the YoloV8 Nano model in identifying fall incidents within passenger elevators, a context that presents unique challenges due to the enclosed environment and varying lighting conditions. By training the model on a robust dataset comprising over 10,000 images across diverse elevator types, we aim to enhance the detection precision and recall rates. The model's performance, with an 85% precision and 82% recall in fall detection, underscores its potential for integration into existing elevator safety systems to enable rapid intervention.


BUMBLE: Unifying Reasoning and Acting with Vision-Language Models for Building-wide Mobile Manipulation

Shah, Rutav, Yu, Albert, Zhu, Yifeng, Zhu, Yuke, Martín-Martín, Roberto

arXiv.org Artificial Intelligence

To operate at a building scale, service robots must perform very long-horizon mobile manipulation tasks by navigating to different rooms, accessing different floors, and interacting with a wide and unseen range of everyday objects. We refer to these tasks as Building-wide Mobile Manipulation. To tackle these inherently long-horizon tasks, we introduce BUMBLE, a unified Vision-Language Model (VLM)-based framework integrating open-world RGBD perception, a wide spectrum of gross-to-fine motor skills, and dual-layered memory. Our extensive evaluation (90+ hours) indicates that BUMBLE outperforms multiple baselines in long-horizon building-wide tasks that require sequencing up to 12 ground truth skills spanning 15 minutes per trial. BUMBLE achieves 47.1% success rate averaged over 70 trials in different buildings, tasks, and scene layouts from different starting rooms and floors. Our user study demonstrates 22% higher satisfaction with our method than state-of-the-art mobile manipulation methods. Finally, we demonstrate the potential of using increasingly-capable foundation models to push performance further. For more information, see https://robin-lab.cs.utexas.edu/BUMBLE/


Raising the Bar(ometer): Identifying a User's Stair and Lift Usage Through Wearable Sensor Data Analysis

Karande, Hrishikesh Balkrishna, Shivalingappa, Ravikiran Arasur Thippeswamy, Yaici, Abdelhafid Nassim, Haghbin, Iman, Bavadiya, Niravkumar, Burchard, Robin, Van Laerhoven, Kristof

arXiv.org Artificial Intelligence

Many users are confronted multiple times daily with the choice of whether to take the stairs or the elevator. Whereas taking the stairs could be beneficial for cardiovascular health and wellness, taking the elevator might be more convenient but it also consumes energy. By precisely tracking and boosting users' stairs and elevator usage through their wearable, users might gain health insights and motivation, encouraging a healthy lifestyle and lowering the risk of sedentary-related health problems. This research describes a new exploratory dataset, to examine the patterns and behaviors related to using stairs and lifts. We collected data from 20 participants while climbing and descending stairs and taking a lift in a variety of scenarios. The aim is to provide insights and demonstrate the practicality of using wearable sensor data for such a scenario. Our collected dataset was used to train and test a Random Forest machine learning model, and the results show that our method is highly accurate at classifying stair and lift operations with an accuracy of 87.61% and a multi-class weighted F1-score of 87.56% over 8-second time windows. Furthermore, we investigate the effect of various types of sensors and data attributes on the model's performance. Our findings show that combining inertial and pressure sensors yields a viable solution for real-time activity detection.


MuNES: Multifloor Navigation Including Elevators and Stairs

Jung, Donghwi, Kim, Chan, Cho, Jae-Kyung, Kim, Seong-Woo

arXiv.org Artificial Intelligence

We propose a scheme called MuNES for single mapping and trajectory planning including elevators and stairs. Optimized multifloor trajectories are important for optimal interfloor movements of robots. However, given two or more options of moving between floors, it is difficult to select the best trajectory because there are no suitable indoor multifloor maps in the existing methods. To solve this problem, MuNES creates a single multifloor map including elevators and stairs by estimating altitude changes based on pressure data. In addition, the proposed method performs floor-based loop detection for faster and more accurate loop closure. The single multifloor map is then voxelized leaving only the parts needed for trajectory planning. An optimal and realistic multifloor trajectory is generated by exploring the voxels using an A* algorithm based on the proposed cost function, which affects realistic factors. We tested this algorithm using data acquired from around a campus and note that a single accurate multifloor map could be created. Furthermore, optimal and realistic multifloor trajectory could be found by selecting the means of motion between floors between elevators and stairs according to factors such as the starting point, ending point, and elevator waiting time. The code and data used in this work are available at https://github.com/donghwijung/MuNES.


A method for Selecting Scenes and Emotion-based Descriptions for a Robot's Diary

Ichikura, Aiko, Kawaharazuka, Kento, Obinata, Yoshiki, Okada, Kei, Inaba, Masayuki

arXiv.org Artificial Intelligence

Furthermore, we found that the robot's emotion generally improves the preference of the robot's diary regardless of the scene it describes. However, presenting negative or mixed emotions at once may decrease the preference of the diary or reduce the robot's robot-likeness, and thus the method of presenting emotions still needs further investigation. I. INTRODUCTION In human-robot communication, various studies have attempted to enhance the relationship between humans and robots. Among them, we focused on the effect on the relationship between a robot and a person when the robot shares its daily experiences with the person through a diary. Diaries are also used as an application for commercial robots sold in Japan, and have become one of the interaction tools between robots and people.SHARP's RoBoHoN[1], for example, can remember events of the day like a diary when you talk to the robot. GROOVE X, Inc.'s LOVOT[2] can't speak, but its mobile app provides a diary that displays a timeline of human


Multi-Valued Partial Order Plans in Numeric Planning

Helal, Hayyan, Lakemeyer, Gerhard

arXiv.org Artificial Intelligence

Many planning formalisms allow for mixing numeric with Boolean effects. However, most of these formalisms are undecidable. In this paper, we will analyze possible causes for this undecidability by studying the number of different occurrences of actions, an approach that proved useful for metric fluents before. We will start by reformulating a numeric planning problem known as restricted tasks as a search problem. We will then show how an NP-complete fragment of numeric planning can be found by using heuristics. To achieve this, we will develop the idea of multi-valued partial order plans, a least committing compact representation for (sequential and parallel) plans. Finally, we will study optimization techniques for this representation to incorporate soft preconditions.