normal activity
ActPlan-1K: Benchmarking the Procedural Planning Ability of Visual Language Models in Household Activities
Su, Ying, Ling, Zhan, Shi, Haochen, Cheng, Jiayang, Yim, Yauwai, Song, Yangqiu
Large language models~(LLMs) have been adopted to process textual task description and accomplish procedural planning in embodied AI tasks because of their powerful reasoning ability. However, there is still lack of study on how vision language models~(VLMs) behave when multi-modal task inputs are considered. Counterfactual planning that evaluates the model's reasoning ability over alternative task situations are also under exploited. In order to evaluate the planning ability of both multi-modal and counterfactual aspects, we propose ActPlan-1K. ActPlan-1K is a multi-modal planning benchmark constructed based on ChatGPT and household activity simulator iGibson2. The benchmark consists of 153 activities and 1,187 instances. Each instance describing one activity has a natural language task description and multiple environment images from the simulator. The gold plan of each instance is action sequences over the objects in provided scenes. Both the correctness and commonsense satisfaction are evaluated on typical VLMs. It turns out that current VLMs are still struggling at generating human-level procedural plans for both normal activities and counterfactual activities. We further provide automatic evaluation metrics by finetuning over BLEURT model to facilitate future research on our benchmark.
Early Anomaly Detection in Time Series: A Hierarchical Approach for Predicting Critical Health Episodes
Cerqueira, Vitor, Torgo, Luis, Soares, Carlos
The early detection of anomalous events in time series data is essential in many domains of application. In this paper we deal with critical health events, which represent a significant cause of mortality in intensive care units of hospitals. The timely prediction of these events is crucial for mitigating their consequences and improving healthcare. One of the most common approaches to tackle early anomaly detection problems is standard classification methods. In this paper we propose a novel method that uses a layered learning architecture to address these tasks. One key contribution of our work is the idea of pre-conditional events, which denote arbitrary but computable relaxed versions of the event of interest. We leverage this idea to break the original problem into two hierarchical layers, which we hypothesize are easier to solve. The results suggest that the proposed approach leads to a better performance relative to state of the art approaches for critical health episode prediction.
You Need Real Data Science With Data Visualization
In the last decade, our data has slowly multiplied exponentially. Enterprises celebrated the Big Data movement as business managers, data scientists and technologists eagerly looked for insights in the data they acquired. Each day, companies, government, private individuals are in a race to acquire even more data. With machine learning and artificial intelligence, acquiring the right data for the right purpose seems to become more urgent. As our data multiply, we are caught in a race to generate insights.
Detecting Unseen Falls from Wearable Devices using Channel-wise Ensemble of Autoencoders
Khan, Shehroz S., Taati, Babak
A fall is an abnormal activity that occurs rarely, so it is hard to collect real data for falls. It is, therefore, difficult to use supervised learning methods to automatically detect falls. Another challenge in using machine learning methods to automatically detect falls is the choice of engineered features. In this paper, we propose to use an ensemble of autoencoders to extract features from different channels of wearable sensor data trained only on normal activities. We show that the traditional approach of choosing a threshold as the maximum of the reconstruction error on the training normal data is not the right way to identify unseen falls. We propose two methods for automatic tightening of reconstruction error from only the normal activities for better identification of unseen falls. We present our results on two activity recognition datasets and show the efficacy of our proposed method against traditional autoencoder models and two standard one-class classification methods. Keywords: detection 1. Introduction fall detection, one-class classification, autoencoders, anomaly Falls are a major cause of both fatal and nonfatal injury and a hindrance in living independently. Each year an estimated 424, 000 individuals die from falls globally and 37.3 million falls require medical attention [23]. Experiencing a fall may lead to a fear of falling [6], which in turn can result in lack of mobility, less productivity and reduced quality of life. There exist several commercial wearable devices to detect falls [24]; most of them use accelerometers to capture motion information. They normally come with an alarm button to manually contact a caregiver if the fall is not detected by the device.
Detecting Falls with X-Factor Hidden Markov Models
Khan, Shehroz S., Karg, Michelle E., Kulic, Dana, Hoey, Jesse
Identification of falls while performing normal activities of daily living (ADL) is important to ensure personal safety and well-being. However, falling is a short term activity that occurs infrequently. This poses a challenge to traditional classification algorithms, because there may be very little training data for falls (or none at all). This paper proposes an approach for the identification of falls using a wearable device in the absence of training data for falls but with plentiful data for normal ADL. We propose three `X-Factor' Hidden Markov Model (XHMMs) approaches. The XHMMs model unseen falls using "inflated" output covariances (observation models). To estimate the inflated covariances, we propose a novel cross validation method to remove "outliers" from the normal ADL that serve as proxies for the unseen falls and allow learning the XHMMs using only normal activities. We tested the proposed XHMM approaches on two activity recognition datasets and show high detection rates for falls in the absence of fall-specific training data. We show that the traditional method of choosing a threshold based on maximum of negative of log-likelihood to identify unseen falls is ill-posed for this problem. We also show that supervised classification methods perform poorly when very limited fall data are available during the training phase.