Dynamic Reward Scaling for Multivariate Time Series Anomaly Detection: A VAE-Enhanced Reinforcement Learning Approach
Golchin, Bahareh, Rekabdar, Banafsheh
–arXiv.org Artificial Intelligence
Abstract-- Detecting anomalies in multivariate time series is essential for monitoring complex industrial systems, where high dimensionality, limited labeled data, and subtle dependencies between sensors cause significant challenges. This paper presents a deep reinforcement learning framework that combines a V ari-ational Autoencoder (V AE), an LSTM-based Deep Q-Network (DQN), dynamic reward shaping, and an active learning module to address these issues in a unified learning framework. The main contribution is the implementation of Dynamic Reward Scaling for Multivariate Time Series Anomaly Detection (DRSMT), which demonstrates how each component enhances the detection process. The V AE captures compact latent representations and reduces noise. The DQN enables adaptive, sequential anomaly classification, and the dynamic reward shaping balances exploration and exploitation during training by adjusting the importance of reconstruction and classification signals. In addition, active learning identifies the most uncertain samples for labeling, reducing the need for extensive manual supervision. Experiments on two multivariate benchmarks, namely Server Machine Dataset (SMD) and Water Distribution T estbed (W ADI), show that the proposed method outperforms existing baselines in F1-score and AU-PR. In many of today's applications, identifying and removing anomalies (i.e., outliers) has become essential to ensure system reliability. In multivariate time series data, specifically, different factors can result in anomalies.
arXiv.org Artificial Intelligence
Nov-18-2025
- Country:
- Asia > Singapore (0.04)
- North America > United States
- Ohio > Lucas County
- Oregon (0.04)
- Oregon > Multnomah County
- Portland (0.04)
- Ohio > Lucas County
- Genre:
- Research Report (1.00)
- Industry:
- Water & Waste Management > Water Management > Lifecycle (0.46)
- Technology: