Dynamic Stress Detection: A Study of Temporal Progression Modelling of Stress in Speech

Lall, Vishakha, Liu, Yisi

arXiv.org Artificial Intelligence 

Abstract--Detecting psychological stress from speech is critical in high-pressure settings. While prior work has leveraged acoustic features for stress detection, most treat stress as a static label. In this work, we model stress as a temporally evolving phenomenon influenced by historical emotional state. We propose a dynamic labelling strategy that derives fine-grained stress annotations from emotional labels and introduce cross-attention-based sequential models--a Unidirectional LSTM and a Transformer Encoder--to capture temporal stress progression. Our approach achieves notable accuracy gains on MuSE (+5%) and StressID (+18%) over existing baselines, and generalises well to a custom real-world dataset. These results highlight the value of modelling stress as a dynamic construct in speech.