Detection of Intoxicated Individuals from Facial Video Sequences via a Recurrent Fusion Model
Baroutian, Bita, Aghaei, Atefe, Moghaddam, Mohsen Ebrahimi
–arXiv.org Artificial Intelligence
Abstract--Alcohol consumption is a significant public health concern and a major cause of accidents and fatalities worldwide. This study introduces a novel video-based facial sequence analysis approach dedicated to the detection of alcohol intoxication. The method integrates facial landmark analysis via a Graph Attention Network (GA T) with spatiotemporal visual features extracted using a 3D ResNet. These features are dynamically fused with adaptive prioritization to enhance classification performance. Additionally, we introduce a curated dataset comprising 3,542 video segments derived from 202 individuals to support training and evaluation. Our model is compared against two baselines: a custom 3D-CNN and a VGGFace+LSTM architecture. Experimental results show that our approach achieves 95.82% accuracy, 0.977 precision, and 0.97 recall, outperforming prior methods. The findings demonstrate the model's potential for practical deployment in public safety systems for non-invasive, reliable alcohol intoxication detection. Alcohol consumption remains a significant public safety challenge, particularly when it negatively affects cognitive functions, physical coordination, and judgment.
arXiv.org Artificial Intelligence
Dec-5-2025
- Country:
- Asia > Middle East
- Iran > Tehran Province > Tehran (0.04)
- Europe > United Kingdom
- England > Oxfordshire > Oxford (0.04)
- North America
- Canada > Quebec
- Montreal (0.04)
- United States > New Mexico
- Bernalillo County > Albuquerque (0.04)
- Canada > Quebec
- Asia > Middle East
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Technology: