Detection of Intoxicated Individuals from Facial Video Sequences via a Recurrent Fusion Model