Dropout Training as Adaptive Regularization Stefan Wager, Sida Wang, and Percy Liang Departments of Statistics