Are deep learning models superior for missing data imputation in large surveys? Evidence from an empirical comparison