One Proxy Device Is Enough for Hardware-Aware Neural Architecture Search