Robust Multiple Description Neural Video Codec with Masked Transformer for Dynamic and Noisy Networks