Boosting Neural Representations for Videos with a Conditional Decoder