Scalable Neural Video Representations with Learnable Positional Features