Uni-NaVid: A Video-based Vision-Language-Action Model for Unifying Embodied Navigation Tasks