T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs