Revolutionizing Text-to-Image Retrieval as Autoregressive Token-to-Voken Generation