Voxtlm: unified decoder-only models for consolidating speech recognition/synthesis and speech/text continuation tasks