Can Synthetic Audio From Generative Foundation Models Assist Audio Recognition and Speech Modeling?