Deploying Large NLP Models: Infrastructure Cost Optimization