Inference Optimization of Foundation Models on AI Accelerators