Joint Model Assignment and Resource Allocation for Cost-Effective Mobile Generative Services