Joint Partitioning and Placement of Foundation Models for Real-Time Edge AI