The Larger the Merrier? Efficient Large AI Model Inference in Wireless Edge Networks