Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation Models