NoLoCo: No-all-reduce Low Communication Training Method for Large Models