SupplementaryMaterial

Neural Information Processing Systems 

Given these considerations, we split our analysis to the case wherebq = s (referred to as the nonbottleneck case) and wherebq = min(M1,,ML 1)(referred to as the bottleneck case).