Distributional Scaling Laws for Emergent Capabilities