Hausdorff Dimension, Heavy Tails, and Generalization in Neural Networks
–Neural Information Processing Systems
In contrast to convex optimization setting where the behavior of SGD is fairly well-understood (see e.g.
Neural Information Processing Systems
Oct-2-2025, 16:16:42 GMT
- Country:
- North America > Canada (0.28)
- Europe > United Kingdom
- England (0.14)
- Genre:
- Research Report > New Finding (0.68)
- Technology: