optimization trajectory
- North America > United States > California (0.04)
- Europe > Germany > North Rhine-Westphalia > Upper Bavaria > Munich (0.04)
- Asia > South Korea > Seoul > Seoul (0.04)
On the Limitations of Fractal Dimension as a Measure of Generalization Charlie B. Tan University of Oxford Inés García-Redondo Imperial College London Qiquan Wang
Bounding and predicting the generalization gap of overparameterized neural networks remains a central open problem in theoretical machine learning. There is a recent and growing body of literature that proposes the framework of fractals to model optimization trajectories of neural networks, motivating generalization bounds and measures based on the fractal dimension of the trajectory. Notably, the persistent homology dimension has been proposed to correlate with the generalization gap.
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.40)
- North America > United States > California (0.04)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
e04101138a3c94544760c1dbdf2c7a2d-Paper-Conference.pdf
For example, while prior work has suggested that theglobally optimal VAEsolution canlearn thecorrect manifold dimension, anecessary (butnotsufficient)condition forproducing samplesfrom the true data distribution, this has never been rigorously proven. Moreover, it remains unclear how such considerations would change when various types of conditioning variablesare introduced, or when the data support is extended to a union of manifolds (e.g., as is likely the case for MNIST digits and related). In this work, we address these points by first proving that VAE global minima are indeed capable of recovering the correct manifold dimension.
- Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Europe > Switzerland > Vaud > Lausanne (0.04)
- Asia > Russia (0.04)
- Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
- Energy (0.93)
- North America > United States > Illinois > Cook County > Chicago (0.04)
- North America > Canada (0.04)
- Asia > Middle East > Israel (0.04)
- North America > Canada > Ontario > Toronto (0.15)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Europe > Spain > Andalusia > Cádiz Province > Cadiz (0.04)
addressed all raised questions below: conducting new experiments to compare with hand-designed optimizers (# 1)
We genuinely appreciate all three reviewers' (#1,#2,#3,#4) valuable suggestions to strengthen our paper. More details are referred to Reviewer #3's Q2. We sincerely appreciate your suggestion and will revise the caption in figure 4 for better readability. Reply: It is a great observation. Thus, IL appears to be a main contributor for L2O generalizing across different optimizees.