A Distributional View of High Dimensional Optimization

Benning, Felix

arXiv.org Machine Learning 

This PhD thesis presents a distributional view of optimization in place of a worst-case perspective. We motivate this view with an investigation of the failure points of classical optimization. Subsequently, we consider the optimization of a randomly drawn objective function, which is the setting of Bayesian optimization. After a review of Bayesian optimization, we outline how such a distributional view may explain the predictable progress of optimization in high dimensions. It further turns out that this distributional view provides insights into optimal step-size control for gradient descent. To enable these results, we develop mathematical tools to deal with random inputs to random functions, along with a characterization of non-stationary isotropic covariance kernels. Finally, we outline how assumptions about the data, specifically exchangeability, can lead to random objective functions in machine learning, and we analyze their landscape.
