6fee03d84375a159ecd3769ebbacae83-Supplemental-Conference.pdf
–Neural Information Processing Systems
Convergence of stochastic gradient descent for non-smooth problems is a known result. For completeness, wereproduce and adapt ausual proof toour setting. Let us denote byF the class of functions fromX toY we are going to work with. Assumption 1 states that we have a well-specified modelF to estimate the median,i.e. Let us begin by controlling the estimation error.
Neural Information Processing Systems
Feb-9-2026, 17:27:05 GMT