AITopics | huber loss

Online robust locally differentially private learning for nonparametric regression

Neural Information Processing Systems | Jun-23-2026, 03:26:57 GMT

The growing prevalence of streaming data and increasing concerns over data privacy pose significant challenges for traditional nonparametric regression methods, which are often ill-suited for real-time, privacy-aware learning. In this paper, we tackle these issues by first proposing a novel one-pass online functional stochastic gradient descent algorithm that leverages the Huber loss (H-FSGD) to improve robustness against outliers and heavy-tailed errors in dynamic environments. To further accommodate privacy constraints, we introduce a locally differentially private extension, Private H-FSGD (PH-FSGD), designed for real-time, privacy-preserving estimation. Theoretically, we conduct a comprehensive non-asymptotic convergence analysis of the proposed estimators, establishing finite-sample guarantees and identifying step size schedules that achieve optimal convergence rates. In particular, we provide practical insights into the impact of key hyperparameters, such as step size and privacy budget, on convergence behavior. Extensive experiments validate our theoretical findings, demonstrating that our methods achieve strong robustness and privacy protection without sacrificing efficiency.
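To make the role of the Huber loss in a one-pass update concrete, here is a minimal sketch of online SGD with a Huber gradient. It is not the paper's H-FSGD: it assumes a plain linear model rather than a functional (nonparametric) estimator, it omits the privacy mechanism of PH-FSGD entirely, and the names `huber_loss`, `huber_grad`, `online_huber_sgd`, `tau`, `step0`, and `decay` are hypothetical choices made for illustration.

```python
import numpy as np


def huber_loss(residual, tau):
    """Huber loss: quadratic for |residual| <= tau, linear beyond it."""
    a = np.abs(residual)
    return np.where(a <= tau, 0.5 * residual**2, tau * (a - 0.5 * tau))


def huber_grad(residual, tau):
    """Derivative of the Huber loss w.r.t. the residual (a clipped residual)."""
    return np.clip(residual, -tau, tau)


def online_huber_sgd(stream, dim, tau=1.0, step0=0.5, decay=0.5):
    """One pass over (x, y) pairs; each observation is used once, then discarded.

    Assumption: linear model y ~ x @ theta and a polynomially decaying
    step size step0 * t**(-decay). Both are illustrative choices.
    """
    theta = np.zeros(dim)
    for t, (x, y) in enumerate(stream, start=1):
        residual = y - x @ theta
        step = step0 * t ** (-decay)
        # Clipping bounds the influence of any single outlying response.
        theta = theta + step * huber_grad(residual, tau) * x
    return theta


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    true_theta = np.array([1.0, -2.0])
    X = rng.normal(size=(20000, 2))
    # Heavy-tailed noise: Student-t with 2 degrees of freedom (infinite variance).
    y = X @ true_theta + rng.standard_t(df=2, size=20000)
    print(online_huber_sgd(zip(X, y), dim=2))
```

Because the gradient is clipped at `tau`, a single extreme response can move the estimate by at most `step * tau * ||x||`, which is the robustness property the abstract attributes to the Huber loss.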

artificial intelligence, machine learning, noise, (18 more...)

Country:

Asia (0.46)
North America > United States (0.27)
Europe (0.27)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.87)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(2 more...)

TailedTS: Benchmark Dataset for Heavy-Tailed Time Series Prediction and Periodicity Quantification

Chen, Xinyu, Cai, HanQin, Ding, Lijun, Zhao, Jinhua

arXiv.org Machine Learning | May-19-2026

We present TailedTS, a large-scale benchmark dataset derived from Wikipedia hourly page view observations throughout 2024, specifically designed to test time series forecasting models under heavy-tailed, zero-inflated, and non-Gaussian conditions. The dataset comprises approximately 24.69 billion data points spanning roughly 3 million unique Wikipedia pages per month, stored in high-efficiency Apache Parquet format. Wikipedia traffic follows a pronounced power-law distribution where roughly 5% of pages account for over 70% of total page views, creating a natural and rigorous testbed for model robustness against extreme volatility of a kind that is absent from or underrepresented in existing benchmarks such as M4, M5, and UCI electricity datasets. TailedTS enables several research tasks. First, we introduce a periodicity quantification framework based on sparse autoregression with sparsity and non-negativity constraints, revealing that frequently viewed pages exhibit significantly weaker periodic structure than their less-viewed counterparts, with direct implications for server allocation and traffic forecasting on large digital platforms. Second, we provide standardized prediction benchmarks evaluated under a suite of non-Gaussian loss functions, including $\ell_1$-norm, Huber, quantile, and $\ell_p$-norm losses, demonstrating that standard Gaussian-based estimators degrade substantially on high-volume page categories, while robust alternatives provide consistent gains across all traffic scales. TailedTS is publicly available at https://doi.org/10.5281/zenodo.17070469.
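For reference, the four loss families named in the abstract are commonly written as follows for a residual $r = y - \hat{y}$. These are the textbook forms, given here as an assumption; the symbols $\delta$, $q$, and $p$ are introduced only for illustration, and the benchmark's own parameterization may differ.

```latex
\begin{align*}
\ell_1(r) &= |r|, \\
\ell_{\mathrm{Huber},\delta}(r) &=
  \begin{cases}
    \tfrac{1}{2} r^2, & |r| \le \delta, \\
    \delta\left(|r| - \tfrac{1}{2}\delta\right), & |r| > \delta,
  \end{cases} \\
\rho_q(r) &= r\left(q - \mathbf{1}\{r < 0\}\right), \quad q \in (0, 1), \\
\ell_p(r) &= |r|^p, \quad p > 0.
\end{align*}
```

The squared loss corresponds to a Gaussian error model, whereas the Huber loss grows only linearly beyond $\delta$ and the $\ell_1$ and quantile losses grow linearly everywhere, which limits the weight placed on extreme residuals.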

data mining, machine learning, natural language, (19 more...)

arXiv:2605.16361

Country:

North America > United States > California (0.46)
North America > United States > Massachusetts (0.28)

Genre: Research Report > New Finding (0.46)

Industry: Energy > Power Industry (0.46)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)

4afa19649ae378da31a423bcd78a97c8-Paper.pdf

Neural Information Processing Systems | Apr-25-2026, 18:35:33 GMT

artificial intelligence, assumption 2, machine learning, (15 more...)

Country: North America (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Data Science (0.68)

Material

Neural Information Processing Systems | Apr-24-2026, 15:15:37 GMT

A.1 Data Configuration. The inputs to a hydraulic simulation include an elevation map, initial conditions, and boundary conditions. For a given elevation map, there are infinitely many possible combinations of initial and boundary conditions that could be realized in future events. It is an interesting question how to automatically configure the most relevant initial and boundary conditions to train on, so as to obtain a representation that will be useful in potential future real-world scenarios. We suggest a basic configuration that is adequate for the purpose of this paper. These include the water height $h \in \mathbb{R}^{m \times m}$ at each pixel and a staggered grid flux $q \in \mathbb{R}^{2 \times (m-1) \times (m-1)}$ in each direction $x, y$.

artificial intelligence, elevation map, machine learning, (17 more...)

Country: Asia > India (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.94)

01d78b294d80491fecddea897cf03642-Supplemental-Conference.pdf

Neural Information Processing Systems | Apr-24-2026, 07:38:19 GMT

agent, artificial intelligence, machine learning, (15 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

4f20f7f5d2e7a1b640ebc8244428558c-AuthorFeedback.pdf

Neural Information Processing Systems | Feb-19-2026, 01:28:18 GMT

exploration, idac, sia, (13 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.49)

d5b3d8dadd770c460b1cde910a711987-Paper.pdf

Neural Information Processing Systems | Feb-11-2026, 08:45:35 GMT

Estimating information from structured data is a central theme in statistics that by now has found applications in a wide array of disciplines.

artificial intelligence, estimator, machine learning, (19 more...)

Country:

Europe > Switzerland > Zürich > Zürich (0.16)
North America > United States > Arizona > Maricopa County > Phoenix (0.04)
Asia > Middle East > Jordan (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.95)

a3bf6e4db673b6449c2f7d13ee6ec9c0-Supplemental.pdf

Neural Information Processing Systems | Feb-9-2026, 15:54:34 GMT

gradient, implementation, time step, (14 more...)

Country: North America > Puerto Rico (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.30)

On Empirical Risk Minimization with Dependent and Heavy-Tailed Data

Neural Information Processing Systems | Feb-8-2026, 12:56:54 GMT

The above problem requires knowledge of the distribution π, which is typically unknown in practice.

artificial intelligence, citedonpage2, machine learning, (17 more...)

Country:

North America > United States > California > Yolo County > Davis (0.04)
North America > Canada > Ontario > Toronto (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Fair Regression under Demographic Parity: A Unified Framework

Feng, Yongzhen, Wang, Weiwei, Wong, Raymond K. W., Zhang, Xianyang

arXiv.org Machine Learning | Jan-16-2026

We propose a unified framework for fair regression tasks formulated as risk minimization problems subject to a demographic parity constraint. Unlike many existing approaches that are limited to specific loss functions or rely on challenging non-convex optimization, our framework is applicable to a broad spectrum of regression tasks. Examples include linear regression with squared loss, binary classification with cross-entropy loss, quantile regression with pinball loss, and robust regression with Huber loss. We derive a novel characterization of the fair risk minimizer, which yields a computationally efficient estimation procedure for general loss functions. Theoretically, we establish the asymptotic consistency of the proposed estimator and derive its convergence rates under mild assumptions. We illustrate the method's versatility through detailed discussions of several common loss functions. Numerical results demonstrate that our approach effectively minimizes risk while satisfying fairness constraints across various regression settings.

artificial intelligence, machine learning, regression, (15 more...)

arXiv:2601.10623

Country: North America > United States (0.69)

Genre: Research Report > New Finding (0.48)

Technology: