Tail-Aware Information-Theoretic Generalization for RLHF and SGLD
Zhang, Huiming, Li, Binghan, Tian, Wan, Sun, Qiang
Classical information-theoretic generalization bounds typically control the generalization gap through KL-based mutual information and therefore rely on boundedness or sub-Gaussian tails via the moment generating function (MGF). In many modern pipelines, such as robust learning, RLHF, and stochastic optimization, losses and rewards can be heavy-tailed, and MGFs may not exist, rendering KL-based tools ineffective. We develop a tail-dependent information-theoretic framework for sub-Weibull data, where the tail parameter $θ$ controls the tail heaviness: $θ=2$ corresponds to sub-Gaussian, $θ=1$ to sub-exponential, and $0<θ<1$ to genuinely heavy tails. Our key technical ingredient is a decorrelation lemma that bounds change-of-measure expectations using a shifted-log $f_θ$-divergence, which admits explicit comparisons to Rényi divergence without MGF arguments. On the empirical-process side, we establish sharp maximal inequalities and a Dudley-type chaining bound for sub-Weibull processes with tail index $θ$, with complexity scaling as $\log^{1/θ}$ and entropy$^{1/θ}$. These tools yield expected and high-probability PAC-Bayes generalization bounds, as well as an information-theoretic chaining inequality based on multiscale Rényi mutual information. We illustrate the consequences in Rényi-regularized RLHF under heavy-tailed rewards and in stochastic gradient Langevin dynamics with heavy-tailed gradient noise.
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.54)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.34)
- North America > United States > Illinois > Champaign County > Urbana (0.14)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- North America > United States > Rhode Island > Providence County > Providence (0.04)
- Europe > Sweden > Stockholm > Stockholm (0.04)
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
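The sub-Weibull tail regimes described in the abstract ($θ=2$ sub-Gaussian, $θ=1$ sub-exponential, $0<θ<1$ heavy-tailed) can be illustrated with a short simulation. The $|Z|^{2/θ}$ construction below is a standard device for generating sub-Weibull($θ$) variates and is an assumption of this sketch, not a detail taken from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

def sub_weibull_samples(theta, n):
    """Draw sub-Weibull(theta) variates as |Z|**(2/theta) with Z ~ N(0,1),
    so that P(W > t) decays roughly like exp(-t**theta / 2)."""
    z = rng.standard_normal(n)
    return np.abs(z) ** (2.0 / theta)

# Tail mass beyond t = 5 grows as theta shrinks: theta = 2 is
# sub-Gaussian, theta = 1 sub-exponential, theta = 0.5 heavy-tailed.
for theta in (2.0, 1.0, 0.5):
    w = sub_weibull_samples(theta, 200_000)
    print(f"theta={theta}: P(W > 5) ~ {np.mean(w > 5.0):.4f}")
```

For $θ=0.5$ a visible fraction of the mass sits beyond $t=5$, while the sub-Gaussian case is essentially never there, which is why MGF-based (KL) arguments fail in the heavy-tailed regime.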
Blink Video Doorbell (2nd Gen) review: Impressive features, great price
Amazon's entry-level video doorbell delivers essential features at a bargain price. The Blink Video Doorbell (2nd Gen) delivers clear video, wide coverage, reliable alerts, and long battery life at a remarkably low price. Its main drawback is limited local storage: the included Sync Module Core doesn't support USB storage. If you don't need advanced features like ultra-sharp resolution or full-duplex audio, this doorbell is a true bargain. Blink is Amazon's budget line of smart home products.
- Leisure & Entertainment (0.97)
- Media > Music (0.47)
- Information Technology > Smart Houses & Appliances (0.44)
- Information Technology > Security & Privacy (0.31)
Learning Shrinks the Hard Tail: Training-Dependent Inference Scaling in a Solvable Linear Model
We analyze neural scaling laws in a solvable model of last-layer fine-tuning where targets have intrinsic, instance-heterogeneous difficulty. In our Latent Instance Difficulty (LID) model, each input's target variance is governed by a latent ``precision'' drawn from a heavy-tailed distribution. While generalization loss recovers standard scaling laws, our main contribution connects this to inference. The pass@$k$ failure rate exhibits a power-law decay, $k^{-β_\text{eff}}$, but the observed exponent $β_\text{eff}$ is training-dependent. It grows with sample size $N$ before saturating at an intrinsic limit $β$ set by the difficulty distribution's tail. This coupling reveals that learning shrinks the ``hard tail'' of the error distribution: improvements in the model's generalization error steepen the pass@$k$ curve until irreducible target variance dominates. The LID model yields testable, closed-form predictions for this behavior, including a compute-allocation rule that favors training before saturation and inference attempts after. We validate these predictions in simulations and in two real-data proxies: CIFAR-10H (human-label variance) and a maths teacher-student distillation task.
- North America (0.28)
- Europe (0.28)
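The distribution-level part of the pass@$k$ story — that a heavy tail of instance difficulty yields a power-law failure rate $k^{-β}$ — can be checked with a toy Monte-Carlo sketch. The Beta$(β, 1)$ success-probability distribution below is a stand-in assumption for illustration, not the paper's LID model.

```python
import numpy as np

rng = np.random.default_rng(1)
beta = 0.5          # assumed tail exponent of instance difficulty
n_instances = 500_000

# Per-instance success probability with power-law mass near zero:
# p ~ Beta(beta, 1) has density beta * p**(beta - 1).
p = rng.beta(beta, 1.0, size=n_instances)

def failure_at_k(k):
    """Monte-Carlo pass@k failure rate E[(1 - p)**k]."""
    return np.mean((1.0 - p) ** k)

# The observed exponent, read off as a local slope on log-log axes,
# approaches the difficulty tail exponent beta for large k.
beta_eff = -np.log(failure_at_k(256) / failure_at_k(64)) / np.log(256 / 64)
print(round(beta_eff, 2))  # close to beta = 0.5
```

Here the observed exponent is set entirely by the difficulty tail; the paper's contribution is to show how training shifts this slope before it saturates at the intrinsic limit $β$.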
Drawback of Enforcing Equivariance and its Compensation via the Lens of Expressive Power
Chen, Yuzhu, Qin, Tian, Tian, Xinmei, He, Fengxiang, Tao, Dacheng
Equivariant neural networks encode symmetry as an inductive bias and have achieved strong empirical performance across a wide range of domains. However, their expressive power remains not well understood. Focusing on 2-layer ReLU networks, this paper investigates the impact of equivariance constraints on the expressivity of equivariant and layer-wise equivariant networks. By examining the boundary hyperplanes and the channel vectors of ReLU networks, we construct an example showing that equivariance constraints could strictly limit expressive power. However, we demonstrate that this drawback can be compensated via enlarging the model size. Furthermore, we show that despite a larger model size, the resulting architecture could still correspond to a hypothesis space with lower complexity, implying superior generalizability for equivariant networks.
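As a concrete instance of an equivariance constraint on a ReLU layer, tying the weights into a circulant matrix makes the layer commute with cyclic shifts. This toy check is illustrative only and is not drawn from the paper's construction.

```python
import numpy as np

rng = np.random.default_rng(2)
n = 6

# Circulant weight-tying enforces equivariance to cyclic shifts:
# every row of W is a rotation of the same filter, so
# W @ roll(x) = roll(W @ x), and elementwise ReLU preserves this.
filt = rng.standard_normal(n)
W = np.stack([np.roll(filt, i) for i in range(n)])

relu = lambda v: np.maximum(v, 0.0)
x = rng.standard_normal(n)

lhs = relu(W @ np.roll(x, 1))        # shift the input, then apply the layer
rhs = np.roll(relu(W @ x), 1)        # apply the layer, then shift the output
print(np.allclose(lhs, rhs))         # True: the ReLU layer is shift-equivariant
```

The constraint cuts the layer's free parameters from $n^2$ to $n$, which is exactly the kind of restriction whose effect on expressive power the paper analyzes.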
Google Nest Cam Indoor and Outdoor 2K Review: Slick, Smart, and Secure
The latest Nest cams jump to 2K resolution, but what really elevates them is Gemini's pricey AI-subscription smarts. Gemini can answer questions and offer descriptions, and the overhauled Google Home app is much improved.
- North America > United States > California (0.04)
- Europe > Slovakia (0.04)
- Europe > Czechia (0.04)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.53)
- Information Technology > Communications > Mobile (0.47)