- Asia > Russia (0.14)
- Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.04)
- North America > Canada > Quebec > Montreal (0.04)
- Asia > Middle East > Jordan (0.04)
- North America (0.14)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
- Energy > Energy Storage (0.67)
- Electrical Industrial Apparatus (0.67)
Fundamental Novel Consistency Theory: $H$-Consistency Bounds
In machine learning, the loss functions optimized during training often differ from the target loss that defines task performance due to computational intractability or lack of differentiability. We present an in-depth study of the target loss estimation error relative to the surrogate loss estimation error. Our analysis leads to $H$-consistency bounds, which are guarantees accounting for the hypothesis set $H$. These bounds offer stronger guarantees than Bayes-consistency or $H$-calibration and are more informative than excess error bounds. We begin with binary classification, establishing tight distribution-dependent and -independent bounds. We provide explicit bounds for convex surrogates (including linear models and neural networks) and analyze the adversarial setting for surrogates like $ρ$-margin and sigmoid loss. Extending to multi-class classification, we present the first $H$-consistency bounds for max, sum, and constrained losses, covering both non-adversarial and adversarial scenarios. We demonstrate that in some cases, non-trivial $H$-consistency bounds are unattainable. We also investigate comp-sum losses (e.g., cross-entropy, MAE), deriving their first $H$-consistency bounds and introducing smooth adversarial variants that yield robust learning algorithms. We develop a comprehensive framework for deriving these bounds across various surrogates, introducing new characterizations for constrained and comp-sum losses. Finally, we examine the growth rates of $H$-consistency bounds, establishing a universal square-root growth rate for smooth surrogates in binary and multi-class tasks, and analyze minimizability gaps to guide surrogate selection.
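The general shape of such a guarantee can be sketched as follows (a schematic form only, using the quantities named in the abstract; the concave function $\Gamma$ and the exact minimizability-gap terms vary by surrogate and hypothesis set):

```latex
% Schematic H-consistency bound: for every hypothesis h in H, the target
% (e.g., zero-one) estimation error is controlled by the surrogate one.
\mathcal{R}_{\ell}(h) - \mathcal{R}_{\ell}^{*}(\mathcal{H}) + \mathcal{M}_{\ell}(\mathcal{H})
\;\le\;
\Gamma\!\left(\mathcal{R}_{\Phi}(h) - \mathcal{R}_{\Phi}^{*}(\mathcal{H}) + \mathcal{M}_{\Phi}(\mathcal{H})\right)
```

Here $\ell$ is the target loss, $\Phi$ the surrogate, $\mathcal{R}^{*}(\mathcal{H})$ the best-in-class error, and $\mathcal{M}(\mathcal{H})$ the minimizability gap discussed in the abstract; the universal square-root growth rate for smooth surrogates corresponds to $\Gamma(t)$ growing like $\sqrt{t}$.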
- North America > Canada > Ontario > Toronto (0.13)
- Asia > Middle East > Jordan (0.04)
- North America > United States > New York (0.04)
- (4 more...)
- Europe > France > Île-de-France > Paris > Paris (0.04)
- Asia > Middle East > Jordan (0.04)
- North America > United States > California > Los Angeles County > Long Beach (0.04)
- (4 more...)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.85)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.67)
TEMPO: Global Temporal Building Density and Height Estimation from Satellite Imagery
Glazer, Tammy, Hacheme, Gilles Q., Zaytar, Akram, Marotti, Luana, Michaels, Amy, Tadesse, Girmaw Abebe, White, Kevin, Dodhia, Rahul, Zolli, Andrew, Becker-Reshef, Inbal, Lavista Ferres, Juan M., Robinson, Caleb
We present TEMPO, a global, temporally resolved dataset of building density and height derived from high-resolution satellite imagery using deep learning models. We pair building footprint and height data from existing datasets with quarterly PlanetScope basemap satellite images to train a multi-task deep learning model that predicts building density and building height at a 37.6-meter per pixel resolution. We apply this model to global PlanetScope basemaps from Q1 2018 through Q2 2025 to create global, temporal maps of building density and height. We validate these maps by comparing against existing building footprint datasets. Our estimates achieve an F1 score between 85% and 88% on different hand-labeled subsets, and are temporally stable, with a 0.96 five-year trend-consistency score. TEMPO captures quarterly changes in built settlements at a fraction of the computational cost of comparable approaches, unlocking large-scale monitoring of development patterns and climate impacts essential for global resilience and adaptation efforts.
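The reported validation against existing footprint datasets comes down to a per-pixel F1 computation between predicted and hand-labeled built-up masks; a minimal sketch, assuming binary masks of equal shape (the function name and toy data are illustrative, not from the TEMPO codebase):

```python
import numpy as np

def pixel_f1(pred: np.ndarray, truth: np.ndarray) -> float:
    """F1 score between two binary built-up masks of equal shape."""
    pred, truth = pred.astype(bool), truth.astype(bool)
    tp = np.logical_and(pred, truth).sum()
    fp = np.logical_and(pred, ~truth).sum()
    fn = np.logical_and(~pred, truth).sum()
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

# Toy 2x2 example: one true positive, one false positive, one false negative.
pred  = np.array([[1, 1], [0, 0]])
truth = np.array([[1, 0], [1, 0]])
print(round(pixel_f1(pred, truth), 2))  # 0.5
```

At TEMPO's scale the same counts would be accumulated over tiles rather than computed on a full in-memory array, but the metric is unchanged.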
- Africa > Sudan (0.14)
- North America > United States > California > San Francisco County > San Francisco (0.14)
- Asia > Southeast Asia (0.04)
- (13 more...)
An AI system to help scientists write expert-level empirical software
Aygün, Eser, Belyaeva, Anastasiya, Comanici, Gheorghe, Coram, Marc, Cui, Hao, Garrison, Jake, Johnston, Renee, Kast, Anton, McLean, Cory Y., Norgaard, Peter, Shamsi, Zahra, Smalling, David, Thompson, James, Venugopalan, Subhashini, Williams, Brian P., He, Chujun, Martinson, Sarah, Plomecka, Martyna, Wei, Lai, Zhou, Yuchen, Zhu, Qian-Ze, Abraham, Matthew, Brand, Erica, Bulanova, Anna, Cardille, Jeffrey A., Co, Chris, Ellsworth, Scott, Joseph, Grace, Kane, Malcolm, Krueger, Ryan, Kartiwa, Johan, Liebling, Dan, Lueckmann, Jan-Matthis, Raccuglia, Paul, Wang, Xuefei, Chou, Katherine, Manyika, James, Matias, Yossi, Platt, John C., Dorfman, Lizzie, Mourad, Shibl, Brenner, Michael P.
The cycle of scientific discovery is frequently bottlenecked by the slow, manual creation of software to support computational experiments. To address this, we present an AI system that creates expert-level scientific software whose goal is to maximize a quality metric. The system uses a Large Language Model (LLM) and Tree Search (TS) to systematically improve the quality metric and intelligently navigate the large space of possible solutions. The system achieves expert-level results when it explores and integrates complex research ideas from external sources. The effectiveness of tree search is demonstrated across a wide range of benchmarks. In bioinformatics, it discovered 40 novel methods for single-cell data analysis that outperformed the top human-developed methods on a public leaderboard. In epidemiology, it generated 14 models that outperformed the CDC ensemble and all other individual models for forecasting COVID-19 hospitalizations. Our method also produced state-of-the-art software for geospatial analysis, neural activity prediction in zebrafish, time series forecasting and numerical solution of integrals. By devising and implementing novel solutions to diverse tasks, the system represents a significant step towards accelerating scientific progress.
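The core loop described here — propose candidate solutions, score them against a quality metric, and preferentially expand the most promising — is a best-first search. A toy, LLM-free sketch under stated assumptions: `quality` stands in for the benchmark metric and `mutate` for LLM-proposed edits; both are hypothetical placeholders, not the paper's components:

```python
import heapq

def quality(x: int) -> float:
    """Stand-in for the benchmark metric the real system maximizes."""
    return -(x - 7) ** 2

def mutate(x: int) -> list[int]:
    """Stand-in for LLM-proposed edits to a candidate solution."""
    return [x - 1, x + 1, 2 * x]

def tree_search(start: int, budget: int = 50) -> int:
    """Best-first search: repeatedly expand the highest-scoring candidate."""
    frontier = [(-quality(start), start)]
    best, seen = start, {start}
    for _ in range(budget):
        if not frontier:
            break
        _, node = heapq.heappop(frontier)
        if quality(node) > quality(best):
            best = node
        for child in mutate(node):
            if child not in seen:
                seen.add(child)
                heapq.heappush(frontier, (-quality(child), child))
    return best

print(tree_search(0))  # converges to 7, the toy metric's maximizer
```

In the real system each node is a full program rather than an integer, and scoring means running the program on the task's benchmark, but the search skeleton is the same.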
- North America > United States (0.14)
- North America > Trinidad and Tobago > Trinidad > Arima > Arima (0.04)
- South America > Brazil (0.04)
- North America > Canada > Quebec > Montreal (0.04)
- Research Report > Promising Solution (0.54)
- Research Report > New Finding (0.50)
- Health & Medicine > Public Health (1.00)
- Health & Medicine > Epidemiology (1.00)
- Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.35)
- Health & Medicine > Therapeutic Area > Immunology (0.35)
QUADS: QUAntized Distillation Framework for Efficient Speech Language Understanding
Biswas, Subrata, Khan, Mohammad Nur Hossain, Islam, Bashima
Spoken Language Understanding (SLU) systems must balance performance and efficiency, particularly in resource-constrained environments. Existing methods apply distillation and quantization separately, leading to suboptimal compression because distillation ignores quantization constraints. We propose QUADS, a unified framework that optimizes both through multi-stage training with a pre-tuned model, enhancing adaptability to low-bit regimes while maintaining accuracy. QUADS achieves 71.13% accuracy on SLURP and 99.20% on FSC, with degradations of at most 5.56% relative to state-of-the-art models. It also reduces computational complexity by 60--73$\times$ (GMACs) and model size by 83--700$\times$, demonstrating strong robustness under extreme quantization. These results establish QUADS as a highly efficient solution for real-world, resource-constrained SLU applications.
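The two ingredients being unified — low-bit weight quantization and knowledge distillation — can each be sketched in a few lines; a minimal NumPy illustration, assuming uniform symmetric quantization and a standard distillation objective (the function names and the exact loss weighting are illustrative, not QUADS's training recipe):

```python
import numpy as np

def quantize(w: np.ndarray, bits: int = 4) -> np.ndarray:
    """Uniform symmetric quantization of weights onto 2^(bits-1)-1 levels per sign."""
    levels = 2 ** (bits - 1) - 1
    scale = np.abs(w).max() / levels
    return np.round(w / scale) * scale

def softmax(z: np.ndarray) -> np.ndarray:
    e = np.exp(z - z.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def distill_loss(student_logits, teacher_logits, labels, alpha=0.5):
    """alpha * cross-entropy(labels) + (1 - alpha) * KL(teacher || student)."""
    p_s, p_t = softmax(student_logits), softmax(teacher_logits)
    ce = -np.log(p_s[np.arange(len(labels)), labels]).mean()
    kl = (p_t * (np.log(p_t) - np.log(p_s))).sum(axis=-1).mean()
    return alpha * ce + (1 - alpha) * kl

w = np.array([-1.0, 0.3, 0.9])
print(quantize(w))  # every value snapped to a multiple of max|w|/7
```

QUADS's point is that these two steps interact: training the student jointly with quantization in the loop, rather than quantizing a finished distilled model, is what avoids the compounding accuracy loss.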
- Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
- Information Technology > Artificial Intelligence > Natural Language (1.00)
- Information Technology > Artificial Intelligence > Machine Learning (1.00)
- Information Technology > Human Computer Interaction > Interfaces (0.93)
CLARIFY: Contrastive Preference Reinforcement Learning for Untangling Ambiguous Queries
Mu, Ni, Hu, Hao, Hu, Xiao, Yang, Yiqin, Xu, Bo, Jia, Qing-Shan
Preference-based reinforcement learning (PbRL) bypasses explicit reward engineering by inferring reward functions from human preference comparisons, enabling better alignment with human intentions. However, humans often struggle to label a clear preference between similar segments, reducing label efficiency and limiting PbRL's real-world applicability. To address this, we propose an offline PbRL method, Contrastive LeArning for ResolvIng Ambiguous Feedback (CLARIFY), which learns a trajectory embedding space that incorporates preference information, ensuring that clearly distinguished segments are spaced apart and thereby facilitating the selection of less ambiguous queries. Extensive experiments demonstrate that CLARIFY outperforms baselines in both non-ideal teacher and real human feedback settings. Our approach not only selects more clearly distinguished queries but also learns meaningful trajectory embeddings.
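The embedding objective described — spacing apart segment pairs with a clear preference while keeping ambiguous pairs close — resembles a margin-based contrastive loss. A minimal sketch on precomputed segment embeddings, assuming a Euclidean embedding space (function and variable names are illustrative, not CLARIFY's actual objective):

```python
import numpy as np

def contrastive_preference_loss(z_a: np.ndarray, z_b: np.ndarray,
                                clear: bool, margin: float = 1.0) -> float:
    """Push apart embeddings of segment pairs with a clear preference;
    pull together pairs the labeler found ambiguous."""
    d = np.linalg.norm(z_a - z_b)
    if clear:                       # clearly distinguished: want distance >= margin
        return max(0.0, margin - d) ** 2
    return float(d ** 2)            # ambiguous: embeddings should coincide

# A clearly-preferred pair already separated by >= margin incurs zero loss.
z1, z2 = np.array([0.0, 0.0]), np.array([2.0, 0.0])
print(contrastive_preference_loss(z1, z2, clear=True))   # 0.0
print(contrastive_preference_loss(z1, z2, clear=False))  # 4.0
```

Once such a space is learned, query selection can favor candidate segment pairs whose embeddings are far apart, since those are the pairs a human labeler is most likely to rank unambiguously.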