Fast Asymptotically Optimal Algorithms for Non-Parametric Stochastic Bandits
We consider the problem of regret minimization in non-parametric stochastic bandits. When the rewards are known to be bounded from above, there exist asymptotically optimal algorithms whose asymptotic regret depends on an infimum of Kullback-Leibler (KL) divergences. These algorithms are computationally expensive and require storing all past rewards, so simpler but non-optimal algorithms are often used instead. We introduce several methods to approximate this KL infimum that drastically reduce the computational and memory costs of existing optimal algorithms while keeping their regret guarantees. We apply our findings to design new variants of the MED and IMED algorithms, and demonstrate their practical benefits with extensive numerical simulations.
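To make the KL-infimum quantity concrete, here is a minimal Python sketch of the IMED index for rewards assumed to lie in [0, 1], computing the empirical infimum KL via a known one-dimensional dual maximization for bounded-support distributions (commonly attributed to Honda and Takemura). The function names and the use of scipy's scalar optimizer are our own illustrative choices; this is not one of the paper's proposed approximation methods.

```python
import numpy as np
from scipy.optimize import minimize_scalar

def kinf(rewards, mu, eps=1e-12):
    """Empirical K_inf(F_hat, mu) for rewards in [0, 1]: the smallest KL
    divergence from the empirical distribution F_hat to any distribution
    with mean above mu, computed via its scalar dual maximization."""
    rewards = np.asarray(rewards, dtype=float)
    if rewards.mean() >= mu:
        return 0.0      # F_hat itself already has mean >= mu
    if mu >= 1.0:
        return np.inf   # no distribution on [0, 1] has mean above 1

    def neg_dual(lam):
        # dual objective: E_F[log(1 - lam * (X - mu) / (1 - mu))]
        vals = 1.0 - lam * (rewards - mu) / (1.0 - mu)
        return -np.mean(np.log(np.maximum(vals, eps)))

    res = minimize_scalar(neg_dual, bounds=(0.0, 1.0), method="bounded")
    return -res.fun

def imed_choose(rewards_per_arm):
    """One IMED step: play the arm minimizing N_a * K_inf(F_a, mu*) + log N_a."""
    mu_star = max(np.mean(r) for r in rewards_per_arm)
    index = [len(r) * kinf(r, mu_star) + np.log(len(r))
             for r in rewards_per_arm]
    return int(np.argmin(index))
```

Note that this naive version stores every past reward for each arm and re-solves the optimization at every step; this is exactly the computational and memory cost that the paper's approximations aim to remove.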
Sound and Complete Verification of Polynomial Networks
Polynomial Networks (PNs) have recently demonstrated promising performance on face and image recognition. However, the robustness of PNs is unclear, so obtaining certificates becomes imperative for enabling their adoption in real-world applications. Existing verification algorithms for ReLU neural networks (NNs) based on classical branch-and-bound (BaB) techniques cannot be trivially applied to PN verification. In this work, we devise a new bounding method, equipped with BaB for global convergence guarantees, called Verification of Polynomial Networks (VPN for short). A key insight is that our bounds are much tighter than those of the interval bound propagation (IBP) and DeepT-Fast [Bonaert et al., 2021] baselines. This enables sound and complete PN verification, with empirical validation on the MNIST, CIFAR10 and STL10 datasets. We believe our method is also of independent interest for general NN verification.
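For reference, the IBP baseline mentioned above can be summarized in a few lines. The sketch below is our own illustration, not the paper's VPN method: it propagates interval bounds through an affine layer and through the elementwise (Hadamard) products that give polynomial networks their multiplicative interactions. The looseness of the interval product rule is one reason tighter PN-specific bounds help.

```python
import numpy as np

def ibp_linear(W, b, lo, hi):
    """Propagate the box [lo, hi] through the affine map x -> W x + b."""
    mid, rad = (lo + hi) / 2.0, (hi - lo) / 2.0
    mid_out = W @ mid + b
    rad_out = np.abs(W) @ rad
    return mid_out - rad_out, mid_out + rad_out

def ibp_hadamard(lo1, hi1, lo2, hi2):
    """Interval bounds for an elementwise product z = x * y.
    Taking corner-wise extrema ignores correlations between x and y,
    which is why IBP can be very loose for polynomial networks."""
    corners = np.stack([lo1 * lo2, lo1 * hi2, hi1 * lo2, hi1 * hi2])
    return corners.min(axis=0), corners.max(axis=0)
```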
Observational Scaling Laws and the Predictability of Language Model Performance
Understanding how language model performance varies with scale is critical to benchmark and algorithm development. Scaling laws are one approach to building this understanding, but the requirement of training models across many different scales has limited their use. We propose an alternative, observational approach that bypasses model training and instead builds scaling laws from 100 publicly available models. Building a single scaling law from multiple model families is challenging due to large variations in their training compute efficiencies and capabilities. However, we show that these variations are consistent with a simple, generalized scaling law where language model performance is a function of a low-dimensional capability space, and model families only vary in their efficiency in converting training compute to capabilities. Using this approach, we show the surprising predictability of complex scaling phenomena: we show that several emergent phenomena follow a smooth, sigmoidal behavior and are predictable from small models; we show that the agent performance of models such as GPT-4 can be precisely predicted from simpler non-agentic benchmarks; and we show how to predict the impact of post-training interventions like Chain-of-Thought and Self-Consistency as language model capabilities continue to improve.
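As a rough illustration of the observational recipe (with hypothetical variable names; the actual benchmarks, model list, and fitting details are in the paper), one can extract a low-dimensional capability space from a model-by-benchmark score matrix with PCA, then fit a sigmoid mapping a capability coordinate to a downstream metric:

```python
import numpy as np
from scipy.optimize import curve_fit

def capability_coords(scores, k=3):
    """PCA on a (n_models, n_benchmarks) score matrix: returns the
    top-k 'capability' coordinates for each model."""
    X = scores - scores.mean(axis=0)
    _, _, Vt = np.linalg.svd(X, full_matrices=False)
    return X @ Vt[:k].T

def sigmoid(x, a, b, c, d):
    """Smooth, saturating curve used to fit seemingly 'emergent' metrics."""
    return a / (1.0 + np.exp(-b * (x - c))) + d

# Hypothetical usage: `scores` holds benchmark accuracies for ~100 models,
# `y` holds a downstream (possibly emergent) metric for the same models.
# pc1 = capability_coords(scores)[:, 0]
# params, _ = curve_fit(sigmoid, pc1, y, p0=[1.0, 1.0, 0.0, 0.0], maxfev=10000)
```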
SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation (Supplementary Material)
Due to space constraints in the main paper, we elaborate here on additional details of the 32 datasets, including useful links to their license statements and other ethics concerns, in Sec. B.1.

B.1 Dataset Descriptions
This section describes the 32 datasets we study. Note that all of these are public academic datasets, each holding a license. We follow the common practice of using them in our non-commercial research, and refer readers to their homepages or papers for more details regarding licenses and their policies to ensure personal information protection.

3DPW (Figure 1a) features accurate SMPL annotations and 60 video sequences captured in diverse environments. We follow the official definition of the train, val, and test splits.

AGORA [34] (Figure 1b) is a synthetic dataset, rendered with high-quality human scans and realistic 3D scenes. It consists of 4240 textured human scans with diverse poses and appearances, each fitted with accurate SMPL-X annotations.
SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation
Expressive human pose and shape estimation (EHPS) unifies body, hands, and face motion capture and has numerous applications. Despite encouraging progress, current state-of-the-art methods still depend largely on a confined set of training datasets. In this work, we investigate scaling up EHPS towards the first generalist foundation model (dubbed SMPLer-X), using up to ViT-Huge as the backbone and training with up to 4.5M instances from diverse data sources. With big data and a large model, SMPLer-X exhibits strong performance across diverse test benchmarks and excellent transferability even to unseen environments.
Learning to summarize from human feedback
As language models become more powerful, training and evaluation are increasingly bottlenecked by the data and metrics used for a particular task. For example, summarization models are often trained to predict human reference summaries and evaluated using ROUGE, but both of these metrics are rough proxies for what we really care about: summary quality. In this work, we show that it is possible to significantly improve summary quality by training a model to optimize for human preferences. We collect a large, high-quality dataset of human comparisons between summaries, train a model to predict the human-preferred summary, and use that model as a reward function to fine-tune a summarization policy using reinforcement learning. We apply our method to a version of the TL;DR dataset of Reddit posts [63] and find that our models significantly outperform both human reference summaries and much larger models fine-tuned with supervised learning alone. Our models also transfer to CNN/DM news articles [22], producing summaries nearly as good as the human reference without any news-specific fine-tuning.
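The core training signal can be illustrated in a few lines. Below is a minimal sketch (our own, with hypothetical names and an illustrative beta value, not the paper's exact implementation) of the pairwise reward-model loss on human comparisons and the KL-penalized reward typically used when fine-tuning the policy with RL:

```python
import torch
import torch.nn.functional as F

def reward_model_loss(r_preferred, r_other):
    """Pairwise comparison loss: maximize the log-probability that the
    human-preferred summary receives the higher reward (Bradley-Terry style)."""
    return -F.logsigmoid(r_preferred - r_other).mean()

def penalized_reward(rm_score, logp_policy, logp_sft, beta=0.05):
    """Reward for RL fine-tuning: the learned reward minus a KL-style
    penalty that keeps the policy close to the supervised baseline.
    beta=0.05 is a hypothetical value, not the paper's setting."""
    return rm_score - beta * (logp_policy - logp_sft)
```

The penalty term discourages the policy from drifting into regions where the learned reward model is unreliable, which is why some form of it is standard in this kind of RL fine-tuning.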