Almost Free: Self-concordance in Natural Exponential Families and an Application to Bandits
We prove that single-parameter natural exponential families with subexponential tails are self-concordant with polynomial-sized parameters. For subgaussian natural exponential families we establish an exact characterization of the growth rate of the self-concordance parameter. Applying these findings to bandits allows us to fill gaps in the literature: We show that optimistic algorithms for generalized linear bandits enjoy regret bounds that are both second-order (scale with the variance of the optimal arm's reward distribution) and free of an exponential dependence on the bound of the problem parameter in the leading term. To the best of our knowledge, ours is the first regret bound for generalized linear bandits with subexponential tails, broadening the class of problems to include Poisson, exponential and gamma bandits.
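As a rough illustration of the optimistic generalized-linear-bandit setting this abstract refers to, here is a minimal Python sketch of a UCB-style Poisson bandit whose mean reward is exp(<x, theta*>). The confidence radius, the gradient-ascent maximum-likelihood step, and all constants are assumptions made purely for illustration; this is not the paper's algorithm and carries none of its guarantees.

```python
import numpy as np

# Illustrative sketch only: optimistic (UCB-style) generalized linear bandit
# with Poisson rewards. Confidence radius and MLE routine are heuristic choices.
rng = np.random.default_rng(0)
d, K, T = 3, 5, 500
theta_star = rng.normal(scale=0.3, size=d)   # unknown parameter
arms = rng.normal(size=(K, d))               # fixed arm features

lam = 1.0                       # ridge regularization strength
V = lam * np.eye(d)             # regularized design matrix
Xs, ys = [], []

def poisson_mle(theta0, Xs, ys, iters=50, lr=0.1):
    # Crude gradient ascent on the ridge-penalized Poisson log-likelihood.
    theta = np.array(theta0, dtype=float)
    X, y = np.array(Xs), np.array(ys)
    for _ in range(iters):
        grad = X.T @ (y - np.exp(X @ theta)) - lam * theta
        theta = theta + lr * grad / len(y)
    return theta

theta_hat = np.zeros(d)
for t in range(T):
    beta_t = np.sqrt(np.log(t + 2))                       # heuristic confidence radius
    Vinv = np.linalg.inv(V)
    widths = np.sqrt(np.einsum('ki,ij,kj->k', arms, Vinv, arms))
    ucb = np.exp(arms @ theta_hat) + beta_t * widths      # optimistic reward index
    a = int(np.argmax(ucb))
    x = arms[a]
    reward = rng.poisson(np.exp(x @ theta_star))
    Xs.append(x); ys.append(reward)
    V += np.outer(x, x)
    theta_hat = poisson_mle(theta_hat, Xs, ys)            # refit after each round
```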
Improving Context-Aware Preference Modeling for Language Models
Nicolas Le Roux
While finetuning language models (LMs) from pairwise preferences has proven remarkably effective, the underspecified nature of natural language presents critical challenges. Direct preference feedback is uninterpretable, difficult to provide where multidimensional criteria may apply, and often inconsistent, either because it is based on incomplete instructions or provided by diverse principals. To address these challenges, we consider the two-step preference modeling procedure that first resolves the underspecification by selecting a context, and then evaluates preference with respect to the chosen context. We decompose reward modeling error according to these two steps, which suggests that supervising context in addition to context-specific preference may be a viable approach to aligning models with diverse human preferences. For this to work, the ability of models to evaluate context-specific preference is critical. To this end, we contribute context-conditioned preference datasets and accompanying experiments that investigate the ability of language models to evaluate context-specific preference. We use our datasets to (1) show that existing preference models benefit from, but fail to fully consider, added context, (2) finetune a context-aware reward model whose context-specific performance exceeds that of GPT-4 and Llama 3 70B on the tested datasets, and (3) investigate the value of context-aware preference modeling.
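To make the two-step procedure concrete, the following is a minimal, self-contained Python sketch: a context is fixed first, and only then is the pairwise preference evaluated conditioned on that context. The function `score_under_context` is a hypothetical placeholder (mocked with a length heuristic so the sketch runs end to end) standing in for a learned context-aware reward model; none of these names come from the paper.

```python
from dataclasses import dataclass

@dataclass
class Example:
    prompt: str
    response_a: str
    response_b: str

def score_under_context(context: str, prompt: str, response: str) -> float:
    """Hypothetical placeholder reward model: prefers shorter answers under a
    'concise' context and longer ones otherwise. A real system would call a
    learned context-conditioned reward model here."""
    length = len(response.split())
    return -length if "concise" in context else length

def context_specific_preference(context: str, ex: Example) -> str:
    # Step 1: the context is fixed by the caller; Step 2: evaluate preference
    # conditioned on that context.
    sa = score_under_context(context, ex.prompt, ex.response_a)
    sb = score_under_context(context, ex.prompt, ex.response_b)
    return "A" if sa >= sb else "B"

ex = Example(
    prompt="Explain overfitting.",
    response_a="Overfitting is when a model memorizes noise in the training data.",
    response_b="Overfitting occurs when a model fits idiosyncrasies of the training "
               "set so closely that it generalizes poorly; remedies include "
               "regularization, more data, and early stopping.",
)
print(context_specific_preference("Be concise.", ex))   # -> "A"
print(context_specific_preference("Be thorough.", ex))  # -> "B"
```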
Author Feedback
We greatly appreciate the three reviewers' valuable comments; our responses follow. The cumulative hazard function therefore also plays a crucial role in generating a median predictor. Performance is evaluated by the mean absolute error, and these results demonstrate the effectiveness of our model in the prediction task.
Function-Space Distributions over Kernels
Gregory Benton, Wesley J. Maddox, Jayson Salkey, Julio Albinati, Andrew Gordon Wilson
Gaussian processes are flexible function approximators, with inductive biases controlled by a covariance kernel. Learning the kernel is the key to representation learning and strong predictive performance. In this paper, we develop functional kernel learning (FKL) to directly infer functional posteriors over kernels. In particular, we place a transformed Gaussian process over a spectral density to induce a non-parametric distribution over kernel functions. The resulting approach enables learning of rich representations, with support for any stationary kernel, uncertainty over the values of the kernel, and an interpretable specification of a prior directly over kernels, without requiring sophisticated initialization or manual intervention. We perform inference through elliptical slice sampling, which is especially well suited to marginalizing posteriors with the strongly correlated priors typical of function-space modeling. We develop our approach for nonuniform, large-scale, multi-task, and multidimensional data, and show promising performance in a wide range of settings, including interpolation, extrapolation, and kernel recovery experiments.
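As a minimal numerical illustration of the underlying construction (a distribution over spectral densities inducing a distribution over stationary kernels via Bochner's theorem), the sketch below draws a log-spectral density from a GP prior on a frequency grid and integrates it against cosines to obtain a kernel. Grid sizes, lengthscales, and the trapezoidal quadrature are assumptions made for illustration; this is not the paper's FKL implementation, which performs posterior inference with elliptical slice sampling.

```python
import numpy as np

# Illustrative sketch: a GP prior over a log-spectral density induces a
# random stationary kernel via Bochner's theorem,
#   k(tau) = integral of S(omega) * cos(omega * tau) d omega,
# approximated here with the trapezoidal rule on a frequency grid.
rng = np.random.default_rng(0)

omegas = np.linspace(0.0, 10.0, 200)   # frequency grid
taus = np.linspace(0.0, 5.0, 100)      # lags at which to evaluate the kernel

def rbf(x, lengthscale=1.0):
    # Squared-exponential covariance for the GP prior over log S(omega).
    return np.exp(-0.5 * (x[:, None] - x[None, :]) ** 2 / lengthscale ** 2)

K_ww = rbf(omegas) + 1e-8 * np.eye(len(omegas))
log_S = rng.multivariate_normal(np.zeros(len(omegas)), K_ww)
S = np.exp(log_S)                      # exponentiation keeps the density positive

# Trapezoidal approximation of the Bochner integral for each lag tau.
k_tau = np.trapz(S[None, :] * np.cos(np.outer(taus, omegas)), omegas, axis=1)
k_tau /= k_tau[0]                      # normalize so that k(0) = 1
print(k_tau[:5])
```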
Batched Thompson Sampling
We introduce a novel anytime batched Thompson sampling policy for multi-armed bandits in which the agent observes the rewards of her actions and adjusts her policy only at the end of a small number of batches. We show that this policy simultaneously achieves a problem-dependent regret of order O(log(T)) and a minimax regret of order O(√(T log(T))), while the number of batches can be bounded by O(log(T)) independently of the problem instance over a time horizon T. We also prove that, in expectation, the instance-dependent batch complexity of our policy is of order O(log log(T)). These results indicate that Thompson sampling performs competitively with recently proposed algorithms for the batched setting, which optimize the batch structure for a given time horizon T and prioritize exploration at the beginning of the experiment to eliminate suboptimal actions. Unlike these algorithms, the batched Thompson sampling algorithm we propose is an anytime policy, i.e., it operates without knowledge of the time horizon T, and as such it is the only anytime algorithm that achieves optimal regret with O(log log(T)) expected batch complexity. This is achieved through a dynamic batching strategy, which uses the agent's estimates to adaptively increase the batch duration.
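The following is a minimal Python sketch of the batched idea under assumed details: Bernoulli arms, Beta(1,1) priors, and a simple doubling batch length standing in for the paper's adaptive batch-duration rule. The key property it illustrates is that the posterior is only updated at the end of each batch, so the policy changes a small number of times.

```python
import numpy as np

# Illustrative sketch of batched Thompson sampling for Bernoulli bandits.
# Doubling batch lengths are an assumption; the paper uses a dynamic rule
# based on the agent's estimates.
rng = np.random.default_rng(0)
mus = np.array([0.3, 0.5, 0.7])      # unknown arm means
K, T = len(mus), 10_000

alpha = np.ones(K)                    # Beta posterior parameters (Beta(1,1) priors)
beta = np.ones(K)
t, batch_len, n_batches = 0, 1, 0

while t < T:
    # The posterior is frozen for the whole batch; updates happen only at its end.
    pulls = np.zeros(K, dtype=int)
    succ = np.zeros(K, dtype=int)
    for _ in range(min(batch_len, T - t)):
        theta = rng.beta(alpha, beta)        # sample from the frozen posterior
        a = int(np.argmax(theta))
        r = rng.random() < mus[a]
        pulls[a] += 1
        succ[a] += r
        t += 1
    alpha += succ
    beta += pulls - succ
    batch_len *= 2                           # doubling stands in for the adaptive rule
    n_batches += 1

print("batches used:", n_batches)
```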