AITopics | Services

Collaborating Authors

Services

Categorized Bandits

Neural Information Processing SystemsJun-1-2025, 01:39:14 GMT

We introduce a new stochastic multi-armed bandit setting where arms are grouped inside "ordered" categories. The motivating example comes from e-commerce, where a customer typically has a greater appetence for items of a specific wellidentified but unknown category than any other one. We introduce three concepts of ordering between categories, inspired by stochastic dominance between random variables, which are gradually weaker so that more and more bandit scenarios satisfy at least one of them. We first prove instance-dependent lower bounds on the cumulative regret for each of these models, indicating how the complexity of the bandit problems increases with the generality of the ordering concept considered. We also provide algorithms that fully leverage the structure of the model with their associated theoretical guarantees. Finally, we have conducted an analysis on real data to highlight that those ordered categories actually exist in practice.

category, data mining, machine learning, (18 more...)

Neural Information Processing Systems

Country:

Europe > Finland (0.14)
North America > Canada (0.14)
Europe > Netherlands (0.14)

Industry: Information Technology > Services > e-Commerce Services (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.90)

Add feedback

AuctionNet: A Novel Benchmark for Decision-Making in Large-Scale Games

Neural Information Processing SystemsMay-31-2025, 22:38:28 GMT

Decision-making in large-scale games is an essential research area in artificial intelligence (AI) with significant real-world impact. However, the limited access to realistic large-scale game environments has hindered research progress in this area. In this paper, we present AuctionNet, a benchmark for bid decision-making in largescale ad auctions derived from a real-world online advertising platform. AuctionNet is composed of three parts: an ad auction environment, a pre-generated dataset based on the environment, and performance evaluations of several baseline bid decision-making algorithms. More specifically, the environment effectively replicates the integrity and complexity of real-world ad auctions through the interaction of several modules: the ad opportunity generation module employs deep generative networks to bridge the gap between simulated and real-world data while mitigating the risk of sensitive data exposure; the bidding module implements diverse autobidding agents trained with different decision-making algorithms; and the auction module is anchored in the classic Generalized Second Price (GSP) auction but also allows for customization of auction mechanisms as needed.

data mining, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Country: Europe > Germany (0.14)

Genre: Research Report > New Finding (0.46)

Industry:

Marketing (1.00)
Information Technology > Services (0.89)
Leisure & Entertainment > Games > Computer Games (0.48)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.94)
Information Technology > Data Science > Data Mining (0.68)

Add feedback

Parameter Symmetry and Noise Equilibrium of Stochastic Gradient Descent Mingze Wang Massachusetts Institute of Technology, Peking University NTT Research

Neural Information Processing SystemsMay-31-2025, 22:08:39 GMT

Symmetries are prevalent in deep learning and can significantly influence the learning dynamics of neural networks. In this paper, we examine how exponential symmetries - a broad subclass of continuous symmetries present in the model architecture or loss function - interplay with stochastic gradient descent (SGD). We first prove that gradient noise creates a systematic motion (a "Noether flow") of the parameters θ along the degenerate direction to a unique initializationindependent fixed point θ

artificial intelligence, machine learning, symmetry, (15 more...)

Neural Information Processing Systems

Country: North America > United States > Massachusetts (0.50)

Genre: Research Report > Experimental Study (0.93)

Industry: Information Technology > Services (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.84)

Add feedback

Supplemental Material: CHIP: A Hawkes Process Model for Continuous-time Networks with Scalable and Consistent Estimation

Neural Information Processing SystemsMay-31-2025, 19:03:36 GMT

The spectral clustering algorithm for directed networks that we consider in this paper is shown in Algorithm A.1. This algorithm is used for the community detection step in our proposed CHIP estimation procedure. For undirected networks, which we use for the theoretical analysis in Section 4, spectral clustering is performed by running k-means clustering on the rows of the eigenvector matrix of N or A, not the rows of the concatenated singular vector matrix. The three parameters µ, α, β can be estimated by maximizing (A.1) using standard numerical methods for non-linear optimization (Nocedal & Wright, 2006). In our CHIP model, we have separate (µ, α, β) parameters for each block pair (a, b).

artificial intelligence, machine learning, matrix, (16 more...)

Neural Information Processing Systems

Country: North America > United States (0.28)

Genre: Research Report (0.47)

Industry: Information Technology > Services (0.46)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.54)

Add feedback

CHIP: A Hawkes Process Model for Continuous-time Networks with Scalable and Consistent Estimation

Neural Information Processing SystemsMay-31-2025, 19:03:29 GMT

In many application settings involving networks, such as messages between users of an on-line social network or transactions between traders in financial markets, the observed data consist of timestamped relational events, which form a continuoustime network. We propose the Community Hawkes Independent Pairs (CHIP) generative model for such networks. We show that applying spectral clustering to an aggregated adjacency matrix constructed from the CHIP model provides consistent community detection for a growing number of nodes and time duration. We also develop consistent and computationally efficient estimators for the model parameters. We demonstrate that our proposed CHIP model and estimation procedure scales to large networks with tens of thousands of nodes and provides superior fits than existing continuous-time network models on several real networks.

artificial intelligence, machine learning, spectral, (19 more...)

Neural Information Processing Systems

Country: North America > United States (0.28)

Genre:

Research Report > Strength High (0.46)
Research Report > Experimental Study (0.46)

Industry:

Banking & Finance (0.68)
Information Technology > Services (0.49)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Communications > Networks (0.87)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.67)

Add feedback

Decomposable Transformer Point Processes

Neural Information Processing SystemsMay-31-2025, 18:18:18 GMT

The standard paradigm of modeling marked point processes is by parameterizing the intensity function using an attention-based (Transformer-style) architecture. Despite the flexibility of these methods, their inference is based on the computationally intensive thinning algorithm. In this work, we propose a framework where the advantages of the attention-based architecture are maintained and the limitation of the thinning algorithm is circumvented. The framework depends on modeling the conditional distribution of inter-event times with a mixture of log-normals satisfying a Markov property and the conditional probability mass function for the marks with a Transformer-based architecture. The proposed method attains state-of-the-art performance in predicting the next event of a sequence given its history. The experiments also reveal the efficacy of the methods that do not rely on the thinning algorithm during inference over the ones they do. Finally, we test our method on the challenging long-horizon prediction task and find that it outperforms a baseline developed specifically for tackling this task; importantly, inference requires just a fraction of time compared to the thinning-based baseline.

artificial intelligence, machine learning, natural language, (21 more...)

Neural Information Processing Systems

Country:

Europe (0.46)
North America > United States > New York (0.28)

Genre: Research Report > Experimental Study (1.00)

Industry: Information Technology > Services (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.34)

Add feedback

Discriminative Topic Modeling with Logistic LDA

Iryna Korshunova, Hanchen Xiong, Mateusz Fedoryszak, Lucas Theis

Neural Information Processing SystemsMay-31-2025, 17:56:55 GMT

Despite many years of research into latent Dirichlet allocation (LDA), applying LDA to collections of non-categorical items is still challenging. Yet many problems with much richer data share a similar structure and could benefit from the vast literature on LDA. We propose logistic LDA, a novel discriminative variant of latent Dirichlet allocation which is easy to apply to arbitrary inputs. In particular, our model can easily be applied to groups of images, arbitrary text embeddings, and integrates well with deep neural networks. Although it is a discriminative model, we show that logistic LDA can learn from unlabeled data in an unsupervised manner by exploiting the group structure present in the data. In contrast to other recent topic models designed to handle arbitrary inputs, our model does not sacrifice the interpretability and principled motivation of LDA.

artificial intelligence, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Country: North America > Canada (0.14)

Industry: Information Technology > Services (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.90)

Add feedback

Node Classification on Graphs with Few-Shot Novel Labels via Meta Transformed Network Embedding

Neural Information Processing SystemsMay-31-2025, 16:58:18 GMT

We study the problem of node classification on graphs with few-shot novel labels, which has two distinctive properties: (1) There are novel labels to emerge in the graph; (2) The novel labels have only a few representative nodes for training a classifier. The study of this problem is instructive and corresponds to many applications such as recommendations for newly formed groups with only a few users in online social networks. To cope with this problem, we propose a novel Meta Transformed Network Embedding framework (MetaTNE), which consists of three modules: (1) A structural module provides each node a latent representation according to the graph structure.

data mining, machine learning, node, (21 more...)

Neural Information Processing Systems

Country:

North America > United States > Wisconsin (0.14)
Asia > China > Shaanxi Province (0.14)

Genre: Research Report (0.93)

Industry: Information Technology > Services (0.35)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Information Management (0.93)
(3 more...)

Add feedback

Tinder's new head pushes company to move away from 'hookup' reputation and rebrand for Gen Z users

FOX NewsMay-31-2025, 16:00:28 GMT

'The Big Weekend Show' co-hosts discuss Tinder user traffic peaking during'Dating Sunday.' Spencer Rascoff, the CEO of Tinder parent company Match Group, is promising to change the reputation of Tinder as a casual hookup app into a more serious dating app. They don't drink as much alcohol, they don't have as much sex," Rascoff said to a group of investors, according to The Wall Street Journal. "We need to adapt our products to accept that reality." Unlike the millennial generation, which helped popularize Tinder and shaped the dating app into a domestic and international success, Gen Z appears to be less interested in purely casual dating experiences. Some commentators believe that Gen Z is a generation that is tired of "ghosting," which is defined as suddenly cutting off communications with another person without warning, and instead seeking more authentic dating experiences.

artificial intelligence, social media, tinder, (9 more...)

FOX News

Country: North America > United States (0.17)

Industry:

Information Technology > Services (0.87)
Media > News (0.59)

Technology:

Information Technology > Communications > Social Media (0.99)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.99)

Add feedback

Optimal Pricing in Repeated Posted-Price Auctions with Different Patience of the Seller and the Buyer

Arsenii Vanunts, Alexey Drutsa

Neural Information Processing SystemsMay-31-2025, 13:02:15 GMT

We study revenue optimization pricing algorithms for repeated posted-price auctions where a seller interacts with a single strategic buyer that holds a fixed private valuation. When the participants non-equally discount their cumulative utilities, we show that the optimal constant pricing (which offers the Myerson price) is no longer optimal. In the case of more patient seller, we propose a novel multidimensional optimization functional -- a generalization of the one used to determine Myerson's price. This functional allows to find the optimal algorithm and to boost revenue of the optimal static pricing by an efficient low-dimensional approximation. Numerical experiments are provided to support our results.

algorithm, artificial intelligence, machine learning, (18 more...)

Neural Information Processing Systems

Country: