AITopics | spg

Collaborating Authors

spg

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

REINFORCE Converges to Optimal Policies with Any Learning Rate

Neural Information Processing SystemsJun-10-2026, 17:22:29 GMT

We prove that the classic REINFORCE stochastic policy gradient (SPG) method converges to globally optimal policies in finite-horizon Markov Decision Processes (MDPs) with $\textit{any}$ constant learning rate. To avoid the need for small or decaying learning rates, we introduce two key innovations in the stochastic bandit setting, which we then extend to MDPs.

artificial intelligence, machine learning, proceedings, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Mixed Membership sub-Gaussian Models

Qing, Huan

arXiv.org Machine LearningApr-27-2026

The Gaussian mixture model is widely used in unsupervised learning, owing to its simplicity and interpretability. However, a fundamental limitation of the classical Gaussian mixture model is that it forces each observation to belong to exactly one component. In many practical applications, such as genetics, social network analysis, and text mining, an observation may naturally belong to multiple components or exhibit partial membership in several latent components. To overcome this limitation, we propose the mixed membership sub-Gaussian model, which extends the classical Gaussian mixture framework by allowing each observation to belong to multiple components. This model inherits the interpretability of the classical Gaussian mixture model while offering greater flexibility for capturing complex overlapping structures. We develop an efficient spectral algorithm to estimate the mixed membership of each individual observation, and under mild separation conditions on the component centres, we prove that the estimation error of the per-individual membership vector can be made arbitrarily small with high probability. To our knowledge, this is the first work to provide a computationally efficient estimator with such a vanishing-error guarantee for a mixed-membership extension of the Gaussian mixture model. Extensive experimental studies demonstrate that our method outperforms existing approaches that ignore mixed memberships.

artificial intelligence, data mining, machine learning, (19 more...)

arXiv.org Machine Learning

2604.22633

Country: Asia (0.28)

Genre: Research Report > Experimental Study (0.34)

Industry: Health & Medicine > Therapeutic Area (0.46)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

f1cf2a082126bf02de0b307778ce73a7-Supplemental.pdf

Neural Information Processing SystemsFeb-11-2026, 02:03:13 GMT

gradient, softmax, spg, (15 more...)

Neural Information Processing Systems

Country:

North America > Canada > Alberta (0.14)
North America > United States > New York (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)

Add feedback

f1cf2a082126bf02de0b307778ce73a7-Paper.pdf

Neural Information Processing SystemsFeb-11-2026, 02:03:06 GMT

artificial intelligence, machine learning, softmax, (17 more...)

Neural Information Processing Systems

Country:

North America > Canada > Alberta (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.56)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

b613e70fd9f59310cf0a8d33de3f2800-Supplemental.pdf

Neural Information Processing SystemsFeb-10-2026, 20:36:47 GMT

basis function, fvi, spg, (17 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

6c1e55ec7c43dc51a37472ddcbd756fb-Paper.pdf

Neural Information Processing SystemsFeb-8-2026, 19:14:58 GMT

algorithm, data provider, learner, (16 more...)

Neural Information Processing Systems

Country:

North America > United States (0.05)
Europe > United Kingdom > England > Hampshire > Southampton (0.04)
North America > Canada (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory (0.82)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)

Add feedback

6c1e55ec7c43dc51a37472ddcbd756fb-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-8-2026, 19:14:47 GMT

algorithm, data provider, provider, (16 more...)

Neural Information Processing Systems

Country: Asia > Afghanistan > Parwan Province > Charikar (0.05)

Industry: Banking & Finance (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.33)
Information Technology > Artificial Intelligence > Machine Learning (0.32)

Add feedback

Cold-Start Reinforcement Learning with Softmax Policy Gradient

Nan Ding, Radu Soricut

Neural Information Processing SystemsNov-21-2025, 13:58:53 GMT

The exposure-bias problem has recently received attention in neural-network settings with the "data as demonstrator" [

machine learning, natural language, reinforcement learning, (20 more...)

Neural Information Processing Systems

Country: North America > United States > California > Los Angeles County > Long Beach (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.84)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

follows

Neural Information Processing SystemsOct-3-2025, 04:17:15 GMT

Firstly, we thank the reviewers for their valuable comments. Whilst it is not reasonable in practice to assume that data is sampled i.i.d. As previously stated, we believe our work forms a first step in achieving this goal. We believe that our theoretical model captures this dynamic. An insurance company may gather information from a customer to better evaluate potential risk.

artificial intelligence, data provider, machine learning, (18 more...)

Neural Information Processing Systems

Industry: Banking & Finance > Insurance (0.36)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.33)
Information Technology > Artificial Intelligence > Machine Learning (0.32)

Add feedback