AITopics | infinite

infinite

If you are looking for an answer to the question "What is Artificial Intelligence?" and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

On Language Generation in the Limit with Bounded Memory

Kleinberg, Jon, Mehrotra, Anay, Saberi, Amin, Velegkas, Grigoris

arXiv.org Machine Learning | May-29-2026

We study language generation in the limit under bounded memory. In this task, a learner observes examples from an unknown target language one at a time and must eventually output only new valid examples. Prior work assumes access to the entire history, a strong assumption since realistic algorithms retain limited past information. Classical work in learning theory shows memory constraints dramatically alter learnability; we extend this to language generation. First, we study memoryless generators. Under a mild enumeration restriction, every countable collection of infinite languages remains generable without memory. Without this restriction, we exactly characterize when memoryless generation is possible. For finite collections, we characterize the optimal minimax density achievable by memoryless generators -- the best density guaranteed against any collection of a given size. This combinatorial bound relies on Sperner's theorem and symmetric chain decompositions. We further show that a sliding window of the last $W$ examples does not improve this worst-case density, whereas allowing it to store $b$ adaptively chosen past examples improves the achievable density for every $b \geq 1$. Finally, we revisit identification in the limit, where the learner must converge to a single correct hypothesis for the target language. We focus on its incremental variant, where the learner remembers only its previous guess. Here, although exact identification fails on a collection of just three languages, a mild relaxation requiring convergence to an "approximate" version of the target is achievable for every finite collection. These results show bounded memory affects these tasks differently: generation remains achievable for every countable collection, while density and identification are confined to finite collections, with guarantees weakening as the collection grows.
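
The abstract names Sperner's theorem as an ingredient of the density bound. As a reminder of what the theorem states (and not of how the paper applies it), the sketch below brute-forces the largest antichain of subsets of a small ground set and compares its size with the central binomial coefficient. The function names are illustrative and do not come from the paper.

```python
from itertools import combinations
from math import comb


def is_antichain(family):
    """True if no set in the family is a proper subset of another."""
    return not any(a < b or b < a for a, b in combinations(family, 2))


def largest_antichain_size(n):
    """Brute-force the largest antichain among subsets of {0, ..., n-1}.

    The search is exponential in 2**n, so it is only meant for n <= 4.
    """
    subsets = [frozenset(c) for k in range(n + 1) for c in combinations(range(n), k)]
    # Try family sizes from largest to smallest; the first hit is the maximum.
    for size in range(len(subsets), 0, -1):
        if any(is_antichain(fam) for fam in combinations(subsets, size)):
            return size
    return 0


if __name__ == "__main__":
    for n in range(1, 5):
        # Sperner's theorem: the maximum equals C(n, floor(n/2)).
        print(n, largest_antichain_size(n), comb(n, n // 2))
```

For each small n the brute-force value agrees with comb(n, n // 2), which is the size of the middle layer of the subset lattice.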

generator, machine learning, natural language, (20 more...)

arXiv.org Machine Learning

arXiv:2605.30324

Country:

North America > United States (0.28)
Europe (0.27)

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Generation (0.81)

06abed94583030dd50abe6767bd643b1-Supplemental-Conference.pdf

Neural Information Processing Systems | Apr-24-2026, 09:48:27 GMT

algorithm 1, artificial intelligence, dataset, (15 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.75)

Language Generation in the Limit

Neural Information Processing Systems | Feb-16-2026, 00:19:48 GMT

Answers to these questions must begin by formalizing the specification for what a generative algorithm for language should be doing.

artificial intelligence, machine learning, natural language, (20 more...)

Neural Information Processing Systems

Country:

North America > United States > Illinois > Cook County > Chicago (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > New York > Tompkins County > Ithaca (0.04)
(2 more...)

Genre: Research Report > Experimental Study (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Natural Language > Generation (0.65)

Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer

Neural Information Processing Systems | Feb-9-2026, 20:37:40 GMT

Many published baselines are hard to compare to one another due to varying degrees of HP tuning.

machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country:

Europe > Italy > Tuscany > Florence (0.04)
Europe > Italy > Sardinia (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

06abed94583030dd50abe6767bd643b1-Supplemental-Conference.pdf

Neural Information Processing Systems | Feb-7-2026, 07:37:46 GMT

algorithm 1, dataset, heuristic method, (14 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.50)

Certifying Robustness to Programmable Data Bias in Decision Trees

Neural Information Processing Systems | Dec-25-2025, 01:02:43 GMT

Datasets can be biased due to societal inequities, human biases, under-representation of minorities, etc. Our goal is to certify that models produced by a learning algorithm are pointwise-robust to dataset biases. This is a challenging problem: it entails learning models for a large, or even infinite, number of datasets, ensuring that they all produce the same prediction. We focus on decision-tree learning due to the interpretable nature of the models. Our approach allows programmatically specifying bias models across a variety of dimensions (e.g., label-flipping or missing data), composing types of bias, and targeting bias towards a specific group. To certify robustness, we use a novel symbolic technique to evaluate a decision-tree learner on a large, or infinite, number of datasets, certifying that each and every dataset produces the same prediction for a specific test point. We evaluate our approach on datasets that are commonly used in the fairness literature, and demonstrate our approach's viability on a range of bias models.
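
To make the notion of pointwise robustness concrete, here is a minimal sketch under a label-flipping bias model. It is not the symbolic technique the abstract describes: it simply enumerates every dataset reachable by flipping at most `max_flips` labels, retrains a depth-1 tree (a stump) on each, and checks that the prediction at one test point never changes. That brute force is only feasible for tiny datasets, which is the cost a symbolic method would aim to avoid. The names `train_stump` and `is_robust` are hypothetical.

```python
from itertools import combinations


def train_stump(xs, ys):
    """Fit a depth-1 tree on 1-D data with binary labels: fewest training errors wins."""
    best = None
    for t in sorted(set(xs)):
        for left_label in (0, 1):
            right_label = 1 - left_label
            errors = sum(
                (left_label if x <= t else right_label) != y
                for x, y in zip(xs, ys)
            )
            # Strict comparison: ties keep the earliest (threshold, label) pair.
            if best is None or errors < best[0]:
                best = (errors, t, left_label)
    _, t, left_label = best
    return lambda x: left_label if x <= t else 1 - left_label


def is_robust(xs, ys, test_x, max_flips):
    """True if every dataset with at most max_flips flipped labels gives the same prediction."""
    baseline = train_stump(xs, ys)(test_x)
    for k in range(1, max_flips + 1):
        for idx in combinations(range(len(ys)), k):
            flipped = list(ys)
            for i in idx:
                flipped[i] = 1 - flipped[i]
            if train_stump(xs, flipped)(test_x) != baseline:
                return False
    return True


if __name__ == "__main__":
    xs = [1, 2, 3, 4, 5, 6, 7, 8]
    ys = [0, 0, 0, 0, 1, 1, 1, 1]
    print(is_robust(xs, ys, test_x=1, max_flips=1))
    print(is_robust([1, 2], [0, 1], test_x=1, max_flips=1))
```

The number of candidate datasets grows combinatorially with the flip budget, so this enumeration is only an illustration of the property being certified.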

certifying robustness, name change, programmable data bias, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)

A comparison between initialization strategies for the infinite hidden Markov model

Cortese, Federico P., Rossini, Luca

arXiv.org Machine Learning | Dec-4-2025

Infinite hidden Markov models provide a flexible framework for modelling time series with structural changes and complex dynamics, without requiring the number of latent states to be specified in advance. This flexibility is achieved through the hierarchical Dirichlet process prior, while efficient Bayesian inference is enabled by the beam sampler, which combines dynamic programming with slice sampling to truncate the infinite state space adaptively. Despite extensive methodological developments, the role of initialization in this framework has received limited attention. This study addresses this gap by systematically evaluating initialization strategies commonly used for finite hidden Markov models and assessing their suitability in the infinite setting. Results from both simulated and real datasets show that distance-based clustering initializations consistently outperform model-based and uniform alternatives, the latter being the most widely adopted in the existing literature.
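
As a rough illustration of the two families of initialization the abstract contrasts, the sketch below assigns an initial hidden-state sequence either by a distance-based clustering (a plain 1-D k-means) or uniformly at random. It is an assumption-laden toy, not the authors' procedure: the function names, the deterministic choice of starting centres, and the use of 1-D observations are all illustrative, and `n_states` is only the initial number of occupied states that a sampler would start from.

```python
import random


def kmeans_init(obs, n_states, n_iter=20):
    """Initial state sequence from 1-D k-means (Lloyd's algorithm)."""
    ordered = sorted(obs)
    # Deterministic start: spread the initial centres across the sorted data.
    centres = [
        ordered[(2 * j + 1) * len(ordered) // (2 * n_states)]
        for j in range(n_states)
    ]
    states = [0] * len(obs)
    for _ in range(n_iter):
        # Assignment step: nearest centre by absolute distance.
        states = [
            min(range(n_states), key=lambda j: abs(x - centres[j]))
            for x in obs
        ]
        # Update step: move each centre to the mean of its members.
        for j in range(n_states):
            members = [x for x, s in zip(obs, states) if s == j]
            if members:
                centres[j] = sum(members) / len(members)
    return states


def uniform_init(obs, n_states, seed=0):
    """Initial state sequence drawn uniformly at random, ignoring the data."""
    rng = random.Random(seed)
    return [rng.randrange(n_states) for _ in obs]


if __name__ == "__main__":
    obs = [0.1, 0.2, 0.0, 5.0, 5.1, 4.9, 0.1, 5.2]
    print(kmeans_init(obs, n_states=2))
    print(uniform_init(obs, n_states=2))
```

On data with two well-separated regimes, the clustering start already groups observations by regime, whereas the uniform start carries no information about the series.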

initialization, initialization method, initialization strategy, (15 more...)

arXiv.org Machine Learning

arXiv:2512.03777

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
Europe > Austria > Vienna (0.14)
Europe > Romania (0.04)
(10 more...)

Genre: Research Report > New Finding (0.46)

Industry:

Energy (0.68)
Banking & Finance > Economy (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Language Generation and Identification From Partial Enumeration: Tight Density Bounds and Topological Characterizations

Kleinberg, Jon, Wei, Fan

arXiv.org Artificial Intelligence | Nov-10-2025

The success of large language models (LLMs) has motivated formal theories of language generation and learning. We study the framework of language generation in the limit, where an adversary enumerates strings from an unknown language $K$ drawn from a countable class, and an algorithm must generate unseen strings from $K$. Prior work showed that generation is always possible, and that some algorithms achieve positive lower density, revealing a validity--breadth trade-off between correctness and coverage. We resolve a main open question in this line, proving a tight bound of $1/2$ on the best achievable lower density. We then strengthen the model to allow partial enumeration, where the adversary reveals only an infinite subset $C \subseteq K$. We show that generation in the limit remains achievable, and if $C$ has lower density $α$ in $K$, the algorithm's output achieves density at least $α/2$, matching the upper bound. This generalizes the $1/2$ bound to the partial-information setting, where the generator must recover within a factor $1/2$ of the revealed subset's density. We further revisit the classical Gold--Angluin model of language identification under partial enumeration. We characterize when identification in the limit is possible -- when hypotheses $M_t$ eventually satisfy $C \subseteq M \subseteq K$ -- and in the process give a new topological formulation of Angluin's characterization, showing that her condition is precisely equivalent to an appropriate topological space having the $T_D$ separation property.
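
The density statements above are about the lower density of a subset relative to an ordering of the language, that is, the liminf over n of the fraction of the first n strings that belong to the subset. The sketch below computes those prefix fractions on a finite prefix. It assumes this standard definition; the paper's exact formalization may differ in details, the function names are illustrative, and a minimum over a finite tail is only a stand-in for a true liminf.

```python
def prefix_densities(enumeration, subset):
    """Fraction of the first n enumerated strings lying in `subset`, for n = 1..N."""
    hits = 0
    densities = []
    for n, s in enumerate(enumeration, start=1):
        if s in subset:
            hits += 1
        densities.append(hits / n)
    return densities


def empirical_lower_density(enumeration, subset, burn_in=0):
    """Minimum prefix density after `burn_in` steps (finite proxy for the liminf)."""
    return min(prefix_densities(enumeration, subset)[burn_in:])


if __name__ == "__main__":
    # Toy language K: strings a, aa, aaa, ... listed by length.
    K = ["a" * i for i in range(1, 101)]
    # Toy subset C: the even-length strings, which make up half of K in the limit.
    C = {s for s in K if len(s) % 2 == 0}
    print(empirical_lower_density(K, C, burn_in=10))
```

In this toy case the prefix fractions approach 1/2 from below, so the proxy sits slightly under 1/2 and moves toward it as `burn_in` grows.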

artificial intelligence, large language model, natural language, (19 more...)

arXiv.org Artificial Intelligence

arXiv:2511.05295

Country: North America > United States (0.28)

Genre: Research Report (0.63)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Generation (0.91)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.68)

The Mathematician Who Tried to Convince the Catholic Church of Two Infinities

WIRED | Nov-4-2025, 12:00:00 GMT

In the late 19th century, Georg Cantor believed his new theory could help the Church understand the infinite nature of the divine. It might have escaped lay people at the time, but for some observers the ascension of Leo XIV as head of the Catholic Church this year was a reminder that the last time a Pope Leo sat in St. Peter's Chair in the Vatican, from 1878 to 1903, the modern view of infinity was born. Georg Cantor's completely original "naïve" set theory caused both revolution and revolt in mathematical circles, with some embracing his ideas and others rejecting them. Cantor was deeply disappointed with the negative reactions, of course, but never with his own ideas. Because he held firm to the belief that he had a main line to the absolute--that his ideas came direct from (the divine intellect).

cantor, catholic church, infinity, (14 more...)

WIRED

Country:

Europe > Holy See (0.25)
North America > United States > New York (0.04)
North America > United States > California (0.04)
(3 more...)

Technology: Information Technology > Artificial Intelligence (1.00)

KARIPAP: Quantum-Inspired Tensor Network Compression of Large Language Models Using Infinite Projected Entangled Pair States and Tensor Renormalization Group

Nazri, Azree

arXiv.org Artificial Intelligence | Oct-28-2025

Large Language Models (LLMs) like ChatGPT and LLaMA drive rapid progress in generative AI, yet their huge parameter scales create severe computational and environmental burdens. High training costs, energy use, and limited device deployment hinder accessibility. Existing compression - pruning, distillation, low-rank, and quantization - reduces size but ignores complex inter-layer correlations. We propose KARIPAP, a quantum-inspired tensor network compression using Infinite Projected Entangled Pair States (iPEPS) and Tensor Renormalization Group (TRG) contraction. Unlike 1D Matrix Product States, iPEPS captures multi-directional entanglement in attention and deep transformer layers. TRG ensures polynomial-time contraction, making tensorization feasible while preserving key correlation geometry. Experiments on LLaMA-2 7B show up to 93% memory and 70% parameter reduction, with 50% faster training, 25% faster inference, and only 2-3% accuracy loss. Layer-wise entanglement profiling reveals redundancy in deeper layers, confirming their suitability for tensor factorization. KARIPAP demonstrates that modern LLMs occupy low-dimensional entanglement manifolds, enabling scalable, energy-efficient, and quantum-aware AI architectures.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

arXiv:2510.21844

Country: Asia (0.15)

Genre: Research Report (1.00)

Industry: Energy (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.48)
