AITopics | Africa

Collaborating Authors

Africa

cf6501108fced72ee5c47e2151c4e153-Paper-Conference.pdf

Neural Information Processing SystemsFeb-12-2026, 00:37:31 GMT

Thus, most meta and transfer-learning HPO methods [7-16] consider a restrictive setting where all tasks must share the same set of hyperparameters so that the input data can be represented as fixed-sizedvectors.

artificial intelligence, machine learning, optimization, (19 more...)

Neural Information Processing Systems

Country: Africa > Ethiopia > Addis Ababa > Addis Ababa (0.04)

Genre: Research Report (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

cf62ec4cd78c8d25d5321708f000d908-Paper-Conference.pdf

Neural Information Processing SystemsFeb-12-2026, 00:36:00 GMT

decomposition, self-supervised loss, tensor decomposition, (15 more...)

Neural Information Processing Systems

Country:

Africa > Senegal > Kolda Region > Kolda (0.05)
North America > United States > Massachusetts (0.04)
North America > United States > Illinois > Champaign County > Urbana (0.04)

Genre: Research Report > New Finding (0.67)

Industry:

Health & Medicine > Health Care Technology (0.68)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Add feedback

55c518a17bd17dcb69aa14d69d085994-Paper-Conference.pdf

Neural Information Processing SystemsFeb-12-2026, 00:22:13 GMT

machine learning, natural language, question answering, (17 more...)

Neural Information Processing Systems

Country:

Europe > Netherlands > North Holland > Amsterdam (0.04)
Africa > Ethiopia > Addis Ababa > Addis Ababa (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(4 more...)

Genre: Research Report (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.71)

Add feedback

43612b0662cb6a4986edf859fd6ebafe-Paper-Datasets_and_Benchmarks_Track.pdf

Neural Information Processing SystemsFeb-12-2026, 00:21:16 GMT

dataset, flood mapping, kuro siwo, (14 more...)

Neural Information Processing Systems

Country:

Asia > Pakistan (0.05)
Oceania > Australia (0.04)
North America > Honduras (0.04)
(9 more...)

Genre: Research Report (0.46)

Industry:

Banking & Finance (1.00)
Health & Medicine (0.93)
Government (0.68)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(2 more...)

Add feedback

3fd60983292458bf7dee75f12d5e9e05-Paper.pdf

Neural Information Processing SystemsFeb-12-2026, 00:12:13 GMT

algorithm, online algorithm, peek search, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Canada (0.04)
Africa > Middle East > Egypt > Cairo Governorate > Cairo (0.04)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)
Information Technology > Biomedical Informatics > Translational Bioinformatics (0.68)

Add feedback

First-order methods almost always avoid saddle points: The case of vanishing step-sizes

Ioannis Panageas, Georgios Piliouras, Xiao Wang

Neural Information Processing SystemsFeb-12-2026, 00:11:36 GMT

Neural Information Processing Systems http://nips.cc/

converge, saddle point, theorem, (13 more...)

Neural Information Processing Systems

Country:

Asia > Singapore (0.04)
Asia > Middle East > Jordan (0.04)
North America > Canada > Quebec > Montreal (0.04)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.32)

Add feedback

OntheEffectivenessofLipschitz-Driven RehearsalinContinualLearning

Neural Information Processing SystemsFeb-12-2026, 00:08:27 GMT

Rehearsal approaches enjoy immense popularity with Continual Learning (CL) practitioners. These methods collect samples from previously encountered data distributions in a small memory buffer; subsequently, they repeatedly optimize on the latter to prevent catastrophic forgetting. This work draws attention to a hidden pitfallofthis widespread practice: repeated optimization onasmall pool of data inevitably leads to tight and unstable decision boundaries, which are a major hindrance to generalization.

artificial intelligence, learning, machine learning, (16 more...)

Neural Information Processing Systems

Country: Africa > Central African Republic > Ombella-M'Poko > Bimbo (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.35)

Add feedback

Distributional Gradient Matching for Learning Uncertain Neural Dynamics Models

Neural Information Processing SystemsFeb-12-2026, 00:08:15 GMT

Differential equations in general and neural ODEs in particular are an essential technique in continuous-time system identification. While many deterministic learning algorithms have been designed based on numerical integration via the adjoint method, many downstream tasks such as active learning, exploration in reinforcement learning, robust control, or filtering require accurate estimates of predictive uncertainties.

artificial intelligence, dynamic model, machine learning, (16 more...)

Neural Information Processing Systems

Country:

Europe > Switzerland > Zürich > Zürich (0.04)
Asia > Middle East > Republic of Türkiye > Karaman Province > Karaman (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
(4 more...)

Genre: Research Report (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

Add feedback

SoftMatcha 2: A Fast and Soft Pattern Matcher for Trillion-Scale Corpora

Yoneda, Masataka, Matsushita, Yusuke, Kamoda, Go, Suenaga, Kohei, Akiba, Takuya, Waga, Masaki, Yokoi, Sho

arXiv.org Machine LearningFeb-12-2026

We present an ultra-fast and flexible search algorithm that enables search over trillion-scale natural language corpora in under 0.3 seconds while handling semantic variations (substitution, insertion, and deletion). Our approach employs string matching based on suffix arrays that scales well with corpus size. To mitigate the combinatorial explosion induced by the semantic relaxation of queries, our method is built on two key algorithmic ideas: fast exact lookup enabled by a disk-aware design, and dynamic corpus-aware pruning. We theoretically show that the proposed method suppresses exponential growth in the search space with respect to query length by leveraging statistical properties of natural language. In experiments on FineWeb-Edu (Lozhkov et al., 2024) (1.4T tokens), we show that our method achieves significantly lower search latency than existing methods: infini-gram (Liu et al., 2024), infini-gram mini (Xu et al., 2025), and SoftMatcha (Deguchi et al., 2025). As a practical application, we demonstrate that our method identifies benchmark contamination in training corpora, unidentified by existing approaches. We also provide an online demo of fast, soft search across corpora in seven languages.

large language model, machine learning, pattern recognition, (25 more...)

arXiv.org Machine Learning

2602.10908

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
Europe > Austria > Styria > Graz (0.04)
Europe > Austria > Vienna (0.04)
(17 more...)

Genre: Research Report (0.82)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Leisure & Entertainment > Sports > Olympic Games (0.95)
Health & Medicine > Therapeutic Area > Immunology (0.92)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(5 more...)

Add feedback

A solvable high-dimensional model where nonlinear autoencoders learn structure invisible to PCA while test loss misaligns with generalization

Mendes, Vicente Conde, Bardone, Lorenzo, Koller, Cédric, Moreira, Jorge Medina, Erba, Vittorio, Troiani, Emanuele, Zdeborová, Lenka

arXiv.org Machine LearningFeb-12-2026

Many real-world datasets contain hidden structure that cannot be detected by simple linear correlations between input features. For example, latent factors may influence the data in a coordinated way, even though their effect is invisible to covariance-based methods such as PCA. In practice, nonlinear neural networks often succeed in extracting such hidden structure in unsupervised and self-supervised learning. However, constructing a minimal high-dimensional model where this advantage can be rigorously analyzed has remained an open theoretical challenge. We introduce a tractable high-dimensional spiked model with two latent factors: one visible to covariance, and one statistically dependent yet uncorrelated, appearing only in higher-order moments. PCA and linear autoencoders fail to recover the latter, while a minimal nonlinear autoencoder provably extracts both. We analyze both the population risk, and empirical risk minimization. Our model also provides a tractable example where self-supervised test loss is poorly aligned with representation quality: nonlinear autoencoders recover latent structure that linear methods miss, even though their reconstruction loss is higher.

artificial intelligence, equation, machine learning, (15 more...)

arXiv.org Machine Learning

2602.1068

Country:

Africa > Middle East > Tunisia > Ben Arous Governorate > Ben Arous (0.04)
Europe > Switzerland > Vaud > Lausanne (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)

Genre: Research Report (0.81)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback