AITopics | xtrain

Collaborating Authors

xtrain

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Just One Layer Norm Guarantees Stable Extrapolation

Neural Information Processing SystemsJun-22-2026, 18:29:52 GMT

In spite of their prevalence, the behaviour of Neural Networks when extrapolating far from the training distribution remains poorly understood, with existing results limited to specific cases. In this work, we prove general results--the first of their kind--by applying Neural Tangent Kernel (NTK) theory to analyse infinitelywide neural networks trained until convergence and prove that the inclusion of just one Layer Norm (LN) fundamentally alters the induced NTK, transforming it into a bounded-variance kernel. As a result, the output of an infinitely wide network with at least one LN remains bounded, even on inputs far from the training data. In contrast, we show that a broad class of networks without LN can produce pathologically large outputs for certain inputs. We support these theoretical findings with empirical experiments on finite-width networks, demonstrating that while standard NNs often exhibit uncontrolled growth outside the training domain, a single LN layer effectively mitigates this instability. Finally, we explore real-world implications of this extrapolatory stability, including applications to predicting residue sizes in proteins larger than those seen during training and estimating age from facial images of underrepresented ethnicities absent from the training set.

artificial intelligence, assumption 3, machine learning, (18 more...)

Neural Information Processing Systems

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.67)

Industry: Education (0.45)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

Add feedback

Tailoring

Neural Information Processing SystemsFeb-11-2026, 22:22:46 GMT

From CNNs toattention mechanisms, encoding inductivebiases intoneural networks has been a fruitful source of improvement in machine learning.

artificial intelligence, ltailor, machine learning, (18 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > United Kingdom (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.68)

Add feedback

Few-ShotNon-ParametricLearningwithDeepLatent VariableModel

Neural Information Processing SystemsFeb-11-2026, 05:58:09 GMT

By onlytraining agenerativemodel inanunsupervised way,theframeworkutilizes the data distribution to build a compressor. Using a compressor-based distance metric derived from Kolmogorov complexity, together with few labeled data, NPC-LVclassifies without further training.

artificial intelligence, compressor, machine learning, (16 more...)

Neural Information Processing Systems

Country:

North America > Canada (0.04)
North America > United States > California > San Diego County > San Diego (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.95)

Add feedback

BGeneraltrade-offs

Neural Information Processing SystemsFeb-10-2026, 19:09:37 GMT

However, we make no serious efforts to find the optimal architecture. In fact, we use the same 13 architecture for allour experiments, across the scales. Webelievethe performance onaparticular task can be further improved by carefully curating the neural architecture.

artificial intelligence, machine learning, qamortv, (18 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.34)

Add feedback

Data-EfficientAugmentationforTrainingNeural Networks

Neural Information Processing SystemsFeb-7-2026, 21:05:00 GMT

Data augmentation is essential to achieve state-of-the-art performance in many deeplearningapplications.

artificial intelligence, augmentation, machine learning, (18 more...)

Neural Information Processing Systems

Country: North America > United States > Virginia (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

AWinning Hand: CompressingDeepNetworksCan ImproveOut-Of-DistributionRobustness

Neural Information Processing SystemsFeb-7-2026, 08:06:32 GMT

For example, consider the "Mars rover mission" that uses laser-induced breakdown spectroscopy (LIBS)tosearchformicrobiallife.

artificial intelligence, arxivpreprintarxiv, machine learning, (17 more...)

Neural Information Processing Systems

Country: North America > United States (0.28)

Genre: Research Report (0.46)

Industry: Energy (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.96)

Add feedback

Hybrid Feature- and Similarity-Based Models for Joint Prediction and Interpretation

Kueper, Jacqueline K., Rayner, Jennifer, Lizotte, Daniel J.

arXiv.org Artificial IntelligenceFeb-11-2023

Electronic health records (EHRs) include simple features like patient age together with more complex data like care history that are informative but not easily represented as individual features. To better harness such data, we developed an interpretable hybrid feature- and similarity-based model for supervised learning that combines feature and kernel learning for prediction and for investigation of causal relationships. We fit our hybrid models by convex optimization with a sparsity-inducing penalty on the kernel. Depending on the desired model interpretation, the feature and kernel coefficients can be learned sequentially or simultaneously. The hybrid models showed comparable or better predictive performance than solely feature- or similarity-based approaches in a simulation study and in a case study to predict two-year risk of loneliness or social isolation with EHR data from a complex primary health care population. Using the case study we also present new kernels for high-dimensional indicator-coded EHR data that are based on deviations from population-level expectations, and we identify considerations for causal interpretations.

artificial intelligence, data mining, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2204.06076

Country:

North America > Greenland (0.04)
North America > Canada > Ontario > National Capital Region > Ottawa (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Consumer Health (0.93)
Health & Medicine > Health Care Technology > Medical Record (0.54)
Health & Medicine > Therapeutic Area > Oncology (0.46)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.47)

Add feedback

Mphasis

#artificialintelligenceApr-27-2022, 13:54:52 GMT

Now that we have an understanding of Baye's Rule, let's try to use it to analyze linear regression models. Where i is the dimensionality of the data X. Yj is the corresponding output for Xj. If i 3, Yj w1* x1j w2* x2j w3* x3j Where j is ranging from 1 to N where N is the number of data points we have. While the process of Bayesian modelling will be taken up in next part, let us consider the below model as true, for now.

regression model, xtrain, ytrain, (11 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

Add feedback

Hands-on Experience with Gaussian Processes (GPs): Implementing GPs in Python - I

Tiwari, Kshitij

arXiv.org Machine LearningSep-6-2018

This document serves to complement our website which was developed with the aim of exposing the students to Gaussian Processes (GPs). GPs are non-parametric Bayesian regression models that are largely used by statisticians and geospatial data scientists for modeling spatial data. Several open source libraries spanning from Matlab [1], Python [2], R [3] etc., are already available for simple plug-and-use. The objective of this handout and in turn the website was to allow the users to develop stand-alone GPs in Python by relying on minimal external dependencies. To this end, we only use the default python modules and assist the users in developing their own GPs from scratch giving them an in-depth knowledge of what goes on under the hood. The module covers GP inference using maximum likelihood estimation (MLE) and gives examples of 1D (dummy) spatial data.

artificial intelligence, machine learning, maximum likelihood estimation, (16 more...)

arXiv.org Machine Learning

1809.01913

Country: