Accelerating SGD for Highly Ill-Conditioned Huge-Scale Online Matrix Completion
Gavin Zhang, University of Illinois at Urbana–Champaign, jialun2@illinois.edu; Hong-Ming Chiu, University of Illinois at Urbana–Champaign, hmchiu2@illinois.edu; Richard Y. Zhang, University of Illinois at Urbana–Champaign, ryz@illinois.edu
Stable Diffusion Benchmarked: Which GPU Runs AI Fastest
Artificial intelligence and deep learning are constantly in the headlines these days, whether it's ChatGPT giving poor advice, self-driving cars, artists being accused of using AI, or medical advice from AI. Most of these tools rely on complex servers with lots of hardware for training, but running inference on a trained network can be done on your PC, using its graphics card. But how fast are consumer GPUs at AI inference? We've benchmarked Stable Diffusion, a popular AI image creator, on the latest Nvidia, AMD, and even Intel GPUs to see how they stack up. If you've by chance tried to get Stable Diffusion up and running on your own PC, you may have some inkling of how complex -- or simple! -- the process can be.
Variational Gram Functions: Convex Analysis and Optimization
Jalali, Amin, Fazel, Maryam, Xiao, Lin
We propose a new class of convex penalty functions, called \emph{variational Gram functions} (VGFs), that can promote pairwise relations, such as orthogonality, among a set of vectors in a vector space. These functions can serve as regularizers in convex optimization problems arising from hierarchical classification, multitask learning, and estimating vectors with disjoint supports, among other applications. We study convexity for VGFs, and give efficient characterizations for their convex conjugates, subdifferentials, and proximal operators. We discuss efficient optimization algorithms for regularized loss minimization problems where the loss admits a common, yet simple, variational representation and the regularizer is a VGF. These algorithms enjoy a simple kernel trick, an efficient line search, as well as computational advantages over first order methods based on the subdifferential or proximal maps. We also establish a general representer theorem for such learning problems. Lastly, numerical experiments on a hierarchical classification problem are presented to demonstrate the effectiveness of VGFs and the associated optimization algorithms.
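The abstract's orthogonality-promoting use case can be made concrete with one simple (hypothetical, for illustration only) member of the VGF class: the sum of absolute inner products between distinct columns, which vanishes exactly when the vectors are pairwise orthogonal. A minimal NumPy sketch, with the function name `vgf_pairwise_abs` chosen here for illustration:

```python
import numpy as np

def vgf_pairwise_abs(X):
    """Illustrative VGF instance: sum of |<x_i, x_j>| over all ordered
    pairs of distinct columns of X. Depends on X only through the
    Gram matrix X'X, and is zero iff the columns are orthogonal.
    (The paper studies a general class; this is just one example.)"""
    G = X.T @ X                      # Gram matrix of the columns
    off_diag = np.abs(G) - np.diag(np.abs(np.diag(G)))
    return off_diag.sum()

# Orthogonal columns incur zero penalty:
Q = np.array([[1.0, 0.0], [0.0, 1.0], [0.0, 0.0]])
print(vgf_pairwise_abs(Q))  # 0.0
```

Used as a regularizer, such a penalty pushes the learned vectors (e.g. per-task classifiers in multitask learning) toward disjoint or orthogonal directions.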
This Bank-Beating Trading Powerhouse Doesn't Use Human Traders
One of the world's fastest-growing trading shops doesn't have any traders. XTX Markets Ltd. has emerged as a foreign-exchange powerhouse, relying on programmers and mathematicians to fuel its rise into the global top five earlier this year. Now, after becoming a formidable player in currencies, XTX has its sights set on growing in stocks, commodities and bonds markets. But in a world where the difference between profit and loss can be tiny fractions of a second, XTX says it relies more on smarts than speed. Instead of building microwave networks to ferret out prices a microsecond before anyone else, XTX uses mathematical models that are tuned with massive data sets.
Simple one-pass algorithm for penalized linear regression with cross-validation on MapReduce
In this paper, we propose a one-pass MapReduce algorithm for penalized linear regression \[f_\lambda(\alpha, \beta) = \|Y - \alpha\mathbf{1} - X\beta\|_2^2 + p_{\lambda}(\beta),\] where $\alpha$ is the intercept, which can be omitted depending on the application; $\beta$ is the coefficient vector; and $p_{\lambda}$ is the penalty function with regularization parameter $\lambda$. $f_\lambda(\alpha, \beta)$ covers interesting classes such as the Lasso, ridge regression, and the elastic net. Compared to the latest iterative distributed algorithms, which require multiple MapReduce jobs, our algorithm achieves a large performance improvement; moreover, it is exact, unlike approximate algorithms such as parallel stochastic gradient descent. What further distinguishes our algorithm from others is that it trains the model with cross-validation to choose the optimal $\lambda$ rather than relying on a user-specified one. Key words: penalized linear regression, lasso, elastic-net, ridge, MapReduce
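The one-pass idea is easiest to see in the ridge special case $p_\lambda(\beta) = \lambda\|\beta\|_2^2$: a single scan accumulates the sufficient statistics $X'X$ and $X'Y$, after which solutions for every candidate $\lambda$ (and hence cross-validation) cost no further passes over the data. A minimal sketch under that assumption (function names are illustrative, and the map/reduce stages are mimicked with an in-memory loop; the Lasso and elastic net need an extra solver step on the same statistics):

```python
import numpy as np

def one_pass_stats(chunks):
    """'Map' stage sketch: accumulate X'X and X'y over data chunks
    in a single pass. `chunks` yields (X_block, y_block) pairs,
    standing in for mapper inputs."""
    XtX, Xty, n = None, None, 0
    for Xb, yb in chunks:
        if XtX is None:
            d = Xb.shape[1]
            XtX, Xty = np.zeros((d, d)), np.zeros(d)
        XtX += Xb.T @ Xb
        Xty += Xb.T @ yb
        n += Xb.shape[0]
    return XtX, Xty, n

def ridge_path(XtX, Xty, lambdas):
    """'Reduce' stage sketch: solve the ridge normal equations
    (X'X + lam*I) beta = X'y for each candidate lambda, reusing
    the one-pass statistics."""
    d = XtX.shape[0]
    return [np.linalg.solve(XtX + lam * np.eye(d), Xty) for lam in lambdas]

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
y = X @ np.array([1.0, 2.0, 3.0])
chunks = [(X[:50], y[:50]), (X[50:], y[50:])]   # two "mapper" blocks
XtX, Xty, n = one_pass_stats(chunks)
betas = ridge_path(XtX, Xty, [0.0, 1.0])        # whole path, no extra passes
```

Because the statistics are additive across blocks, they combine trivially across mappers, which is what makes a single MapReduce job sufficient.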