AITopics | geometric property

Collaborating Authors

geometric property

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Less is More: Local Intrinsic Dimensions of Contextual Language Models

Neural Information Processing SystemsJun-12-2026, 11:21:38 GMT

Understanding the internal mechanisms of large language models (LLMs) remains a challenging and complex endeavor. Even fundamental questions, such as how fine-tuning affects model behavior, often require extensive empirical evaluation. In this paper, we introduce a novel perspective based on the geometric properties of contextual latent embeddings to study the effects of training and fine-tuning. To that end, we measure the local dimensions of a contextual language model's latent space and analyze their shifts during training and fine-tuning. We show that the local dimensions provide insights into the model's training dynamics and generalization ability. Specifically, the mean of the local dimensions predicts when the model's training capabilities are exhausted, as exemplified in a dialogue state tracking task, overfitting, as demonstrated in an emotion recognition task, and grokking, as illustrated with an arithmetic task. Furthermore, our experiments suggest a practical heuristic: reductions in the mean local dimension tend to accompany and predict subsequent performance gains. Through this exploration, we aim to provide practitioners with a deeper understanding of the implications of fine-tuning on embedding spaces, facilitating informed decisions when configuring models for specific applications. The results of this work contribute to the ongoing discourse on the interpretability, adaptability, and generalizability of LLMs by bridging the gap between intrinsic model mechanisms and geometric properties in the respective embeddings.

artificial intelligence, large language model, natural language, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.82)

Add feedback

RMLR: Extending Multinomial Logistic Regression into General Geometries

Neural Information Processing SystemsMar-21-2026, 01:33:03 GMT

Riemannian neural networks, which extend deep learning techniques to Riemannian spaces, have gained significant attention in machine learning. To better classify the manifold-valued features, researchers have started extending Euclidean multinomial logistic regression (MLR) into Riemannian manifolds. However, existing approaches suffer from limited applicability due to their strong reliance on specific geometric properties. This paper proposes a framework for designing Riemannian MLR over general geometries, referred to as RMLR. Our framework only requires minimal geometric properties, thus exhibiting broad applicability and enabling its use with a wide range of geometries.

artificial intelligence, machine learning, proceedings, (9 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.61)

Add feedback

Fine-grained Optimization of Deep Neural Networks

Mete Ozay

Neural Information Processing SystemsFeb-13-2026, 23:58:51 GMT

Neural Information Processing Systems http://nips.cc/

dnn, manifold, weight manifold, (14 more...)

Neural Information Processing Systems

Country:

North America > United States (0.04)
North America > Canada (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

ProbabilisticMarginsforInstanceReweighting inAdversarialTraining

Neural Information Processing SystemsFeb-11-2026, 01:32:17 GMT

In this case, one can hardly distinguish non-robust data with LPS being 10 and safe data that are insensitive to be attacked.

artificial intelligence, machine learning, robustness, (18 more...)

Neural Information Processing Systems

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.05)
North America > Canada > Ontario > Toronto (0.04)
Asia > Middle East > Jordan (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Add feedback

f05da679342107f92111ad9d65959cd3-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-11-2026, 01:21:15 GMT

latent space, manifold, optimization, (17 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.31)

Add feedback

1def1713ebf17722cbe300cfc1c88558-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-7-2026, 17:24:36 GMT

artificial intelligence, reviewer, tensor, (15 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.50)

Add feedback

Sample and Computationally Efficient Learning Algorithms under S-Concave Distributions

Maria-Florina F. Balcan, Hongyang Zhang

Neural Information Processing SystemsNov-21-2025, 15:02:07 GMT

Developing provable learning algorithms is one of the central challenges in learning theory. The study of such algorithms has led to significant advances in both the theory and practice of passive and active learning.

active learning, artificial intelligence, machine learning, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory (0.34)

Add feedback

1def1713ebf17722cbe300cfc1c88558-AuthorFeedback.pdf

Neural Information Processing SystemsOct-2-2025, 09:13:34 GMT

artificial intelligence, reviewer, tensor, (15 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.50)

Add feedback

From Internal Representations to Text Quality: A Geometric Approach to LLM Evaluation

Yusupov, Viacheslav, Maksimov, Danil, Alaeva, Ameliia, Vasileva, Anna, Antipina, Anna, Zaitseva, Tatyana, Ermilova, Alina, Burnaev, Evgeny, Shvetsov, Egor

arXiv.org Artificial IntelligenceOct-1-2025

This paper bridges internal and external analysis approaches to large language models (LLMs) by demonstrating that geometric properties of internal model representations serve as reliable proxies for evaluating generated text quality. We validate a set of metrics--including Maximum Explainable V ariance, Effective Rank, Intrinsic Dimensionality, MAUVE score, and Schatten Norms measured across different layers of LLMs, demonstrating that Intrinsic Dimensionality and Effective Rank can serve as universal assessments of text naturalness and quality. Our key finding reveals that different models consistently rank text from various sources in the same order based on these geometric properties, indicating that these metrics reflect inherent text characteristics rather than model-specific artifacts. This allows a reference-free text quality evaluation that does not require human-annotated datasets, offering practical advantages for automated evaluation pipelines. The rapid advancement of large language models (LLMs) has necessitated the development of methods for analyzing their internal mechanisms and the properties of generated text. Approaches to studying the geometric properties of representations in language models can be broadly categorized into two categories: internal or mechanistic methods, which investigate the model's intermediate representations, and external methods, which analyze the properties of text embeddings captured via some embedding model. Internal evaluation mainly considers model properties within which, these measures were made (Yin et al., 2024; Viswanathan et al., 2025; Roy & V etterli, 2007), while external measures mainly focus on evaluation of text properties given text embeddings (Zhao et al., 2019; Tulchinskii et al., 2023; Kuznetsov et al., 2024).

arxiv preprint arxiv, large language model, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2509.25359

Country: North America > United States (0.46)

Genre: Research Report > New Finding (0.46)

Technology: