AITopics | vect

vect

Self-Attention as a Parametric Endofunctor: A Categorical Framework for Transformer Architectures

O'Neill, Charles

arXiv.org Artificial Intelligence, Jan-14-2025

Self-attention mechanisms have revolutionised deep learning architectures, yet their core mathematical structures remain incompletely understood. In this work, we develop a category-theoretic framework focusing on the linear components of self-attention. Specifically, we show that the query, key, and value maps naturally define a parametric 1-morphism in the 2-category $\mathbf{Para(Vect)}$. On the underlying 1-category $\mathbf{Vect}$, these maps induce an endofunctor whose iterated composition precisely models multi-layer attention. We further prove that stacking multiple self-attention layers corresponds to constructing the free monad on this endofunctor. For positional encodings, we demonstrate that strictly additive embeddings correspond to monoid actions in an affine sense, while standard sinusoidal encodings, though not additive, retain a universal property among injective (faithful) position-preserving maps. We also establish that the linear portions of self-attention exhibit natural equivariance to permutations of input tokens, and show how the "circuits" identified in mechanistic interpretability can be interpreted as compositions of parametric 1-morphisms. This categorical perspective unifies geometric, algebraic, and interpretability-based approaches to transformer analysis, making explicit the underlying structures of attention. We restrict to linear maps throughout, deferring the treatment of nonlinearities such as softmax and layer normalisation, which require more advanced categorical constructions. Our results build on and extend recent work on category-theoretic foundations for deep learning, offering deeper insights into the algebraic structure of attention mechanisms.
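The permutation-equivariance claim in the abstract can be checked numerically on a toy case. The sketch below is an illustration only, not the paper's construction: the helper names (`matmul`, `permute_rows`, `linear_attention`), the small integer matrices, and the softmax-free product $(XW_Q)(XW_K)^\top(XW_V)$ are all assumptions chosen so that the arithmetic is exact.

```python
import random

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def transpose(a):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(col) for col in zip(*a)]

def permute_rows(x, perm):
    """Reorder the rows (tokens) of x so that new row i is old row perm[i]."""
    return [x[i] for i in perm]

def linear_attention(x, w_q, w_k, w_v):
    """Softmax-free attention (an assumption for illustration): (X W_q)(X W_k)^T (X W_v)."""
    q, k, v = matmul(x, w_q), matmul(x, w_k), matmul(x, w_v)
    return matmul(matmul(q, transpose(k)), v)

def random_matrix(rows, cols, rng):
    """Small integer entries keep every comparison below exact."""
    return [[rng.randint(-3, 3) for _ in range(cols)] for _ in range(rows)]

rng = random.Random(0)
x = random_matrix(4, 3, rng)      # 4 tokens, embedding dimension 3
w_q = random_matrix(3, 2, rng)    # hypothetical query, key, and value weights
w_k = random_matrix(3, 2, rng)
w_v = random_matrix(3, 2, rng)
perm = [2, 0, 3, 1]               # an arbitrary reordering of the 4 tokens

# The query map alone commutes with token permutation ...
q_equivariant = matmul(permute_rows(x, perm), w_q) == permute_rows(matmul(x, w_q), perm)

# ... and so does the full softmax-free composite.
attn_equivariant = (
    linear_attention(permute_rows(x, perm), w_q, w_k, w_v)
    == permute_rows(linear_attention(x, w_q, w_k, w_v), perm)
)
print(q_equivariant, attn_equivariant)
```

Both checks hold for any permutation because a row permutation $P$ satisfies $P^\top P = I$, so $(PXW_Q)(PXW_K)^\top(PXW_V) = P\,(XW_Q)(XW_K)^\top(XW_V)$.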

category, morphism, vect, (17 more...)

arXiv:2501.02931

Country:

North America > United States (0.14)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)

Genre: Research Report > New Finding (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

A Multi-Bennett 8R Mechanism Obtained From Factorization of Bivariate Motion Polynomials

Frischauf, Johanna, Pfurner, Martin, Scharler, Daniel F., Schröcker, Hans-Peter

arXiv.org Artificial Intelligence, Oct-25-2022

Overconstrained linkages are a long-standing but still highly active topic of research in mechanism science. For several decades, researchers focused on overconstrained mechanisms consisting of a single loop of $n \le 6$ revolute joints (R), prismatic joints (P), or, sometimes, helical joints (H). New linkages of that type are continuously being discovered, often by craftily combining known linkages [2, 3, 28], sometimes via novel concepts for their construction. One of these concepts is the factorization of motion polynomials [8]. It gave rise to the construction of the only class of overconstrained 6R linkages with still unknown relations between its Denavit-Hartenberg parameters. In [6, 9, 17-20], motion polynomial factorization was exploited for the synthesis of linkages. In spite of some attempts, a complete classification of overconstrained single-loop linkages is currently out of reach. It is thus natural that research efforts have shifted towards the investigation of single-loop linkages consisting of $n \ge 7$ links with, generically, $n - 6 \ge 1$ degrees of freedom.

artificial intelligence, factorization, polynomial, (18 more...)

arXiv:2208.11407

Country:

Europe > Netherlands > South Holland > Dordrecht (0.04)
Europe > Netherlands > South Holland > Delft (0.04)
Europe > Austria > Tyrol > Innsbruck (0.04)

Genre: Research Report (0.84)

Technology: Information Technology > Artificial Intelligence > Robots (0.46)

AI Neurotechnology for Aging Societies -- Task-load and Dementia EEG Digital Biomarker Development Using Information Geometry Machine Learning Methods

Rutkowski, Tomasz M., Zhao, Qibin, Abe, Masao S., Otake, Mihoko

arXiv.org Artificial Intelligence, Nov-30-2018

Dementia, and especially Alzheimer's disease (AD), is among the most common causes of cognitive decline in elderly people. The spread of these conditions in aging societies is causing a significant medical and economic burden in many countries around the world. According to a recent World Health Organization (WHO) report, an estimated 47 million people worldwide currently live with a dementia spectrum of neurocognitive disorders. This number is expected to triple by 2050, which calls for the application of AI-based technologies to support early screening for preventive interventions, as well as subsequent monitoring and maintenance of mental wellbeing with so-called "digital-pharma" or "beyond a pill" therapeutic approaches. This paper discusses our attempt at, and preliminary results from, using brainwave (EEG) techniques to develop digital biomarkers for detecting and monitoring dementia progression. We present an information geometry-based classification approach for the automatic discrimination of EEG-derived event-related potentials (ERPs) in low versus high task-load auditory or tactile stimulus recognition, whose amplitude and latency variabilities are similar to those in dementia. The discussed approach is a step towards developing AI, and especially machine learning (ML), approaches for subsequent application to mild cognitive impairment (MCI) and AD diagnostics.

artificial intelligence, erp, machine learning, (12 more...)

arXiv:1811.12642

Country: Asia > Japan > Honshū > Kantō (0.14)

Genre: Research Report > New Finding (0.46)

Industry:

Health & Medicine > Therapeutic Area > Neurology > Dementia (1.00)
Health & Medicine > Therapeutic Area > Neurology > Alzheimer's Disease (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

OpenIAS Hybrid Generative-Discriminative Deep Models

@machinelearnbot, Jun-1-2017, 11:50:08 GMT

Deep discriminative classifiers perform remarkably well on problems with a lot of labeled data. So-called deep generative models tend to excel when labeled training data is scarce. Can we do a hybrid, combining the best of both worlds? In this post I outline a hybrid generative-discriminative deep model loosely based on the importance weighted autoencoder (Burda et al., 2015). Don't miss the pretty pictures.
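For readers unfamiliar with the importance weighted autoencoder, its training objective can be sketched in a few lines. The code below illustrates only the $K$-sample bound of Burda et al., $\log \frac{1}{K}\sum_{k} w_k$ with $w_k = p(x, z_k)/q(z_k \mid x)$, and not the hybrid model described in the post. The function names and the example log-weights are hypothetical.

```python
import math

def log_mean_exp(values):
    """Numerically stable log of the mean of exp(values)."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values) / len(values))

def iwae_bound(log_weights):
    """K-sample importance weighted bound estimate: log (1/K) sum_k w_k."""
    return log_mean_exp(log_weights)

def elbo_estimate(log_weights):
    """Standard ELBO estimate from the same samples: (1/K) sum_k log w_k."""
    return sum(log_weights) / len(log_weights)

# Hypothetical log importance weights log p(x, z_k) - log q(z_k | x) for K = 3 samples.
log_w = [-3.0, -1.0, -2.0]

bound = iwae_bound(log_w)
elbo = elbo_estimate(log_w)
print(bound >= elbo)
```

By Jensen's inequality the importance weighted estimate is never below the averaged ELBO estimate on the same samples, and with a single sample the two coincide.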

artificial intelligence, machine learning, vect, (16 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)
