Discovering Universal Geometry in Embeddings with ICA
Yamagiwa, Hiroaki, Oyama, Momose, Shimodaira, Hidetoshi
–arXiv.org Artificial Intelligence
This study utilizes Independent Component Analysis (ICA) to unveil a consistent semantic structure within embeddings of words or images. Our approach extracts independent semantic components from the embeddings of a pre-trained model by leveraging anisotropic information that remains after the whitening process in Principal Component Analysis (PCA). We demonstrate that each embedding can be expressed as a composition of a few intrinsic interpretable axes and that these semantic axes remain consistent across different languages, algorithms, and modalities. The discovery of a universal semantic structure in the geometric patterns of embeddings enhances our understanding of the representations in embeddings.
arXiv.org Artificial Intelligence
Nov-2-2023
- Country:
- Asia
- China
- India > Maharashtra
- Mumbai (0.04)
- Japan
- Honshū > Kansai
- Kyoto Prefecture > Kyoto (0.04)
- Kyūshū & Okinawa > Okinawa (0.04)
- Honshū > Kansai
- Middle East > UAE
- Abu Dhabi Emirate > Abu Dhabi (0.04)
- Singapore (0.04)
- Europe
- Austria (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Germany (0.04)
- Italy (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- North America
- Canada
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.14)
- Quebec > Montreal (0.04)
- British Columbia > Metro Vancouver Regional District
- Dominican Republic (0.04)
- United States
- Arizona (0.04)
- Colorado (0.04)
- Louisiana (0.04)
- Nevada > Clark County
- Las Vegas (0.04)
- Texas > Travis County
- Austin (0.04)
- Canada
- Oceania > Australia
- South America > Chile
- Asia
- Genre:
- Research Report > New Finding (1.00)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning
- Neural Networks (0.93)
- Statistical Learning (0.66)
- Natural Language > Text Processing (0.88)
- Representation & Reasoning (1.00)
- Vision (0.93)
- Machine Learning
- Information Technology > Artificial Intelligence