BERT's Conceptual Cartography: Mapping the Landscapes of Meaning
arXiv.org Artificial Intelligence
Conceptual engineers want to make words better. However, they often underestimate how varied our usage of words is. In this paper, we take first steps in exploring the contextual nuances of words by creating conceptual landscapes (2D surfaces representing the pragmatic usage of words) that conceptual engineers can use to inform their projects. We use BERT to create contextualised word embeddings from the spoken component of the British National Corpus, then use Gaussian Mixture Models, a selection of metrics, and qualitative analysis to visualise and numerically represent these lexical landscapes. This approach has not yet been used in the conceptual engineering literature; it provides a detailed examination of how different words manifest in various contexts, which is potentially useful to conceptual engineering projects. Our findings highlight the inherent complexity of conceptual engineering: each word exhibits a unique and intricate landscape. Conceptual engineers therefore cannot use a one-size-fits-all approach when improving words, a task that may be practically intractable at scale.
Aug-13-2024
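The abstract describes fitting Gaussian Mixture Models to contextualised embeddings to obtain a 2D "landscape" for a word. The following is only a rough sketch of that idea, not the authors' implementation. It assumes the embeddings have already been projected to 2D by some dimensionality-reduction step (not shown, and not specified in the abstract), and it uses synthetic points in place of real BERT vectors. The function names (`fit_gmm`, `landscape_height`), the diagonal covariances, and the farthest-point initialisation are all illustrative choices made here for brevity.

```python
import math
import random


def gaussian_pdf(p, mean, var):
    """Density of an axis-aligned (diagonal-covariance) 2D Gaussian at p."""
    dx, dy = p[0] - mean[0], p[1] - mean[1]
    norm = 2.0 * math.pi * math.sqrt(var[0] * var[1])
    return math.exp(-0.5 * (dx * dx / var[0] + dy * dy / var[1])) / norm


def fit_gmm(points, k, iters=50):
    """Fit a k-component diagonal GMM to 2D points with plain EM."""
    # Deterministic farthest-point initialisation of the means (an assumption
    # of this sketch; library implementations typically use k-means).
    means = [list(points[0])]
    while len(means) < k:
        far = max(
            points,
            key=lambda p: min(
                (p[0] - m[0]) ** 2 + (p[1] - m[1]) ** 2 for m in means
            ),
        )
        means.append(list(far))
    variances = [[1.0, 1.0] for _ in range(k)]
    weights = [1.0 / k] * k

    for _ in range(iters):
        # E-step: responsibility of each component for each point.
        resp = []
        for p in points:
            dens = [
                w * gaussian_pdf(p, m, v)
                for w, m, v in zip(weights, means, variances)
            ]
            total = sum(dens) or 1e-300  # guard against underflow
            resp.append([d / total for d in dens])

        # M-step: re-estimate weights, means, and variances.
        for j in range(k):
            nj = sum(r[j] for r in resp) or 1e-12
            weights[j] = nj / len(points)
            for d in range(2):
                means[j][d] = sum(r[j] * p[d] for r, p in zip(resp, points)) / nj
                spread = sum(
                    r[j] * (p[d] - means[j][d]) ** 2 for r, p in zip(resp, points)
                )
                variances[j][d] = max(1e-6, spread / nj)  # variance floor
    return weights, means, variances


def landscape_height(p, weights, means, variances):
    """Mixture density at p: the 'height' of the landscape at that location."""
    return sum(
        w * gaussian_pdf(p, m, v) for w, m, v in zip(weights, means, variances)
    )


# Synthetic stand-in for 2D-projected embeddings of one word used in two
# distinct kinds of context (two well-separated clusters).
rng = random.Random(0)
points = [(rng.gauss(0.0, 0.5), rng.gauss(0.0, 0.5)) for _ in range(100)]
points += [(rng.gauss(5.0, 0.5), rng.gauss(5.0, 0.5)) for _ in range(100)]

weights, means, variances = fit_gmm(points, k=2)
peak = landscape_height((0.0, 0.0), weights, means, variances)
valley = landscape_height((2.5, 2.5), weights, means, variances)
```

In this toy setting, `peak` (near a cluster of usages) is much larger than `valley` (between clusters). Evaluating `landscape_height` over a grid would give a surface that can be plotted, in the spirit of the landscapes the abstract describes.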