Unsupervised embedding of trajectories captures the latent structure of scientific migration
Murray, Dakota, Yoon, Jisung, Kojaku, Sadamori, Costas, Rodrigo, Jung, Woo-Sung, Milojević, Staša, Ahn, Yong-Yeol
–arXiv.org Artificial Intelligence
Human migration and mobility drives major societal phenomena including epidemics, economies, innovation, and the diffusion of ideas. Although human mobility and migration have been heavily constrained by geographic distance throughout the history, advances and globalization are making other factors such as language and culture increasingly more important. Advances in neural embedding models, originally designed for natural language, provide an opportunity to tame this complexity and open new avenues for the study of migration. Here, we demonstrate the ability of the model word2vec to encode nuanced relationships between discrete locations from migration trajectories, producing an accurate, dense, continuous, and meaningful vector-space representation. The resulting representation provides a functional distance between locations, as well as a digital double that can be distributed, re-used, and itself interrogated to understand the many dimensions of migration. We show that the unique power of word2vec to encode migration patterns stems from its mathematical equivalence with the gravity model of mobility. Focusing on the case of scientific migration, we apply word2vec to a database of three million migration trajectories of scientists derived from the affiliations listed on their publication records. Using techniques that leverage its semantic structure, we demonstrate that embeddings can learn the rich structure that underpins scientific migration, such as cultural, linguistic, and prestige relationships at multiple levels of granularity. Our results provide a theoretical foundation and methodological framework for using neural embeddings to represent and understand migration both within and beyond science.
arXiv.org Artificial Intelligence
Nov-17-2023
- Country:
- South America
- Oceania
- Australia (0.04)
- New Zealand (0.04)
- North America
- Mexico (0.04)
- Jamaica (0.04)
- Central America (0.04)
- United States
- New York (0.05)
- Arizona (0.05)
- New Jersey (0.04)
- North Carolina (0.04)
- Connecticut (0.04)
- Missouri (0.04)
- Oregon (0.04)
- Ohio (0.04)
- Idaho (0.04)
- Tennessee (0.04)
- Colorado (0.04)
- Montana (0.04)
- Alabama (0.04)
- Wisconsin (0.04)
- New Hampshire (0.04)
- Rhode Island (0.04)
- Oklahoma (0.04)
- Louisiana (0.04)
- Nebraska (0.04)
- District of Columbia > Washington (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Florida
- Pinellas County > St. Petersburg (0.04)
- Hillsborough County > Tampa (0.04)
- Indiana
- Monroe County > Bloomington (0.14)
- Marion County > Indianapolis (0.04)
- Texas
- El Paso County > El Paso (0.04)
- Dallas County > Dallas (0.04)
- Bexar County > San Antonio (0.04)
- Michigan > Genesee County
- Flint (0.04)
- Maryland
- Baltimore County (0.04)
- Baltimore (0.04)
- Alaska > Fairbanks North Star Borough
- Fairbanks (0.04)
- Illinois
- Champaign County > Urbana (0.04)
- Cook County
- Washington > King County
- Seattle (0.04)
- Massachusetts
- Suffolk County > Boston (0.14)
- Hampshire County > Amherst (0.04)
- Bristol County > Dartmouth (0.04)
- California
- San Francisco County > San Francisco (0.28)
- Santa Cruz County > Santa Cruz (0.04)
- Santa Clara County > Stanford (0.04)
- Fresno County > Fresno (0.04)
- San Diego County
- San Diego (0.04)
- San Marcos (0.04)
- Los Angeles County
- Los Angeles (0.14)
- Long Beach (0.04)
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- Canada > Quebec
- Montreal (0.04)
- Europe
- Czechia (0.14)
- Spain (0.04)
- Portugal (0.04)
- Russia (0.04)
- Austria (0.04)
- Poland (0.04)
- Denmark (0.04)
- Finland (0.04)
- Norway (0.04)
- Greece (0.04)
- Belgium (0.04)
- Sweden (0.04)
- United Kingdom (0.04)
- Italy > Sardinia (0.04)
- Bulgaria (0.04)
- Germany > Berlin (0.04)
- Slovakia (0.04)
- Slovenia (0.04)
- Serbia (0.04)
- Latvia (0.04)
- Western Europe (0.04)
- Lithuania (0.04)
- Romania (0.04)
- Eastern Europe (0.04)
- Ukraine (0.04)
- Croatia (0.04)
- Ireland (0.04)
- Northern Europe (0.04)
- North Macedonia (0.04)
- Switzerland > Zürich
- Zürich (0.04)
- Hungary > Budapest
- Budapest (0.04)
- Netherlands > South Holland
- Leiden (0.05)
- France > Île-de-France
- Asia
- Southeast Asia (0.14)
- China (0.04)
- Russia (0.04)
- India (0.04)
- Taiwan (0.04)
- Japan (0.04)
- Thailand (0.04)
- Malaysia (0.04)
- Singapore (0.04)
- Philippines (0.04)
- Indonesia (0.04)
- Vietnam (0.04)
- Sri Lanka (0.04)
- Bangladesh (0.04)
- North Korea (0.04)
- Pakistan (0.04)
- South Korea
- Gyeongsangbuk-do > Pohang (0.04)
- Seoul > Seoul (0.04)
- Middle East
- Africa > Middle East
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Transportation (1.00)
- Energy (0.67)
- Health & Medicine
- Therapeutic Area (1.00)
- Health Care Providers & Services (1.00)
- Pharmaceuticals & Biotechnology (0.67)
- Government
- Education > Educational Setting
- Higher Education (0.67)
- Technology: