Demo2Vec: Learning Region Embedding with Demographic Information
–arXiv.org Artificial Intelligence
Demographic data, such as income, education level, and employment rate, contain valuable information of urban regions, yet few studies have integrated demographic information to generate region embedding. In this study, we show how the simple and easy-to-access demographic data can improve the quality of state-of-the-art region embedding and provide better predictive performances in urban areas across three common urban tasks, namely check-in prediction, crime rate prediction, and house price prediction. We find that existing pre-train methods based on KL divergence are potentially biased towards mobility information and propose to use Jenson-Shannon divergence as a more appropriate loss function for multi-view representation learning. Experimental results from both New York and Chicago show that mobility + income is the best pre-train data combination, providing up to 10.22\% better predictive performances than existing models. Considering that mobility big data can be hardly accessible in many developing cities, we suggest geographic proximity + income to be a simple but effective data combination for region embedding pre-training.
arXiv.org Artificial Intelligence
Sep-25-2024
- Country:
- Asia
- China
- Heilongjiang Province > Daqing (0.04)
- Hong Kong (0.05)
- Japan > Honshū
- Kantō > Kanagawa Prefecture > Yokohama (0.05)
- Singapore > Central Region
- Singapore (0.04)
- China
- North America > United States
- District of Columbia > Washington (0.05)
- Illinois > Cook County
- Chicago (0.27)
- New York
- Bronx County > New York City (0.04)
- Kings County > New York City (0.04)
- New York County > New York City (0.14)
- Queens County > New York City (0.04)
- Richmond County > New York City (0.04)
- Asia
- Genre:
- Research Report > New Finding (0.34)
- Industry:
- Transportation (0.47)
- Technology: