Unveiling Location-Specific Price Drivers: A Two-Stage Cluster Analysis for Interpretable House Price Predictions
Gümmer, Paul, Rosenberger, Julian, Kraus, Mathias, Zschech, Patrick, Hambauer, Nico
–arXiv.org Artificial Intelligence
House price valuation remains challenging due to localized market variations. Existing approaches often rely on black-box machine learning models, which lack interpretability, or simplistic methods like linear regression (LR), which fail to capture market heterogeneity. To address this, we propose a machine learning approach that applies two-stage clustering, first grouping properties based on minimal location-based features before incorporating additional features. Each cluster is then modeled using either LR or a generalized additive model (GAM), balancing predictive performance with interpretability. Constructing and evaluating our models on 43,309 German house property listings from 2023, we achieve a 36% improvement for the GAM and 58% for LR in mean absolute error compared to models without clustering. Additionally, graphical analyses unveil pattern shifts between clusters. These findings emphasize the importance of cluster-specific insights, enhancing interpretability and offering practical value for buyers, sellers, and real estate analysts seeking more reliable property valuations.
arXiv.org Artificial Intelligence
Aug-6-2025
- Country:
- Asia > South Korea (0.04)
- Europe
- Austria > Vienna (0.14)
- Germany
- Bavaria
- Regensburg (0.04)
- Upper Bavaria > Munich (0.04)
- Hesse > Darmstadt Region
- Frankfurt (0.04)
- North Rhine-Westphalia > Münster Region
- Münster (0.04)
- Saxony
- Bavaria
- Switzerland (0.04)
- North America > United States
- Hawaii (0.04)
- Virginia > Fairfax County (0.04)
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Banking & Finance > Real Estate (1.00)
- Technology: