Predicting and Explaining Customer Data Sharing in the Open Banking
de Brito, João B. G., Heldt, Rodrigo, Silveira, Cleo S., Bogaert, Matthias, Bucco, Guilherme B., Luce, Fernando B., Becker, João L., Zabala, Filipe J., Anzanello, Michel J.
–arXiv.org Artificial Intelligence
The emergence of Open Banking represents a significant shift in financial data management, influencing financial institutions' market dynamics and marketing strategies. This increased competition creates opportunities and challenges, as institutions manage data inflow to improve products and services while mitigating data outflow that could aid competitors. This study introduces a framework to predict customers' propensity to share data via Open Banking and interprets this behavior through Explanatory Model Analysis (EMA). Using data from a large Brazilian financial institution with approximately 3.2 million customers, a hybrid data balancing strategy incorporating ADASYN and NEARMISS techniques was employed to address the infrequency of data sharing and enhance the training of XGBoost models. These models accurately predicted customer data sharing, achieving 91.39% accuracy for inflow and 91.53% for outflow. The EMA phase combined the Shapley Additive Explanations (SHAP) method with the Classification and Regression Tree (CART) technique, revealing the most influential features on customer decisions. Key features included the number of transactions and purchases in mobile channels, interactions within these channels, and credit-related features, particularly credit card usage across the national banking system. These results highlight the critical role of mobile engagement and credit in driving customer data-sharing behaviors, providing financial institutions with strategic insights to enhance competitiveness and innovation in the Open Banking environment.
arXiv.org Artificial Intelligence
Jul-4-2025
- Country:
- South America > Brazil
- São Paulo (0.04)
- Rio Grande do Sul > Porto Alegre (0.04)
- Oceania
- New Zealand (0.04)
- Australia (0.04)
- North America
- Canada (0.04)
- United States
- New Jersey (0.04)
- District of Columbia > Washington (0.04)
- New York
- New York County > New York City (0.04)
- Monroe County > Rochester (0.04)
- Massachusetts > Suffolk County
- Boston (0.04)
- California > San Diego County
- San Diego (0.04)
- Europe
- United Kingdom > England
- Oxfordshire > Oxford (0.04)
- Cambridgeshire > Cambridge (0.04)
- Belgium > Flanders
- East Flanders > Ghent (0.04)
- United Kingdom > England
- Asia
- Singapore (0.04)
- Taiwan (0.04)
- South Korea (0.04)
- Japan (0.04)
- India (0.04)
- South America > Brazil
- Genre:
- Research Report (1.00)
- Industry:
- Banking & Finance > Credit (0.89)
- Information Technology > Security & Privacy (0.68)
- Technology:
- Information Technology
- Data Science (1.00)
- Artificial Intelligence > Machine Learning
- Decision Tree Learning (0.70)
- Statistical Learning (0.69)
- Ensemble Learning (0.51)
- Neural Networks (0.46)
- Information Technology