Assessment of creditworthiness models privacy-preserving training with synthetic data

December 31, 2022 · Declared Dead · 🏛 Hybrid Artificial Intelligence Systems

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Ricardo Muñoz-Cancino, Cristián Bravo, Sebastián A. Ríos, Manuel Graña arXiv ID 2301.01212 Category q-fin.RM Cross-listed cs.LG, cs.SI Citations 3 Venue Hybrid Artificial Intelligence Systems Last Checked 3 months ago

Abstract

Credit scoring models are the primary instrument used by financial institutions to manage credit risk. The scarcity of research on behavioral scoring is due to the difficult data access. Financial institutions have to maintain the privacy and security of borrowers' information refrain them from collaborating in research initiatives. In this work, we present a methodology that allows us to evaluate the performance of models trained with synthetic data when they are applied to real-world data. Our results show that synthetic data quality is increasingly poor when the number of attributes increases. However, creditworthiness assessment models trained with synthetic data show a reduction of 3\% of AUC and 6\% of KS when compared with models trained with real data. These results have a significant impact since they encourage credit risk investigation from synthetic data, making it possible to maintain borrowers' privacy and to address problems that until now have been hampered by the availability of information.

📄 View on arXiv 🌐 View on ar5iv 📑 PDF 🎉 Report Code Found

Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

📜 Similar Papers

In the same crypt — q-fin.RM

R.I.P. 👻 Ghosted

Can Deep Learning Predict Risky Retail Investors? A Case Study in Financial Risk Behavior Forecasting

Yaodong Yang, Alisa Kolesnikova, ... (+4 more)

q-fin.RM 🏛 European Journal of Operational Research 📚 121 cites 7 years ago

R.I.P. 👻 Ghosted

Sequential Deep Learning for Credit Risk Monitoring with Tabular Financial Data

Jillian M. Clements, Di Xu, ... (+2 more)

q-fin.RM 🏛 arXiv 📚 53 cites 5 years ago

R.I.P. 👻 Ghosted

Explainable AI for Interpretable Credit Scoring

Lara Marie Demajo, Vince Vella, Alexiei Dingli

q-fin.RM 🏛 Computer Science & Information Technology (CS & IT) 📚 48 cites 5 years ago

R.I.P. 👻 Ghosted

Preference Elicitation and Robust Optimization with Multi-Attribute Quasi-Concave Choice Functions

William B. Haskell, Wenjie Huang, Huifu Xu

q-fin.RM 🏛 arXiv 📚 17 cites 8 years ago

R.I.P. 👻 Ghosted

Leveraging Convolutional Neural Network-Transformer Synergy for Predictive Modeling in Risk-Based Applications

Yuhan Wang, Zhen Xu, ... (+3 more)

q-fin.RM 🏛 ICEIECC 📚 13 cites 1 year ago

R.I.P. 👻 Ghosted

Advanced Risk Prediction and Stability Assessment of Banks Using Time Series Transformer Models

Wenying Sun, Zhen Xu, ... (+4 more)

q-fin.RM 🏛 BigData 📚 11 cites 1 year ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Federated Learning: Strategies for Improving Communication Efficiency

Jakub Konečný, H. Brendan McMahan, ... (+4 more)

cs.LG 🏛 arXiv 📚 5.2K cites 9 years ago

R.I.P. 👻 Ghosted

In-Datacenter Performance Analysis of a Tensor Processing Unit

Norman P. Jouppi, Cliff Young, ... (+73 more)

cs.AR 🏛 ISCA 📚 5.1K cites 9 years ago

R.I.P. 👻 Ghosted

Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning

Hoo-Chang Shin, Holger R. Roth, ... (+7 more)

cs.CV 🏛 IEEE TMI 📚 4.9K cites 10 years ago

R.I.P. 👻 Ghosted

Explanation in Artificial Intelligence: Insights from the Social Sciences

Tim Miller

cs.AI 🏛 AI 📚 4.9K cites 8 years ago