Assessment of creditworthiness models privacy-preserving training with synthetic data
December 31, 2022 Β· Declared Dead Β· π Hybrid Artificial Intelligence Systems
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Ricardo MuΓ±oz-Cancino, CristiΓ‘n Bravo, SebastiΓ‘n A. RΓos, Manuel GraΓ±a
arXiv ID
2301.01212
Category
q-fin.RM
Cross-listed
cs.LG,
cs.SI
Citations
3
Venue
Hybrid Artificial Intelligence Systems
Last Checked
3 months ago
Abstract
Credit scoring models are the primary instrument used by financial institutions to manage credit risk. The scarcity of research on behavioral scoring is due to the difficult data access. Financial institutions have to maintain the privacy and security of borrowers' information refrain them from collaborating in research initiatives. In this work, we present a methodology that allows us to evaluate the performance of models trained with synthetic data when they are applied to real-world data. Our results show that synthetic data quality is increasingly poor when the number of attributes increases. However, creditworthiness assessment models trained with synthetic data show a reduction of 3\% of AUC and 6\% of KS when compared with models trained with real data. These results have a significant impact since they encourage credit risk investigation from synthetic data, making it possible to maintain borrowers' privacy and to address problems that until now have been hampered by the availability of information.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β q-fin.RM
R.I.P.
π»
Ghosted
R.I.P.
π»
Ghosted
Sequential Deep Learning for Credit Risk Monitoring with Tabular Financial Data
R.I.P.
π»
Ghosted
Explainable AI for Interpretable Credit Scoring
R.I.P.
π»
Ghosted
Preference Elicitation and Robust Optimization with Multi-Attribute Quasi-Concave Choice Functions
R.I.P.
π»
Ghosted
Leveraging Convolutional Neural Network-Transformer Synergy for Predictive Modeling in Risk-Based Applications
R.I.P.
π»
Ghosted
Advanced Risk Prediction and Stability Assessment of Banks Using Time Series Transformer Models
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted