Assessment of creditworthiness models privacy-preserving training with synthetic data

December 31, 2022 Β· Declared Dead Β· πŸ› Hybrid Artificial Intelligence Systems

πŸ‘» CAUSE OF DEATH: Ghosted
No code link whatsoever

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Ricardo MuΓ±oz-Cancino, CristiΓ‘n Bravo, SebastiΓ‘n A. RΓ­os, Manuel GraΓ±a arXiv ID 2301.01212 Category q-fin.RM Cross-listed cs.LG, cs.SI Citations 3 Venue Hybrid Artificial Intelligence Systems Last Checked 3 months ago
Abstract
Credit scoring models are the primary instrument used by financial institutions to manage credit risk. The scarcity of research on behavioral scoring is due to the difficult data access. Financial institutions have to maintain the privacy and security of borrowers' information refrain them from collaborating in research initiatives. In this work, we present a methodology that allows us to evaluate the performance of models trained with synthetic data when they are applied to real-world data. Our results show that synthetic data quality is increasingly poor when the number of attributes increases. However, creditworthiness assessment models trained with synthetic data show a reduction of 3\% of AUC and 6\% of KS when compared with models trained with real data. These results have a significant impact since they encourage credit risk investigation from synthetic data, making it possible to maintain borrowers' privacy and to address problems that until now have been hampered by the availability of information.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

πŸ“œ Similar Papers

In the same crypt β€” q-fin.RM

Died the same way β€” πŸ‘» Ghosted