Differentially Private Non Parametric Copulas: Generating synthetic data with non parametric copulas under privacy guarantees

September 27, 2024 · Declared Dead · 🏛 2025 IEEE 38th International Symposium on Computer-Based Medical Systems (CBMS)

👻 CAUSE OF DEATH: Ghosted
No code link whatsoever

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors: Pablo A. Osorio-Marulanda, John Esteban Castro Ramirez, Mikel Hernández Jiménez, Nicolas Moreno Reyes, Gorka Epelde Unanue
arXiv ID: 2409.18611
Category: cs.LG (Machine Learning)
Cross-listed: cs.DB
Citations: 1
Venue: 2025 IEEE 38th International Symposium on Computer-Based Medical Systems (CBMS)
Last Checked: 4 months ago
Abstract
Creation of synthetic data models has represented a significant advancement across diverse scientific fields, but this technology also brings important privacy considerations for users. This work focuses on enhancing a non-parametric copula-based synthetic data generation model, DPNPC, by incorporating Differential Privacy through an Enhanced Fourier Perturbation method. The model generates synthetic data for mixed tabular databases while preserving privacy. We compare DPNPC with three other models (PrivBayes, DP-Copula, and DP-Histogram) across three public datasets, evaluating privacy, utility, and execution time. DPNPC outperforms others in modeling multivariate dependencies, maintaining privacy for small $ε$ values, and reducing training times. However, limitations include the need to assess the model's performance with different encoding methods and consider additional privacy attacks. Future research should address these areas to enhance privacy-preserving synthetic data generation.
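Since no code accompanies the paper, the following is only a minimal sketch of the general idea behind a differentially private histogram, the kind of baseline the abstract names as DP-Histogram. It is not DPNPC and not the Enhanced Fourier Perturbation method, and it is not taken from the authors. It assumes a single categorical column, add/remove-one-record neighbouring datasets (so the count vector has L1 sensitivity 1), and the standard Laplace mechanism. All function names (`laplace_noise`, `dp_histogram`, `sample_synthetic`) are hypothetical.

```python
import random
from collections import Counter


def laplace_noise(scale, rng):
    # The difference of two i.i.d. exponentials with mean `scale`
    # is distributed as Laplace(0, scale).
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def dp_histogram(values, categories, epsilon, rng):
    """Return noisy category probabilities (illustrative sketch only).

    Assumption: neighbouring datasets differ by adding or removing one
    record, so the vector of counts has L1 sensitivity 1 and Laplace
    noise with scale 1/epsilon is added to each count.
    """
    counts = Counter(values)
    scale = 1.0 / epsilon
    # Clamping and normalising are post-processing steps and do not
    # consume additional privacy budget.
    noisy = {
        c: max(0.0, counts.get(c, 0) + laplace_noise(scale, rng))
        for c in categories
    }
    total = sum(noisy.values())
    if total == 0.0:
        # Every noisy count was clamped to zero: fall back to uniform.
        return {c: 1.0 / len(categories) for c in categories}
    return {c: v / total for c, v in noisy.items()}


def sample_synthetic(probs, n, rng):
    # Draw n synthetic values from the noisy distribution.
    cats = list(probs)
    weights = [probs[c] for c in cats]
    return rng.choices(cats, weights=weights, k=n)


if __name__ == "__main__":
    rng = random.Random(0)
    column = ["a"] * 60 + ["b"] * 30 + ["c"] * 10
    probs = dp_histogram(column, ["a", "b", "c"], epsilon=0.5, rng=rng)
    print(probs)
    print(sample_synthetic(probs, 10, rng))
```

Smaller values of `epsilon` give a larger noise scale and therefore stronger privacy at the cost of utility, which is the trade-off the abstract refers to when it mentions small $ε$ values.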
