A Brief Tutorial on Sample Size Calculations for Fairness Audits

December 07, 2023 ยท The Cartographer ยท ๐Ÿ› arXiv.org

๐Ÿ“š THE CARTOGRAPHER: The Cartographer
Survey/review paper โ€” maps the landscape rather than implementing a method.

"No code URL or promise found in abstract"
"Title-pattern auto-detect: A Brief Tutorial on Sample Size Calculations for Fairness Audits"

Evidence collected by the PWNC Scanner

Authors Harvineet Singh, Fan Xia, Mi-Ok Kim, Romain Pirracchio, Rumi Chunara, Jean Feng arXiv ID 2312.04745 Category stat.AP Cross-listed cs.LG Citations 2 Venue arXiv.org Last Checked 4 days ago
Abstract
In fairness audits, a standard objective is to detect whether a given algorithm performs substantially differently between subgroups. Properly powering the statistical analysis of such audits is crucial for obtaining informative fairness assessments, as it ensures a high probability of detecting unfairness when it exists. However, limited guidance is available on the amount of data necessary for a fairness audit, lacking directly applicable results concerning commonly used fairness metrics. Additionally, the consideration of unequal subgroup sample sizes is also missing. In this tutorial, we address these issues by providing guidance on how to determine the required subgroup sample sizes to maximize the statistical power of hypothesis tests for detecting unfairness. Our findings are applicable to audits of binary classification models and multiple fairness metrics derived as summaries of the confusion matrix. Furthermore, we discuss other aspects of audit study designs that can increase the reliability of audit results.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” stat.AP

R.I.P. ๐Ÿ‘ป Ghosted

Forecasting: theory and practice

Fotios Petropoulos, Daniele Apiletti, ... (+78 more)

stat.AP ๐Ÿ› International Journal of Forecasting ๐Ÿ“š 481 cites 5 years ago