Most Ligand-Based Classification Benchmarks Reward Memorization Rather than Generalization

June 20, 2017 · Declared Dead · 🏛 Journal of Chemical Information and Modeling

👻 CAUSE OF DEATH: Ghosted
No code link whatsoever

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors: Izhar Wallach, Abraham Heifets
arXiv ID: 1706.06619
Category: q-bio.QM
Cross-listed: cs.LG, stat.ML
Citations: 143
Venue: Journal of Chemical Information and Modeling
Last Checked: 2 months ago
Abstract
Undetected overfitting can occur when there are significant redundancies between training and validation data. We describe AVE, a new measure of training-validation redundancy for ligand-based classification problems that accounts for the similarity amongst inactive molecules as well as active. We investigated seven widely-used benchmarks for virtual screening and classification, and show that the amount of AVE bias strongly correlates with the performance of ligand-based predictive methods irrespective of the predicted property, chemical fingerprint, similarity measure, or previously-applied unbiasing techniques. Therefore, it may be that the previously-reported performance of most ligand-based methods can be explained by overfitting to benchmarks rather than good prospective accuracy.
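
The abstract names AVE but does not give its formula, so the following is only a rough sketch of how a training-validation redundancy score of this general kind could be computed. It rests on assumptions that do not come from this page: molecules are represented as sets of "on" fingerprint bits, similarity is Tanimoto distance, and the score compares how closely validation molecules sit to same-class versus opposite-class training molecules, averaged over a grid of distance thresholds. The function names (`tanimoto_distance`, `nn_coverage`, `redundancy_bias`) and the threshold grid are hypothetical and chosen for illustration.

```python
from typing import FrozenSet, Optional, Sequence

# Assumption: a fingerprint is the set of indices of its "on" bits.
Fingerprint = FrozenSet[int]


def tanimoto_distance(a: Fingerprint, b: Fingerprint) -> float:
    """Return 1 - |a & b| / |a | b|; two empty fingerprints count as identical."""
    union = len(a | b)
    if union == 0:
        return 0.0
    return 1.0 - len(a & b) / union


def nn_coverage(
    valid: Sequence[Fingerprint],
    train: Sequence[Fingerprint],
    thresholds: Sequence[float],
) -> float:
    """Average, over thresholds, of the fraction of validation molecules
    whose nearest training neighbor lies within that distance."""
    if not valid or not train or not thresholds:
        raise ValueError("valid, train and thresholds must all be non-empty")
    nearest = [min(tanimoto_distance(v, t) for t in train) for v in valid]
    total = 0.0
    for d in thresholds:
        total += sum(1 for n in nearest if n <= d) / len(nearest)
    return total / len(thresholds)


def redundancy_bias(
    valid_actives: Sequence[Fingerprint],
    valid_inactives: Sequence[Fingerprint],
    train_actives: Sequence[Fingerprint],
    train_inactives: Sequence[Fingerprint],
    thresholds: Optional[Sequence[float]] = None,
) -> float:
    """Same-class coverage minus opposite-class coverage, summed over
    actives and inactives. Higher values mean the validation set is easier
    to label by nearest-neighbor lookup in the training set."""
    if thresholds is None:
        # Assumed grid: 0.00, 0.01, ..., 1.00
        thresholds = [i / 100 for i in range(101)]
    aa = nn_coverage(valid_actives, train_actives, thresholds)
    ai = nn_coverage(valid_actives, train_inactives, thresholds)
    ii = nn_coverage(valid_inactives, train_inactives, thresholds)
    ia = nn_coverage(valid_inactives, train_actives, thresholds)
    return (aa - ai) + (ii - ia)
```

Under these assumptions, a split whose validation molecules are no closer to their own class than to the other class scores near zero, while a split that duplicates training molecules within each class scores high. The inactive-to-inactive term reflects the abstract's point that similarity amongst inactive molecules counts as well as similarity amongst actives.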