A Survey of Methods for Handling Disk Data Imbalance

October 13, 2023 ยท The Cartographer ยท ๐Ÿ› International Journal on Cybernetics & Informatics

๐Ÿ“š THE CARTOGRAPHER: The Cartographer
Survey/review paper โ€” maps the landscape rather than implementing a method.

"No code URL or promise found in abstract"
"Title-pattern auto-detect: A Survey of Methods for Handling Disk Data Imbalance"

Evidence collected by the PWNC Scanner

Authors Shuangshuang Yuan, Peng Wu, Yuehui Chen, Qiang Li arXiv ID 2310.08867 Category cs.LG: Machine Learning Cross-listed cs.DB, stat.ME Citations 0 Venue International Journal on Cybernetics & Informatics Last Checked 4 days ago
Abstract
Class imbalance exists in many classification problems, and since the data is designed for accuracy, imbalance in data classes can lead to classification challenges with a few classes having higher misclassification costs. The Backblaze dataset, a widely used dataset related to hard discs, has a small amount of failure data and a large amount of health data, which exhibits a serious class imbalance. This paper provides a comprehensive overview of research in the field of imbalanced data classification. The discussion is organized into three main aspects: data-level methods, algorithmic-level methods, and hybrid methods. For each type of method, we summarize and analyze the existing problems, algorithmic ideas, strengths, and weaknesses. Additionally, the challenges of unbalanced data classification are discussed, along with strategies to address them. It is convenient for researchers to choose the appropriate method according to their needs.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Machine Learning