A Survey of Methods for Handling Disk Data Imbalance
October 13, 2023 ยท The Cartographer ยท ๐ International Journal on Cybernetics & Informatics
"No code URL or promise found in abstract"
"Title-pattern auto-detect: A Survey of Methods for Handling Disk Data Imbalance"
Evidence collected by the PWNC Scanner
Authors
Shuangshuang Yuan, Peng Wu, Yuehui Chen, Qiang Li
arXiv ID
2310.08867
Category
cs.LG: Machine Learning
Cross-listed
cs.DB,
stat.ME
Citations
0
Venue
International Journal on Cybernetics & Informatics
Last Checked
4 days ago
Abstract
Class imbalance exists in many classification problems, and since the data is designed for accuracy, imbalance in data classes can lead to classification challenges with a few classes having higher misclassification costs. The Backblaze dataset, a widely used dataset related to hard discs, has a small amount of failure data and a large amount of health data, which exhibits a serious class imbalance. This paper provides a comprehensive overview of research in the field of imbalanced data classification. The discussion is organized into three main aspects: data-level methods, algorithmic-level methods, and hybrid methods. For each type of method, we summarize and analyze the existing problems, algorithmic ideas, strengths, and weaknesses. Additionally, the challenges of unbalanced data classification are discussed, along with strategies to address them. It is convenient for researchers to choose the appropriate method according to their needs.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Machine Learning
๐ฎ
๐ฎ
The Ethereal
๐ฎ
๐ฎ
The Ethereal
Continuous control with deep reinforcement learning
๐
๐
Old Age
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
๐
๐
Old Age
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
๐
๐
Old Age
SGDR: Stochastic Gradient Descent with Warm Restarts
๐ฎ
๐ฎ
The Ethereal