Superior Parallel Big Data Clustering through Competitive Stochastic Sample Size Optimization in Big-means

March 27, 2024 ยท Declared Dead ยท ๐Ÿ› Asian Conference on Intelligent Information and Database Systems

๐Ÿ‘ป CAUSE OF DEATH: Ghosted
No code link whatsoever

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Rustam Mussabayev, Ravil Mussabayev arXiv ID 2403.18766 Category cs.LG: Machine Learning Cross-listed cs.AI, cs.DC, cs.IR Citations 4 Venue Asian Conference on Intelligent Information and Database Systems Last Checked 4 months ago
Abstract
This paper introduces a novel K-means clustering algorithm, an advancement on the conventional Big-means methodology. The proposed method efficiently integrates parallel processing, stochastic sampling, and competitive optimization to create a scalable variant designed for big data applications. It addresses scalability and computation time challenges typically faced with traditional techniques. The algorithm adjusts sample sizes dynamically for each worker during execution, optimizing performance. Data from these sample sizes are continually analyzed, facilitating the identification of the most efficient configuration. By incorporating a competitive element among workers using different sample sizes, efficiency within the Big-means algorithm is further stimulated. In essence, the algorithm balances computational time and clustering quality by employing a stochastic, competitive sampling strategy in a parallel computing setting.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Machine Learning

Died the same way โ€” ๐Ÿ‘ป Ghosted