Topology-Preserving Scaling in Data Augmentation
November 29, 2024 Β· Declared Dead Β· π Maltepe journal of mathematics
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Vu-Anh Le, Mehmet Dik
arXiv ID
2411.19512
Category
math.AT
Cross-listed
cs.IT,
cs.LG
Citations
0
Venue
Maltepe journal of mathematics
Last Checked
3 months ago
Abstract
We propose an algorithmic framework for dataset normalization in data augmentation pipelines that preserves topological stability under non-uniform scaling transformations. Given a finite metric space \( X \subset \mathbb{R}^n \) with Euclidean distance \( d_X \), we consider scaling transformations defined by scaling factors \( s_1, s_2, \ldots, s_n > 0 \). Specifically, we define a scaling function \( S \) that maps each point \( x = (x_1, x_2, \ldots, x_n) \in X \) to \[ S(x) = (s_1 x_1, s_2 x_2, \ldots, s_n x_n). \] Our main result establishes that the bottleneck distance \( d_B(D, D_S) \) between the persistence diagrams \( D \) of \( X \) and \( D_S \) of \( S(X) \) satisfies: \[ d_B(D, D_S) \leq (s_{\max} - s_{\min}) \cdot \operatorname{diam}(X), \] where \( s_{\min} = \min_{1 \leq i \leq n} s_i \), \( s_{\max} = \max_{1 \leq i \leq n} s_i \), and \( \operatorname{diam}(X) \) is the diameter of \( X \). Based on this theoretical guarantee, we formulate an optimization problem to minimize the scaling variability \( Ξ_s = s_{\max} - s_{\min} \) under the constraint \( d_B(D, D_S) \leq Ξ΅\), where \( Ξ΅> 0 \) is a user-defined tolerance. We develop an algorithmic solution to this problem, ensuring that data augmentation via scaling transformations preserves essential topological features. We further extend our analysis to higher-dimensional homological features, alternative metrics such as the Wasserstein distance, and iterative or probabilistic scaling scenarios. Our contributions provide a rigorous mathematical framework for dataset normalization in data augmentation pipelines, ensuring that essential topological characteristics are maintained despite scaling transformations.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β math.AT
R.I.P.
π»
Ghosted
R.I.P.
π»
Ghosted
Persistence Diagrams with Linear Machine Learning Models
R.I.P.
π»
Ghosted
Comparing persistence diagrams through complex vectors
R.I.P.
π»
Ghosted
A Riemannian Framework for Statistical Analysis of Topological Persistence Diagrams
R.I.P.
π»
Ghosted
Path homologies of deep feedforward networks
R.I.P.
π»
Ghosted
From trees to barcodes and back again: theoretical and statistical perspectives
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted