Optimal Embedding Dimension for Sparse Subspace Embeddings
November 17, 2023 Β· Declared Dead Β· π Symposium on the Theory of Computing
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Shabarish Chenakkod, MichaΕ DereziΕski, Xiaoyu Dong, Mark Rudelson
arXiv ID
2311.10680
Category
cs.DS: Data Structures & Algorithms
Cross-listed
cs.LG,
math.NA,
stat.ML
Citations
22
Venue
Symposium on the Theory of Computing
Last Checked
4 months ago
Abstract
A random $m\times n$ matrix $S$ is an oblivious subspace embedding (OSE) with parameters $Ξ΅>0$, $Ξ΄\in(0,1/3)$ and $d\leq m\leq n$, if for any $d$-dimensional subspace $W\subseteq R^n$, $P\big(\,\forall_{x\in W}\ (1+Ξ΅)^{-1}\|x\|\leq\|Sx\|\leq (1+Ξ΅)\|x\|\,\big)\geq 1-Ξ΄.$ It is known that the embedding dimension of an OSE must satisfy $m\geq d$, and for any $ΞΈ> 0$, a Gaussian embedding matrix with $m\geq (1+ΞΈ) d$ is an OSE with $Ξ΅= O_ΞΈ(1)$. However, such optimal embedding dimension is not known for other embeddings. Of particular interest are sparse OSEs, having $s\ll m$ non-zeros per column, with applications to problems such as least squares regression and low-rank approximation. We show that, given any $ΞΈ> 0$, an $m\times n$ random matrix $S$ with $m\geq (1+ΞΈ)d$ consisting of randomly sparsified $\pm1/\sqrt s$ entries and having $s= O(\log^4(d))$ non-zeros per column, is an oblivious subspace embedding with $Ξ΅= O_ΞΈ(1)$. Our result addresses the main open question posed by Nelson and Nguyen (FOCS 2013), who conjectured that sparse OSEs can achieve $m=O(d)$ embedding dimension, and it improves on $m=O(d\log(d))$ shown by Cohen (SODA 2016). We use this to construct the first oblivious subspace embedding with $O(d)$ embedding dimension that can be applied faster than current matrix multiplication time, and to obtain an optimal single-pass algorithm for least squares regression. We further extend our results to Leverage Score Sparsification (LESS), which is a recently introduced non-oblivious embedding technique. We use LESS to construct the first subspace embedding with low distortion $Ξ΅=o(1)$ and optimal embedding dimension $m=O(d/Ξ΅^2)$ that can be applied in current matrix multiplication time.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β Data Structures & Algorithms
π
π
The Cartographer
R.I.P.
π»
Ghosted
Route Planning in Transportation Networks
R.I.P.
π»
Ghosted
Near-linear time approximation algorithms for optimal transport via Sinkhorn iteration
R.I.P.
π»
Ghosted
Hierarchical Clustering: Objective Functions and Algorithms
R.I.P.
π»
Ghosted
Graph Isomorphism in Quasipolynomial Time
π
π
The Cartographer
Simulation optimization: A review of algorithms and applications
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted