Fast Compressed Self-Indexes with Deterministic Linear-Time Construction
July 06, 2017 Β· Declared Dead Β· π Algorithmica
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
J. Ian Munro, Gonzalo Navarro, Yakov Nekrich
arXiv ID
1707.01743
Category
cs.DS: Data Structures & Algorithms
Citations
7
Venue
Algorithmica
Last Checked
4 months ago
Abstract
We introduce a compressed suffix array representation that, on a text $T$ of length $n$ over an alphabet of size $Ο$, can be built in $O(n)$ deterministic time, within $O(n\logΟ)$ bits of working space, and counts the number of occurrences of any pattern $P$ in $T$ in time $O(|P| + \log\log_w Ο)$ on a RAM machine of $w=Ξ©(\log n)$-bit words. This new index outperforms all the other compressed indexes that can be built in linear deterministic time, and some others. The only faster indexes can be built in linear time only in expectation, or require $Ξ(n\log n)$ bits. We also show that, by using $O(n\logΟ)$ bits, we can build in linear time an index that counts in time $O(|P|/\log_Οn + \log n(\log\log n)^2)$, which is RAM-optimal for $w=Ξ(\log n)$ and sufficiently long patterns.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β Data Structures & Algorithms
π
π
The Cartographer
R.I.P.
π»
Ghosted
Route Planning in Transportation Networks
R.I.P.
π»
Ghosted
Near-linear time approximation algorithms for optimal transport via Sinkhorn iteration
R.I.P.
π»
Ghosted
Hierarchical Clustering: Objective Functions and Algorithms
R.I.P.
π»
Ghosted
Graph Isomorphism in Quasipolynomial Time
π
π
The Cartographer
Simulation optimization: A review of algorithms and applications
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted