Approximate Nearest Neighbor Search with Window Filters
February 01, 2024 ยท Entered Twilight ยท ๐ International Conference on Machine Learning
Repo contents: .gitignore, CMakeLists.txt, ParlayANN, README.md, artifacts, bin, experiments, figures, generate_datasets, python_bindings, requirements.txt, setup.py, src, tests, triangle_experiments
Authors
Joshua Engels, Benjamin Landrum, Shangdi Yu, Laxman Dhulipala, Julian Shun
arXiv ID
2402.00943
Category
cs.DS: Data Structures & Algorithms
Cross-listed
cs.IR,
cs.LG
Citations
23
Venue
International Conference on Machine Learning
Repository
https://github.com/JoshEngels/RangeFilteredANN
โญ 45
Last Checked
2 months ago
Abstract
We define and investigate the problem of $\textit{c-approximate window search}$: approximate nearest neighbor search where each point in the dataset has a numeric label, and the goal is to find nearest neighbors to queries within arbitrary label ranges. Many semantic search problems, such as image and document search with timestamp filters, or product search with cost filters, are natural examples of this problem. We propose and theoretically analyze a modular tree-based framework for transforming an index that solves the traditional c-approximate nearest neighbor problem into a data structure that solves window search. On standard nearest neighbor benchmark datasets equipped with random label values, adversarially constructed embeddings, and image search embeddings with real timestamps, we obtain up to a $75\times$ speedup over existing solutions at the same level of recall.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Data Structures & Algorithms
R.I.P.
๐ป
Ghosted
R.I.P.
๐ป
Ghosted
Relief-Based Feature Selection: Introduction and Review
R.I.P.
๐ป
Ghosted
Route Planning in Transportation Networks
R.I.P.
๐ป
Ghosted
Near-linear time approximation algorithms for optimal transport via Sinkhorn iteration
R.I.P.
๐ป
Ghosted
Hierarchical Clustering: Objective Functions and Algorithms
R.I.P.
๐ป
Ghosted