Approximate Nearest Neighbor Search with Window Filters

February 01, 2024 ยท Entered Twilight ยท ๐Ÿ› International Conference on Machine Learning

๐Ÿ’ค TWILIGHT: Eternal Rest
Repo abandoned since publication

Repo contents: .gitignore, CMakeLists.txt, ParlayANN, README.md, artifacts, bin, experiments, figures, generate_datasets, python_bindings, requirements.txt, setup.py, src, tests, triangle_experiments

Authors Joshua Engels, Benjamin Landrum, Shangdi Yu, Laxman Dhulipala, Julian Shun arXiv ID 2402.00943 Category cs.DS: Data Structures & Algorithms Cross-listed cs.IR, cs.LG Citations 23 Venue International Conference on Machine Learning Repository https://github.com/JoshEngels/RangeFilteredANN โญ 45 Last Checked 2 months ago
Abstract
We define and investigate the problem of $\textit{c-approximate window search}$: approximate nearest neighbor search where each point in the dataset has a numeric label, and the goal is to find nearest neighbors to queries within arbitrary label ranges. Many semantic search problems, such as image and document search with timestamp filters, or product search with cost filters, are natural examples of this problem. We propose and theoretically analyze a modular tree-based framework for transforming an index that solves the traditional c-approximate nearest neighbor problem into a data structure that solves window search. On standard nearest neighbor benchmark datasets equipped with random label values, adversarially constructed embeddings, and image search embeddings with real timestamps, we obtain up to a $75\times$ speedup over existing solutions at the same level of recall.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Data Structures & Algorithms