BERT Rankers are Brittle: a Study using Adversarial Document Perturbations

June 23, 2022 · Declared Dead · 🏛 International Conference on the Theory of Information Retrieval

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Yumeng Wang, Lijun Lyu, Avishek Anand arXiv ID 2206.11724 Category cs.IR: Information Retrieval Cross-listed cs.AI Citations 31 Venue International Conference on the Theory of Information Retrieval Last Checked 4 months ago

Abstract

Contextual ranking models based on BERT are now well established for a wide range of passage and document ranking tasks. However, the robustness of BERT-based ranking models under adversarial inputs is under-explored. In this paper, we argue that BERT-rankers are not immune to adversarial attacks targeting retrieved documents given a query. Firstly, we propose algorithms for adversarial perturbation of both highly relevant and non-relevant documents using gradient-based optimization methods. The aim of our algorithms is to add/replace a small number of tokens to a highly relevant or non-relevant document to cause a large rank demotion or promotion. Our experiments show that a small number of tokens can already result in a large change in the rank of a document. Moreover, we find that BERT-rankers heavily rely on the document start/head for relevance prediction, making the initial part of the document more susceptible to adversarial attacks. More interestingly, we find a small set of recurring adversarial words that when added to documents result in successful rank demotion/promotion of any relevant/non-relevant document respectively. Finally, our adversarial tokens also show particular topic preferences within and across datasets, exposing potential biases from BERT pre-training or downstream datasets.