Machine Generated Text: A Comprehensive Survey of Threat Models and Detection Methods
October 13, 2022 ยท The Cartographer ยท ๐ IEEE Access
"No code URL or promise found in abstract"
"Title-pattern auto-detect: Machine Generated Text: A Comprehensive Survey of Threat Models and Detection Methods"
Evidence collected by the PWNC Scanner
Authors
Evan Crothers, Nathalie Japkowicz, Herna Viktor
arXiv ID
2210.07321
Category
cs.CL: Computation & Language
Cross-listed
cs.CR,
cs.CY,
cs.LG
Citations
166
Venue
IEEE Access
Last Checked
1 day ago
Abstract
Machine generated text is increasingly difficult to distinguish from human authored text. Powerful open-source models are freely available, and user-friendly tools that democratize access to generative models are proliferating. ChatGPT, which was released shortly after the first edition of this survey, epitomizes these trends. The great potential of state-of-the-art natural language generation (NLG) systems is tempered by the multitude of avenues for abuse. Detection of machine generated text is a key countermeasure for reducing abuse of NLG models, with significant technical challenges and numerous open problems. We provide a survey that includes both 1) an extensive analysis of threat models posed by contemporary NLG systems, and 2) the most complete review of machine generated text detection methods to date. This survey places machine generated text within its cybersecurity and social context, and provides strong guidance for future work addressing the most critical threat models, and ensuring detection systems themselves demonstrate trustworthiness through fairness, robustness, and accountability.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Computation & Language
๐
๐
Old Age
๐
๐
Old Age
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
๐
๐
Old Age
XLNet: Generalized Autoregressive Pretraining for Language Understanding
๐๏ธ
๐๏ธ
Transcended
Effective Approaches to Attention-based Neural Machine Translation
๐
๐
Old Age
A large annotated corpus for learning natural language inference
๐
๐
Old Age