Optimizing watermarks for large language models

Published 28 Dec 2023 in cs.CR, cs.AI, and cs.CL | arXiv:2312.17295v1

Abstract: With the rise of LLMs and concerns about potential misuse, watermarks for generative LLMs have recently attracted much attention. An important aspect of such watermarks is the trade-off between their identifiability and their impact on the quality of the generated text. This paper introduces a systematic approach to this trade-off in terms of a multi-objective optimization problem. For a large class of robust, efficient watermarks, the associated Pareto optimal solutions are identified and shown to outperform the currently default watermark.
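The "currently default watermark" that the abstract compares against is presumably the green-list soft watermark of Kirchenbauer et al. (A Watermark for Large Language Models), in which a pseudorandom "green" subset of the vocabulary, seeded on the previous token, receives a logit bonus at generation time, and detection counts green tokens via a z-test. A minimal sketch of that scheme, assuming a toy flat-logit model; `VOCAB_SIZE`, `GAMMA`, and `DELTA` are illustrative values chosen here, not the paper's:

```python
import hashlib
import math
import random

VOCAB_SIZE = 50  # toy vocabulary; real LLMs use tens of thousands of tokens
GAMMA = 0.5      # fraction of the vocabulary placed on the "green" list
DELTA = 2.0      # logit bonus added to green tokens (controls the trade-off)

def green_list(prev_token: int) -> set:
    """Pseudorandomly partition the vocabulary, seeded on the previous token."""
    seed = int(hashlib.sha256(str(prev_token).encode()).hexdigest(), 16)
    rng = random.Random(seed)
    ids = list(range(VOCAB_SIZE))
    rng.shuffle(ids)
    return set(ids[: int(GAMMA * VOCAB_SIZE)])

def watermarked_sample(logits: list, prev_token: int, rng: random.Random) -> int:
    """Add DELTA to green-list logits, then sample from the softmax."""
    greens = green_list(prev_token)
    biased = [l + (DELTA if i in greens else 0.0) for i, l in enumerate(logits)]
    m = max(biased)
    weights = [math.exp(b - m) for b in biased]
    r, acc = rng.random() * sum(weights), 0.0
    for i, w in enumerate(weights):
        acc += w
        if r <= acc:
            return i
    return VOCAB_SIZE - 1

def z_score(tokens: list) -> float:
    """Detection: count green tokens and test against the GAMMA null hypothesis."""
    hits = sum(1 for prev, tok in zip(tokens, tokens[1:]) if tok in green_list(prev))
    n = len(tokens) - 1
    return (hits - GAMMA * n) / math.sqrt(GAMMA * (1 - GAMMA) * n)

rng = random.Random(0)
flat_logits = [0.0] * VOCAB_SIZE  # maximally entropic toy "model"
text = [0]
for _ in range(200):
    text.append(watermarked_sample(flat_logits, text[-1], rng))
print(f"z-score of watermarked text: {z_score(text):.2f}")
```

The identifiability/quality trade-off the abstract formalizes is visible here as the choice of `DELTA`: a larger bonus inflates the z-score (easier detection) but distorts the model's distribution more. The paper's contribution, as stated, is to treat this as a multi-objective optimization and characterize the Pareto-optimal schemes within a broad class of such watermarks.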
