Performance-lossless Black-box Model Watermarking
Abstract: With the development of deep learning, high-value and high-cost models have become valuable assets, and related intellectual property protection technologies have become a hot topic. However, existing model watermarking work in black-box scenarios mainly originates from training-based backdoor methods, which probably degrade primary task performance. To address this, we propose a branch backdoor-based model watermarking protocol to protect model intellectual property, where a construction based on a message authentication scheme is adopted as the branch indicator after a comparative analysis with secure cryptographic technologies primitives. We prove the lossless performance of the protocol by reduction. In addition, we analyze the potential threats to the protocol and provide a secure and feasible watermarking instance for LLMs.
- OpenAI, Gpt-4 technical report (2023). arXiv:2303.08774.
- arXiv:1406.2661.
- arXiv:1809.11096.
- arXiv:2106.12423.
- arXiv:2105.05233.
- arXiv:2204.03458.
- arXiv:2112.10752.
- doi:10.1109/FOCS54457.2022.00092.
- Q. H. Dang, Secure hash standard (2015).
- A. Code, Keyed-hash message authentication code (hmac) (2002).
- doi:10.1145/3474085.3475591. URL https://doi.org/10.1145/3474085.3475591
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.