Does instruction-based unlearning extend beyond large language models to other generative models?
Determine whether instruction-based unlearning—modifying model behavior at inference time via natural-language instructions—extends from large language models to other generative models, including diffusion-based image generation systems.
References
Instruction-based unlearning has proven effective for modifying the behavior of LLMs at inference time, but whether this paradigm extends to other generative models remains unclear.
— Why Instruction-Based Unlearning Fails in Diffusion Models?
(2604.01514 - zhang et al., 2 Apr 2026) in Abstract