reproducibilityindex.ai

ROBIN: Robust and Invisible Watermarks for Diffusion Models with Adversarial Optimization

Authors: Huayang Huang, Yu Wu, Qian Wang

NeurIPS 2024 | Conference PDF | Archive PDF | Plain Text | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	Experiments on various diffusion models demonstrate the watermark remains verifiable even under significant image tampering and shows superior invisibility compared to other state-of-the-art robust watermarking methods.
Researcher Affiliation	Academia	Huayang Huang1, Yu Wu1, , Qian Wang2 1School of Computer Science, Wuhan University 2School of Cyber Science and Engineering, Wuhan University {hyhuang,wuyucs,qianwang}@whu.edu.cn
Pseudocode	Yes	Algorithm 1 Adversarial Optimization Algorithm Input: Dataset X, P; max epoch N; hyper-parameters α, β, λ; watermark mask M Output: Optimized watermark pair wi, wp;
Open Source Code	Yes	The codes will be available at https://github.com/Hannah1102/ROBIN.
Open Datasets	Yes	For the latent diffusion model, we utilize the widely available Stable Diffusion-v2 [31] and the stable-diffusion-prompts dataset from Gustavosta [1]. We also test on a guided diffusion model [2] trained on the Image Net [12]
Dataset Splits	No	The paper states, "We optimize the watermark and the hiding prompt using 50 generated images," but does not provide explicit training, validation, and test dataset splits with percentages or sample counts for the main datasets used.
Hardware Specification	Yes	All experiments are conducted on an NVIDIA Ge Force RTX 3090 GPU.
Software Dependencies	No	The paper does not explicitly list specific software dependencies with version numbers (e.g., Python, PyTorch, or specific library versions).
Experiment Setup	Yes	We utilize 50 steps of deterministic sampling for both models. Stable Diffusion employs the second-order multistep DPM-Solver algorithm [23] with a default guidance scale of 7.5. Image Net diffusion model leverages the DDIM sampling algorithm [35]. We optimize the watermark and the hiding prompt using 50 generated images. The learning rates for the image watermark and prompt guidance are 0.8 and 5e-04, respectively, with a total of 1,000 optimization rounds. The default image watermark covers 70% of the image frequency domain.