Comparing asymptotic variances of adversarial versus non-adversarial R-NCE
Determine whether a general relationship (e.g., inequality, equivalence, or dominance) exists between the asymptotic variance of the R-NCE estimator under non-adversarial training, Vθ = −∇θ^2 L(θ⋆, ξ⋆)^{-1}, and the asymptotic variance under adversarial (Stackelberg) training, Vθ^a = −∇θ^2 L(θ⋆, ξ̄)^{-1}, when the proposal family F_n is not realizable; specifically, ascertain conditions under which adversarial training improves, worsens, or leaves unchanged the estimator’s asymptotic efficiency relative to non-adversarial training.
References
Given the quite non-trivial relationship between the proposal parameters that a non-adversarial negative sampler converges to versus the Stackelberg adversary, it is not clear that a general statement can be made comparing these two variances. Therefore, the theoretical benefits of adversarial training are quite unclear.