Assess performance of projection-based training under alternative metrics
Determine whether the performance pattern observed in the synthetic learning study—namely, that training with targets defined by the Kullback–Leibler projection of the model’s prediction onto the admissible set F^{box}(π) induced by the possibilistic annotation π yields higher test performance than using the fixed antipignistic probability target—also holds when evaluated with metrics other than top‑1 accuracy, specifically negative log-likelihood, Brier score, and measures of constraint satisfaction with respect to the linear constraint sets C_i that define F^{box}(π).
References
We focus on top-$1$ accuracy as the primary evaluation metric in this study. Assessing whether the same pattern holds for other criteria (e.g., negative log-likelihood , Brier score , or measures of constraint satisfaction with respect to the constraint sets $C_i$ defining $\mathcal F{\mathrm{box}$) requires additional experiments and is left for future work.