Optimal configuration of PRISM’s spectral shaping (single-sided vs. dual-sided, and dimension choice)
Determine whether dual-sided spectral preconditioning—applying innovation-augmented polar decomposition on both row and column correlations—is superior to single-sided preconditioning in the PRISM optimizer, and, if single-sided preconditioning is used, identify which dimension should be targeted for optimal performance, specifically whether to apply left-sided (row-correlation) or right-sided (column-correlation) preconditioning.
References
While our current method defaults to a single-sided approach for efficiency—contrasting with the dual-sided preconditioners in Kronecker-factored methods—the optimal configuration remains an open question. Specifically, whether a dual-sided approach is superior , and which dimension is optimal to target in the single-sided regime, are yet to be determined.