Extend mixture reuse theory beyond parametric regression models
Extend the theoretical analysis of the FullMixtureReuse mechanism for data mixture recomputation—currently developed under log-linear parametric regression models—to non-parametric regression models (e.g., tree-based or Gaussian process regressors) that induce non-convex and non-differentiable mixture optimization objectives. Explicitly formulate conditions and guarantees analogous to those proved under the log-linear assumption, or characterize fundamental obstacles if such guarantees cannot hold.
References
Second, our theoretical analysis of mixture reuse assumes log-linear regression models. The analysis should extend naturally to other parametric models like AutoScale and BiMix, but it is less clear how to extend it to non-parametric models that yield non-convex and non-differentiable mixing objectives.