Optimal data splitting for shrinkage mean estimation
Determine the optimal sample-splitting strategy for estimating the base location estimator \widehat{\kappa} and the shrinkage mean estimator \widehat{\mu}(\widehat{\kappa}; X_{1:n}), including the proportion of data that should be allocated to computing \widehat{\kappa} to enforce independence (Assumption 4), and ascertain whether computing both \widehat{\kappa} and \widehat{\mu}(\widehat{\kappa}; X_{1:n}) on the full sample—thereby violating Assumption 4—yields superior performance.
References
Splitting the sample is a natural way to fulfill Assumption 4. Nevertheless, it is not clear how much of the available data should be dedicated to the base estimator, or even if computing the base estimator and the shrinkage estimator on the full sample (ignoring Assumption 4) is the better approach.