Composite Quasi-Likelihood (CQL)
- Composite Quasi-Likelihood (CQL) is a method for estimating group-specific dynamic panel models with spatial correlation while addressing fixed effect misspecification.
- It leverages pairwise Gaussian quasi-likelihoods and the DC algorithm to jointly optimize model parameters and latent group classifications in high-dimensional settings.
- Empirical results show high classification accuracy (>95%) and reduced bias, validating its use in complex spatial dynamic panel models.
Composite quasi-likelihood (CQL) refers to a quasi-likelihood methodology for estimation and classification in large dynamic panel models with group-specific heterogeneity and spatially dependent errors. Developed to address the computational infeasibility of the full likelihood in high-dimensional panels with unobserved group structure and spatial correlation, the CQL approach leverages composite (instrumental or marginal) likelihoods, paired with sophisticated optimization algorithms, to enable consistent and efficient parameter and group membership estimation in models subject to fixed effects misspecification and spatial dependence (Chu, 2017).
1. Model Framework and Specification
The foundational setting is a balanced panel , , . Each observational unit belongs to one of unknown latent groups, . Conditional on group membership, the core dynamic error-correction model is: where:
- is an individual fixed effect (which may be misspecified),
- are group-specific slope and adjustment parameters,
- is an innovation permitting both serial and (contemporaneous) spatial correlation.
Spatial error dependence is modeled as
where , is a spatial-weights matrix (diagonal elements zero), and is spherical, . This yields (Chu, 2017).
2. Composite Quasi-Likelihood Objective
The infeasibility of the -dimensional Gaussian likelihood at large motivates the use of a composite likelihood. Specifically, the CQL objective is formed by summing pairwise (bivariate) Gaussian quasi-log-likelihood contributions: where
- is the collection of all structural parameters,
- are user-specified weights (commonly ),
- is the Gaussian quasi-log-likelihood for , constructed using the 2×2 implied covariance from the spatial process (Chu, 2017).
3. Joint Parameter Estimation and Latent Group Classification
Both group memberships and model parameters are estimated jointly by maximizing the composite quasi-likelihood. This leads to the optimization problem: Introducing binary indicators , the problem becomes a nonconvex mixed-integer program (MIP), as the group assignments interact nonlinearly with both the likelihood and spatial covariance structure (Chu, 2017).
4. Optimization via Difference-of-Convex Decomposition and DCA
Chu demonstrates that , given fixed , can be decomposed as the difference of two proper convex functions in the continuous parameters: where captures convexified quadratic forms (log-determinant and trace terms, regularized as needed) and corrects over-regularization to restore the exact composite objective. This structure enables the application of the DC Algorithm (DCA):
- At each iteration with membership fixed, compute a subgradient of in .
- Minimize the resulting strongly convex surrogate to update .
- Optionally, update group memberships by re-assigning each to the group that minimizes its local objective, or perform a discrete optimization step.
- Alternate until convergence.
This iterative block-coordinate approach efficiently heuristically solves the MIP, providing a scalable solution route for high-dimensional panels (Chu, 2017).
5. Theoretical Properties
Under moderate regularity:
- Asymptotic unbiasedness and consistency of the slope parameters holds jointly as , even when individual fixed effects are arbitrary or misspecified.
- The estimator attains the oracle property: estimation and classification are such that the limiting distribution of the parameters matches that with perfect (oracle) knowledge of group memberships and fixed effects.
- Asymptotic normality holds for each group: where is group size and is the appropriate sandwich covariance (Chu, 2017).
6. Regularity Assumptions
Establishing these properties relies on:
- (A1) Stationarity and -mixing: is strictly stationary in and -mixing across with mixing coefficients decaying sufficiently fast.
- (A2) Group-heterogeneity: Each group contains a non-negligible share of units, and group-specific parameters are separated by at least in norm.
- (A3) Spatial-error process: is invertible, and the spatial process is -mixing in cross-section.
- (A4) Moments: Uniform fourth moment bounds on and ; parameters contained in a compact set (Chu, 2017).
7. Finite-Sample Behavior and Simulation Results
Monte Carlo experiments with , , and indicate:
- Classification accuracy of unit grouping exceeds 95% even at moderate .
- Bias and RMSE of decrease at a rate, consistent with theory.
- The CQL-DCA solution outperforms both naive pooled QMLE (ignoring groups) and pre-clustering (K-means plus two-step QMLE) on both parameter estimation bias and group assignment accuracy.
The empirical findings substantiate the theoretical properties, demonstrating both robustness to fixed effect misspecification and the computational viability of the overall estimation framework (Chu, 2017).
Chu's CQL methodology, by constructing a pairwise quasi-likelihood, embedding a mixed-integer group allocation, and applying DC algorithmic optimization, provides a rigorous, computationally tractable, and asymptotically efficient solution to high-dimensional, group-heterogeneous, spatially dependent dynamic panel models (Chu, 2017).