Misspecified Engle-Granger Test
- The misspecified Engle–Granger test is a methodological error in which the procedure is applied to first differences of I(1) series instead of their levels, falsely indicating cointegration.
- This misapplication produces false-positive rates approaching 100% in simulation studies, undermining the validity of long-run equilibrium analysis.
- Empirical studies in migration and labor market contexts reveal that using differenced data for cointegration tests leads to misleading policy implications.
A misspecified Engle–Granger test refers to the incorrect application of the Engle–Granger (EG) cointegration procedure, wherein the test is applied to first differences of I(1) series rather than their levels. This practice artificially guarantees a rejection of the null hypothesis of no cointegration, yielding spurious statistical evidence for long-run equilibrium relationships that do not exist. Recent critiques demonstrate that such misspecification results in false positive rates approaching 100 percent in simulation studies, undermining any subsequent inference concerning cointegration in macroeconomic or migration data (Rodríguez et al., 24 Dec 2025, Rodriguez et al., 24 Dec 2025).
1. Standard Engle–Granger Procedure
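The canonical Engle–Granger two-step procedure is designed to assess cointegration between two I(1) time series $y_t$ and $x_t$:
- Stage 1 (levels regression): Estimate $y_t = \alpha + \beta x_t + u_t$ via OLS, obtaining residuals $\hat{u}_t$.
- Stage 2 (ADF test): Test the residuals for a unit root:
$$\Delta \hat{u}_t = \rho\,\hat{u}_{t-1} + \sum_{i=1}^{p} \gamma_i\,\Delta \hat{u}_{t-i} + \varepsilon_t$$
The null hypothesis is $H_0: \rho = 0$ (no cointegration); the alternative is $H_1: \rho < 0$ (cointegration).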
The test statistic follows a nonstandard distribution under the null; critical values are provided by MacKinnon (2010) and depend on the deterministic terms in the test regression, e.g., a model without trend or intercept (Rodríguez et al., 24 Dec 2025).
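The two stages can be sketched in plain Python. This is a minimal illustration, not the implementation used in the cited work: it assumes an intercept in the levels regression, a Dickey–Fuller regression with no lag augmentation, and an approximate asymptotic 5% critical value of -3.34 for two variables with a constant. The function names (`ols_slope_intercept`, `df_tstat`, `engle_granger_tau`, `random_walk`) and the simulated data are hypothetical.

```python
import random


def ols_slope_intercept(x, y):
    """Closed-form OLS of y on a constant and x; returns (alpha, beta)."""
    n = len(x)
    mx = sum(x) / n
    my = sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    beta = sxy / sxx
    return my - beta * mx, beta


def df_tstat(u):
    """t-statistic on rho in du_t = rho * u_{t-1} + e_t (no constant, no lags)."""
    lagged = u[:-1]
    diff = [u[t] - u[t - 1] for t in range(1, len(u))]
    s_ll = sum(v * v for v in lagged)
    rho = sum(l * d for l, d in zip(lagged, diff)) / s_ll
    resid = [d - rho * l for l, d in zip(lagged, diff)]
    sigma2 = sum(e * e for e in resid) / (len(diff) - 1)
    return rho / (sigma2 / s_ll) ** 0.5


def engle_granger_tau(y, x):
    """Stage 1: regress y on x in levels. Stage 2: DF test on the residuals."""
    alpha, beta = ols_slope_intercept(x, y)
    resid = [yi - alpha - beta * xi for xi, yi in zip(x, y)]
    return df_tstat(resid)


def random_walk(n, rng):
    """Simulate an I(1) series as a cumulative sum of Gaussian shocks."""
    level, path = 0.0, []
    for _ in range(n):
        level += rng.gauss(0.0, 1.0)
        path.append(level)
    return path


rng = random.Random(0)
x = random_walk(200, rng)
# y is cointegrated with x by construction: y - 2x is white noise.
y = [2.0 * xi + rng.gauss(0.0, 1.0) for xi in x]

CRIT_5PCT = -3.34  # assumed approximate asymptotic 5% critical value
tau = engle_granger_tau(y, x)
print(f"tau = {tau:.2f}, reject H0 of no cointegration: {tau < CRIT_5PCT}")
```

Because the residual-based statistic does not follow the standard Dickey–Fuller distribution, the critical value has to come from cointegration-specific tables. In applied work it should also match the number of regressors and the deterministic terms actually used.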
2. Theoretical Basis for Cointegration Testing in Levels
Cointegration tests are meaningful only when applied to the levels of I(1) series. If $y_t$ and $x_t$ are I(1) but a linear combination $u_t = y_t - \beta x_t$ is I(0), then $y_t$ and $x_t$ are said to be cointegrated. The stationarity of $u_t$ implies a long-run equilibrium relationship.
- I(1): A process $z_t$ is integrated of order one if it is nonstationary in levels but its first difference $\Delta z_t = z_t - z_{t-1}$ is I(0) (stationary).
- I(0): Weakly stationary (constant mean and variance, autocovariances that depend only on the lag).
Testing for cointegration among first differences ($\Delta y_t$, $\Delta x_t$) is invalid because these series are I(0) by construction. Any regression between I(0) series produces I(0) residuals, so the cointegration test, when misapplied to differences, will essentially always indicate cointegration regardless of the true data-generating process (Rodríguez et al., 24 Dec 2025).
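A short worked argument shows why the rejection is mechanical. It rests on simplifying assumptions that are not taken from the source: two independent random walks with white-noise increments $\varepsilon_t$ and $\eta_t$ of variance $\sigma_\varepsilon^2$ and $\sigma_\eta^2$, a differenced regression $\Delta y_t = a + b\,\Delta x_t + e_t$, a Dickey–Fuller regression on $\hat{e}_t$ with no lag augmentation, and a sample of size $T$.

```latex
\begin{aligned}
\Delta y_t &= \varepsilon_t, \qquad \Delta x_t = \eta_t, \qquad
  \varepsilon_t,\ \eta_t \ \text{independent white noise},\\
\hat{e}_t &= \Delta y_t - \hat{a} - \hat{b}\,\Delta x_t \approx \varepsilon_t
  \qquad (\hat{a},\ \hat{b} \xrightarrow{p} 0),\\
\hat{\rho} &= \frac{\sum_t \hat{e}_{t-1}\,(\hat{e}_t - \hat{e}_{t-1})}{\sum_t \hat{e}_{t-1}^{2}}
  \xrightarrow{p} \frac{0 - \sigma_\varepsilon^{2}}{\sigma_\varepsilon^{2}} = -1,\\
\operatorname{se}(\hat{\rho}) &\approx \sqrt{\frac{\sigma_\varepsilon^{2}}{T\,\sigma_\varepsilon^{2}}} = \frac{1}{\sqrt{T}},
  \qquad
\tau = \frac{\hat{\rho}}{\operatorname{se}(\hat{\rho})} \approx -\sqrt{T}.
\end{aligned}
```

Under these assumptions a sample of $T = 100$ gives $\tau \approx -10$, far beyond any conventional critical value, even though the two series are unrelated. Adding lags to the test regression can change the magnitude of $\tau$, but the residuals remain I(0) by construction.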
3. Consequences of Misspecification
Misspecification arises from first-differencing I(1) series before cointegration testing:
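- Misapplied steps:
  - Regress $\Delta y_t$ on $\Delta x_t$: $\Delta y_t = a + b\,\Delta x_t + e_t$, obtaining residuals $\hat{e}_t$.
  - Test $\hat{e}_t$ for a unit root via ADF: $\Delta \hat{e}_t = \rho\,\hat{e}_{t-1} + \sum_{i=1}^{p} \gamma_i\,\Delta \hat{e}_{t-i} + \varepsilon_t$.
- False positives: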
Under general conditions, the residual is stationary (I(0)), so the subsequent ADF test always rejects the null of no cointegration. Monte Carlo simulations using independent random walks confirm a 100% empirical rejection rate for the misspecified test, compared to the nominal 5% rate for the correctly specified test (Rodríguez et al., 24 Dec 2025). Table 1 summarizes these rates:
| Test Specification | Mean EG Statistic (τ) | Rejection Rate (5%) |
|---|---|---|
| EG on levels (correct) | –2.078 (0.843) | 5.3% |
| EG on first differences | –6.956 (1.055) | 100% |
Figure 1 in Rodríguez–Bravo illustrates how the distribution of the EG statistic $\tau$ shifts sharply to the left in the misspecified test.
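The contrast in rejection rates can be reproduced qualitatively with a small Monte Carlo experiment. The sketch below is not the simulation design of the cited study: the sample size, replication count, Dickey–Fuller regression without lags, and the approximate 5% critical value of -3.34 are all assumptions chosen for illustration, and every function name is hypothetical. The resulting numbers will therefore differ from those in Table 1.

```python
import random


def ols_residuals(x, y):
    """Residuals from an OLS regression of y on a constant and x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((a - mx) ** 2 for a in x)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    beta = sxy / sxx
    alpha = my - beta * mx
    return [b - alpha - beta * a for a, b in zip(x, y)]


def df_tstat(u):
    """t-statistic on rho in du_t = rho * u_{t-1} + e_t (no constant, no lags)."""
    lagged = u[:-1]
    diff = [u[t] - u[t - 1] for t in range(1, len(u))]
    s_ll = sum(v * v for v in lagged)
    rho = sum(l * d for l, d in zip(lagged, diff)) / s_ll
    sigma2 = sum((d - rho * l) ** 2 for l, d in zip(lagged, diff)) / (len(diff) - 1)
    return rho / (sigma2 / s_ll) ** 0.5


def difference(z):
    """First difference of a series."""
    return [z[t] - z[t - 1] for t in range(1, len(z))]


def random_walk(n, rng):
    """Simulate an I(1) series as a cumulative sum of Gaussian shocks."""
    level, path = 0.0, []
    for _ in range(n):
        level += rng.gauss(0.0, 1.0)
        path.append(level)
    return path


def rejection_rates(n_obs=100, n_reps=500, crit=-3.34, seed=0):
    """Share of replications rejecting 'no cointegration' for two INDEPENDENT walks."""
    rng = random.Random(seed)
    reject_levels = 0
    reject_diffs = 0
    for _ in range(n_reps):
        y = random_walk(n_obs, rng)
        x = random_walk(n_obs, rng)
        # Correct specification: residuals from the levels regression.
        if df_tstat(ols_residuals(x, y)) < crit:
            reject_levels += 1
        # Misspecification: residuals from the regression in first differences.
        if df_tstat(ols_residuals(difference(x), difference(y))) < crit:
            reject_diffs += 1
    return reject_levels / n_reps, reject_diffs / n_reps


levels_rate, diffs_rate = rejection_rates()
print(f"rejection rate, levels:      {levels_rate:.3f}")
print(f"rejection rate, differences: {diffs_rate:.3f}")
```

Since the two series are independent by construction, every rejection is a false positive. The levels test should reject at roughly its nominal size, while the test on differences should reject in nearly every replication.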
4. Empirical Examples: Migration and Labor Market Applications
Case studies in recent literature have exposed widespread consequences of misspecified Engle–Granger tests:
- Bahar and Hausmann (2025): The apparent cointegration between Venezuelan oil revenues and migration flows results from running the cointegration test on first differences. Applying the correctly specified test to the levels of the logged variables fails to reject the null of no cointegration in all variants of the test. Consequently, the subsequent long-run and error-correction estimations lack foundation (Rodríguez et al., 24 Dec 2025).
- Bahar (2025): Cointegration between US job vacancies and Southwest border crossings is similarly artifactual, created by applying EG tests to differenced series. Replication of the correct EG procedure on levels yields test statistics between –1.85 and –4.06, with only one in twelve specifications rejecting at the 5% threshold (Rodriguez et al., 24 Dec 2025). Absent genuine cointegration, the entire approach to estimating short- and long-run elasticities is uninformative.
5. Methodological Implications
The prevalence of misspecified cointegration testing has several implications for empirical practice:
- Test for integration order: Always pre-test each series for I(0) versus I(1).
- Apply cointegration tests to levels: Cointegration frameworks (EG, Johansen) should only be applied to I(1) levels, not differences.
- Avoid pre-differencing: Differencing before cointegration testing induces spurious findings.
- Post-cointegration estimation: Upon finding genuine cointegration, estimate the long-run vector using bias-corrected estimators (FM-OLS, DOLS).
- Include deterministic terms: Add drift, trend, or seasonal dummies as justified by theory.
- Critical values: Employ appropriate critical values (e.g., MacKinnon tables), matching test specification.
A plausible implication is that results from models premised on spurious cointegration cannot be trusted for policy analysis, as short-run and long-run elasticities derived from misspecified regressions have no equilibrium interpretation.
6. Controversy and Correction in Recent Literature
The misapplication of the Engle–Granger test has led to significant reversals of claimed relationships in the literature on migration and macroeconomic flows. Notably, Bahar and Hausmann’s core findings regarding the linkage of Venezuelan oil revenues with US migration, and Bahar’s estimated labor-market effects on migration, are invalidated on methodological grounds (Rodríguez et al., 24 Dec 2025, Rodriguez et al., 24 Dec 2025). In both cases, the absence of cointegration in the levels of the series nullifies policy claims about equilibrium relationships and adjustment mechanisms. This suggests a need for heightened diagnostic rigor and re-evaluation of empirical strategies in time-series econometrics.
7. Summary Table: Correct vs. Misspecified Engle–Granger Procedure
| Step | Correct EG Test (Levels) | Misspecified EG Test (First Differences) |
|---|---|---|
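| Regression | $y_t = \alpha + \beta x_t + u_t$ | $\Delta y_t = a + b\,\Delta x_t + e_t$ |
| Residual Unit Root | ADF test on $\hat{u}_t$ | ADF test on $\hat{e}_t$ |
| Statistical Outcome | Cointegration only if $\hat{u}_t$ is I(0) | Always finds “cointegration” |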
The misspecified Engle–Granger test on first differences rejects the null of no cointegration almost regardless of the underlying data, so its results support no substantive interpretation of cointegrating behavior or long-run elasticities in empirical applications.