- The paper demonstrates that production functions can be identified nonparametrically without assuming a first-order Markov productivity process, significantly reducing bias in output elasticities.
- It employs a cross-sectional conditional independence restriction among flexible input shocks combined with spectral methods and a three-block GMM estimator.
- Empirical and simulation evidence shows that the proposed estimator yields lower, more accurate markups and mitigates bias compared to conventional Markov-based methods.
Nonparametric Identification and Estimation of Production Functions Invariant to Productivity Dynamics
Introduction and Problem Statement
The paper "Nonparametric Identification and Estimation of Production Functions Invariant to Productivity Dynamics" (2604.04458) addresses fundamental limitations in the canonical identification strategy for firm-level production function parameters. Since Olley and Pakes (1996), nearly all state-of-the-art estimators (including Levinsohn and Petrin, Ackerberg–Caves–Frazer (ACF), Gandhi–Navarro–Rivers (GNR), and their dynamic panel analogues) impose that the unobserved productivity process follows a first-order Markov process. The Markov property is not ancillary—it is the identification device that pins down the output elasticity of flexible inputs via the structure of the productivity transition equation.
This paper demonstrates both theoretically and empirically that the first-order Markov restriction is not necessary for identification. More critically, it documents that Markov misspecification induces persistent, substantial bias in estimated output elasticities and all policy-relevant downstream quantities (markups, measured misallocation, treatment effects in event studies). The author proposes a new identification strategy that replaces the Markov restriction with a cross-sectional conditional independence restriction: mutual independence of input demand shocks across three distinct flexible inputs within a given cross-section. Using the spectral identification strategy of Hu and Schennach (2008), the paper gives a nonparametric identification result that holds without any assumptions on the evolution of productivity, and constructs a GMM estimator consistent under arbitrary productivity dynamics.
Identification Theory and Assumptions
The production technology is modeled by a canonical gross output function
yjt​=ft​(kjt​,ljt​,mjt​,ejt​,wjt​,ωjt​)+εjt​
where kjt​ and ljt​ are predetermined primary inputs (capital, labor), mjt​, ejt​, wjt​ are three observable flexible intermediate inputs (materials, electricity, water), ωjt​ is the latent productivity, and εjt​ the ex-post production shock.
The paper's main identifying condition is not on the law of motion of ωjt​ (as in Olley–Pakes/ACF/GNR), but instead posits that, conditional on the latent ωjt​ and observable state kjt​0, the input-specific demand shocks kjt​1 affecting materials, electricity, and water are mutually independent: kjt​2
No restriction is placed on the stochastic process for kjt​3: the identification argument is entirely cross-sectional and static. The author exploits the mutual independence to use spectral methods (Hu–Schennach decomposition), leveraging three noisy signals (input demands) of a common latent (kjt​4) to recover the conditional density and thus nonparametrically identify both the production function and the productivity distribution.
Regularity conditions (injectivity of the measurement models, distinct eigenvalues, normalization) are formally stated and interpreted. The conditional independence is motivated both economically (segmentation of input markets yields input-specific demand shocks beyond what can be absorbed with observed controls) and through microfoundations (e.g., cost minimization with input-specific markdowns).
The theory includes a complete characterization of observational equivalence: any two structures kjt​5 yielding the same observables can differ only by a location shift in kjt​6 that is an arbitrary continuous function of primary inputs kjt​7. This "location indeterminacy" is the only ambiguity created by the fully static assumption set.
To close this indeterminacy, two alternatives are developed:
- Nonparametric point identification by exclusion restrictions: if at least one input's demand is driven solely by productivity and not by capital/labor, point identification is restored.
- Parametric point identification by curvature: if the expected productivity conditional on kjt​8 exhibits sufficient nonlinearity (e.g., homothetic CES curvature), this regularity pins down the scale, and the entire vector of elasticities is point identified.
Estimation Methodology
Given identification, the author develops a three-block GMM estimator. The first block (A) leverages cross-equation orthogonality (elimination of kjt​9 across residuals); block (B) exploits cross-covariances implied by the independence structure; block (C) imposes moment conditions arising from parameterization of the curvature in expected productivity as a function of ljt​0. In the absence of the latter, the procedure consistently identifies the intermediate input elasticities but not ljt​1. With both, the full vector is identified.
The estimator is computationally tractable and written for panel data, but relies only on cross-sectional moment conditions, so is robust to arbitrary time-series patterns of productivity, including non-Markovian dynamics, regime shifts, aggregate shocks, and potential outcomes frameworks where treatment affects the law of motion for ljt​2.
Monte Carlo Evidence
High-dimensional simulations document that all current Markov-based estimators (ACF, GNR, variants) exhibit persistent bias when their Markov restriction is violated, even as ljt​3. For materials elasticity, the upward bias can reach ljt​4 (63% of true value) under dynamically misspecified environments, leading to grossly inflated markups and misleading treatment effect estimates. In contrast, the proposed estimator yields negligible bias across simulated AR(1), AR(2), and potential outcome dynamics, demonstrating robustness to dynamic misspecification.
Simulations also examine conditional independence violations (e.g., correlated utility demand shocks arising from common energy/water cost shocks): moderate violations induce upward bias in the materials elasticity, but the magnitude is quantitatively much smaller than the Markov misspecification bias. Thus, the empirical finding that the proposed estimator often yields lower markups than standard methods cannot be explained by conditional independence failure but instead points to Markov misspecification in mainstream approaches.
Empirical Evidence and Specification Testing
Empirical implementation uses 502 Japanese manufacturing industries, leveraging census panels containing physical inputs for materials, electricity, and water. The estimator yields systematically lower materials elasticities, and thus lower markups, than ACF across the entire industry distribution (median markup 0.93 vs. 1.03 under ACF). The share of industries with markups above unity drops from 54% to 37%. Empirical productivity effects of large policy shocks—such as the 2011 Tohoku earthquake—are substantially attenuated: the ACF estimator overstates productivity loss by ljt​5 percentage points (or about ljt​6 billion USD/year).
Specification diagnostics are developed:
- Pairwise comparison of exclusion-based and curvature-based routes to identifying capital and labor elasticities shows high concordance for capital but reveals consistent exclusion failure for labor, due to its marginal adjustability within the time structure of manufacturing.
- Parameters recovered via block C (homothetic regularity) are consistent across the sample when CES curvature is significant.
- Ancillary regressions document that productivity residuals under the proposed approach are more strongly associated with log wage rates, suggesting improved signal-to-noise over ACF.
The conditional independence restriction is empirically supported in the input triplet studied (materials/electricity/water), both by institutional context and by formal testable implications.
Implications and Future Directions
Theoretical
Dropping the Markov assumption in proxy variable approaches involves no loss of identification power for the production function or for the productivity distribution conditional on observed signals. Identification is achieved cross-sectionally using economic structure and measurement error arguments, not dynamic law-of-motion restrictions. The paper provides a complete characterization of what can and cannot be identified without Markovity and presents generic sufficient conditions for closing the remaining gaps. The framework robustly nests earlier methods but does not rely on scalar unobservability, thus accommodating persistent, input-specific nonparametric heterogeneity (e.g., markdowns, procurement frictions) commonly present in administrative data.
Empirical
Markov misspecification induces systematic, economically meaningful upward bias in output elasticities for flexible inputs (especially materials), which propagates directly to overestimated firm-level markups, measured misallocation, and treatment effects in policy evaluation. The consequences are first-order: reinterpretation of the magnitude and prevalence of market power, and thus policy evaluation based on previous mainstream estimates, is warranted. Because the estimator is invariant to productivity dynamics, it supports truly dynamic panel studies, structural break/event study designs, and nonstationary environments, with consistent estimation of production function parameters even when productivity is an object of policy intervention.
Future Work
Open directions include: (i) generalizing to settings with only two observable flexible inputs by exploiting rapid labor adjustment (as in certain services or industries with casualized labor); (ii) implementing over-identification-robust specification tests via block C or structural restrictions, to empirically evaluate input demand independence; (iii) extending to non-Hicks-neutral, factor-augmenting, or higher-order flexible functional forms; (iv) scaling the framework to address international patterns, exploring the sensitivity of global markup estimates to dynamic identification assumptions.
Conclusion
The paper establishes that the standard Markov assumption embedded in nearly all contemporary proxy variable production function estimators is not only unnecessary for structural identification, but is a major source of bias—often exaggerating fundamental parameters such as materials elasticity and industry markups. By instead invoking conditional independence among input-specific demand shocks for multiple flexible inputs, nonparametric identification is achieved from static, cross-sectional data alone. The consequences for both empirical measurement and structural policy evaluation are substantial: measured market power, productivity decompositions, and treatment effect estimates in event studies are all highly sensitive to the identification assumption. The framework substantially expands the reliability and scope of semi/nonparametric production function estimation, without the risk of misspecified productivity dynamics inhering in prior protocols.
Citation: "Nonparametric Identification and Estimation of Production Functions Invariant to Productivity Dynamics" (2604.04458)