Choice Models and Permutation Invariance: Demand Estimation in Differentiated Products Markets
Abstract: Choice modeling is central to understanding how changes in the competitive landscape affect consumer choices and reshape market equilibria. In this paper, we propose a fundamental characterization of choice functions that encompasses a wide variety of existing choice models. We show how non-parametric estimators such as neural nets can readily approximate such functionals and overcome the curse of dimensionality inherent in the non-parametric estimation of choice functions. Extensive simulations show that our proposed functionals flexibly capture underlying consumer behavior in a completely data-driven fashion and outperform traditional parametric models. Because demand settings often exhibit endogenous features, we extend our framework to accommodate them in estimation. We also describe a formal inference procedure for constructing valid confidence intervals on objects of interest such as price elasticities. Finally, to assess the practical applicability of our estimator, we use a real-world dataset from Berry, Levinsohn, and Pakes (1995). Our empirical analysis confirms that the estimator generates realistic own- and cross-price elasticities that are comparable to those reported in the existing literature.
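To make the permutation-invariance idea in the title concrete, the following is a minimal sketch and not the estimator proposed in the paper. It assumes, purely for illustration, that each product's utility index depends on its own features and on a sum-pooled embedding of its competitors' features, with shares given by a softmax that includes an outside option. All names (`market_shares`, `phi_w`, `rho_w`), the layer sizes, and the random untrained weights are hypothetical.

```python
import math
import random

def make_weights(rng, n_in, n_out):
    """Random dense-layer weights (hypothetical and untrained)."""
    return [[rng.gauss(0.0, 0.5) for _ in range(n_in)] for _ in range(n_out)]

def dense_tanh(weights, x):
    """One dense layer with a tanh activation."""
    return [math.tanh(sum(w * v for w, v in zip(row, x))) for row in weights]

def market_shares(features, phi_w, rho_w):
    """Return shares for J products; features[j] is product j's feature vector.

    Product j's utility index uses its own features plus a sum-pooled
    embedding of its competitors, so the index does not depend on the
    order in which the competitors are listed.
    """
    embeddings = [dense_tanh(phi_w, x) for x in features]
    total = [sum(col) for col in zip(*embeddings)]
    utilities = []
    for x, e in zip(features, embeddings):
        others = [t - v for t, v in zip(total, e)]  # pool over competitors only
        utilities.append(sum(w * v for w, v in zip(rho_w, x + others)))
    # Softmax with an outside option whose utility is normalized to 0.
    m = max(utilities + [0.0])
    expu = [math.exp(u - m) for u in utilities]
    denom = math.exp(-m) + sum(expu)
    return [e / denom for e in expu]

rng = random.Random(0)
n_features, n_hidden, n_products = 3, 4, 5
phi_w = make_weights(rng, n_features, n_hidden)
rho_w = [rng.gauss(0.0, 0.5) for _ in range(n_features + n_hidden)]
features = [[rng.uniform(-1, 1) for _ in range(n_features)]
            for _ in range(n_products)]

shares = market_shares(features, phi_w, rho_w)

# Relabel the products: position k now holds the original product perm[k].
perm = [3, 0, 4, 1, 2]
shares_perm = market_shares([features[i] for i in perm], phi_w, rho_w)
```

Under these assumptions, `shares_perm[k]` matches `shares[perm[k]]` up to floating-point rounding: relabeling the products only relabels the predicted shares. Because the pooling weights are shared across products, the parameter count does not grow with the number of products, which is one intuition for how such a structure could ease the curse of dimensionality mentioned in the abstract.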
- A method to estimate discrete choice models that is robust to consumer search. Technical report, National Bureau of Economic Research, 2020.
- J. Abrevaya. Rank estimation of a generalized fixed-effects regression model. Journal of Econometrics, 95(1):1–23, 2000.
- Asymptotic efficiency of semiparametric two-step GMM. Review of Economic Studies, 81(3):919–943, 2014.
- C. Ai and X. Chen. Estimation of possibly misspecified semiparametric conditional moment restriction models with different conditioning variables. Journal of Econometrics, 141(1):5–43, 2007.
- P. Albuquerque and B. J. Bronnenberg. Estimating demand heterogeneity using aggregated data: an application to the frozen pizza category. Marketing Science, 28(2):356–372, 2009.
- R. Allen and J. Rehbeck. Identification with additively separable heterogeneity. Econometrica, 87(3):1021–1054, 2019.
- A choice model for packaged goods: Dealing with discrete quantities and quantity discounts. Marketing Science, 23(1):95–108, 2004.
- A. Aouad and A. Désir. Representing random utility choice models with neural networks. arXiv preprint arXiv:2207.12877, 2022.
- E. Bakhitov and A. Singh. Causal gradient boosting: Boosted instrumental variable regression. In Proceedings of the 23rd ACM Conference on Economics and Computation, pages 604–605, 2022.
- Estimation of travel choice models with randomly distributed values of time. Transportation Research Record, 1413:88–97, 1993.
- Y. Bentz and D. Merunka. Neural networks and the multinomial logit for brand choice modelling: a hybrid approach. Journal of Forecasting, 19(3):177–200, 2000.
- S. Berry, J. Levinsohn, and A. Pakes. Automobile prices in market equilibrium. Econometrica: Journal of the Econometric Society, pages 841–890, 1995.
- Identification in differentiated products markets using market level data. Econometrica, 82(5):1749–1797, 2014.
- Nonparametric identification of differentiated products demand using micro data. Technical report, National Bureau of Economic Research, 2020.
- Logit demand estimation under competitive pricing behavior: An equilibrium framework. Management Science, 44(11-part-1):1533–1547, 1998.
- A Markov chain approximation to choice modeling. Operations Research, 64(4):886–905, 2016.
- Nonparametric estimation of a nonseparable demand function under the Slutsky inequality restriction. Review of Economics and Statistics, 99(2):291–304, 2017.
- The effect of fuel economy standards on the US automotive market: an hedonic demand analysis. Transportation Research Part A: General, 14(5-6):367–378, 1980.
- Nonparametric discrete choice models with unobserved heterogeneity. Journal of Business & Economic Statistics, 28(2):291–307, 2010.
- Deep learning for choice modeling. arXiv preprint arXiv:2208.09325, 2022.
- Measuring the societal impacts of automobile downsizing. Transportation Research Part A: General, 14(5-6):423–434, 1980.
- S. Chatterjee and J. Jafarov. Prediction error of cross-validated lasso. arXiv preprint arXiv:1502.06291, 2015.
- X. Chen and T. M. Christensen. Optimal sup-norm rates and uniform inference on nonlinear functionals of nonparametric iv regression. Quantitative Economics, 9(1):39–84, 2018.
- Automatic debiased machine learning via neural nets for generalized linear regression. arXiv preprint arXiv:2104.14737, 2021.
- Locally robust semiparametric estimation. Econometrica, 90(4):1501–1535, 2022a.
- Automatic debiased machine learning of causal and structural effects. Econometrica, 90(3):967–1027, 2022b.
- P. K. Chintagunta. Endogeneity and heterogeneity in a probit demand model: Estimation using aggregate data. Marketing Science, 20(4):442–456, 2001.
- Nonparametric demand estimation in the presence of unobserved factors. Available at SSRN, 2022.
- G. Compiani. Market counterfactuals and the specification of multiproduct demand: A nonparametric approach. Quantitative Economics, 13(2):545–591, 2022.
- C. Conlon and J. Gortmaker. Best practices for differentiated products demand estimation with PyBLP. The RAND Journal of Economics, 51(4):1108–1161, 2020.
- Application and interpretation of nested logit models of intercity mode choice. Transportation research record, (1413), 1993.
- M. Fosgerau and D. Kristensen. Identification of a class of index models: A topological approach. The Econometrics Journal, 24(1):121–133, 2021.
- J. T. Fox and A. Gandhi. Nonparametric identification and estimation of random coefficients in multinomial choice models. The RAND Journal of Economics, 47(1):118–139, 2016.
- Doubly robust estimation of causal effects. American journal of epidemiology, 173(7):761–767, 2011.
- X. Gabaix. Behavioral inattention. In Handbook of behavioral economics: Applications and foundations 1, volume 2, pages 261–343. Elsevier, 2019.
- S. Gabel and A. Timoshenko. Product choice with large assortments: A scalable deep-learning model. Management Science, 68(3):1808–1827, 2022.
- A. Gandhi and J.-F. Houde. Measuring substitution patterns in differentiated-products industries. NBER Working paper, (w26375), 2019.
- M. S. Goeree. Limited information and advertising in the US personal computer industry. Econometrica, 76(5):1017–1074, 2008.
- A latent class model for discrete choice analysis: contrasts with mixed logit. Transportation Research Part B: Methodological, 37(8):681–698, 2003.
- L. Grigolon and F. Verboven. Nested logit or random coefficients logit? a comparison of alternative discrete choice models of product differentiation. Review of Economics and Statistics, 96(5):916–935, 2014.
- Universal approximation of symmetric and anti-symmetric functions. arXiv preprint arXiv:1912.01765, 2019.
- A neural-embedded discrete choice model: Learning taste representation with strengthened interpretability. Transportation Research Part B: Methodological, 163:166–186, 2022.
- Individual heterogeneity and average welfare. Econometrica, 84(3):1225–1248, 2016.
- A conditional probit model for qualitative choice: Discrete decisions recognizing interdependence and heterogeneous preferences. Econometrica: Journal of the econometric society, pages 403–426, 1978.
- D. A. Hirshberg and S. Wager. Debiased inference of average partial effects in single-index models: Comment on Wooldridge and Zhu. Journal of Business & Economic Statistics, 38(1):19–24, 2020.
- Empirical search and consideration sets. In Handbook of the Economics of Marketing, volume 1, pages 193–257. Elsevier, 2019.
- B. E. Honoré and E. Kyriazidou. Panel data discrete choice models with lagged dependent variables. Econometrica, 68(4):839–874, 2000.
- A. Hortaçsu and C. Syverson. Product differentiation, search costs, and competition in the mutual fund industry: A case study of S&P 500 index funds. The Quarterly Journal of Economics, 119(2):403–456, 2004.
- H. Ichimura and W. K. Newey. The influence function of semiparametric estimators. Quantitative Economics, 13(1):29–61, 2022.
- Consumer search and purchase: An empirical investigation of retargeting based on consumer online behaviors. Marketing Science, 40(2):219–240, 2021.
- J. Joo. Rational inattention as an empirical framework for discrete choice and consumer-welfare evaluation. Journal of Marketing Research, 60(2):278–298, 2023.
- A probabilistic choice model for market segmentation and elasticity structure. Journal of marketing research, 26(4):379–390, 1989.
- Inference on semiparametric multinomial response models. Quantitative Economics, 12(3):743–777, 2021.
- Online demand under limited consumer search. Marketing science, 29(6):1001–1023, 2010.
- A. Lewbel. Semiparametric qualitative response model estimation with unknown heteroscedasticity or instrumental variables. Journal of econometrics, 97(1):145–177, 2000.
- Semi-nonparametric estimation of random coefficients logit model for aggregate demand. Journal of Econometrics, 2023.
- R. D. Luce. On the possible psychophysical laws. Psychological review, 66(2):81, 1959.
- C. F. Manski. Semiparametric analysis of random effects linear models from binary panel data. Econometrica: Journal of the Econometric Society, pages 357–362, 1987.
- J. Marschak. Binary choice constraints on random utility indicators. 1959.
- D. McFadden and K. Train. Mixed MNL models for discrete response. Journal of Applied Econometrics, 15(5):447–470, 2000.
- D. McFadden et al. Conditional logit analysis of qualitative choice behavior. 1973.
- S. R. Mehndiratta. Time-of-day effects in inter-city business travel. University of California, Berkeley, 1996.
- Price uncertainty and consumer search: A structural model of consideration set formation. Marketing science, 22(1):58–84, 2003.
- A. Nevo. Mergers with differentiated products: The case of the ready-to-eat cereal industry. The RAND Journal of Economics, pages 395–421, 2000.
- A. Nevo. New products, quality changes, and welfare measures computed from estimated demand systems. Review of Economics and statistics, 85(2):266–275, 2003.
- W. K. Newey. The asymptotic variance of semiparametric estimators. Econometrica: Journal of the Econometric Society, pages 1349–1382, 1994.
- A. Pakes and J. Porter. Moment inequalities for multinomial choice with fixed effects. Quantitative Economics, Forthcoming, 2022.
- A. Petrin. Quantifying the benefits of new products: The case of the minivan. Journal of political Economy, 110(4):705–729, 2002.
- A. Petrin and K. Train. A control function approach to endogeneity in consumer choice models. Journal of marketing research, 47(1):3–13, 2010.
- D. Revelt and K. Train. Mixed logit with repeated choices: households’ choices of appliance efficiency level. Review of economics and statistics, 80(4):647–657, 1998.
- Estimation of regression coefficients when some regressors are not always observed. Journal of the American statistical Association, 89(427):846–866, 1994.
- A. Sannai and M. Imaizumi. Improved generalization bound of permutation invariant deep neural networks. 2019.
- Estimating semi-parametric panel multinomial choice models using cyclic monotonicity. Econometrica, 86(2):737–761, 2018.
- Enhancing discrete choice models with representation learning. Transportation Research Part B: Methodological, 140:236–261, 2020.
- Machine learning instrument variables for causal inference. In Proceedings of the 21st ACM Conference on Economics and Computation, pages 835–836, 2020.
- K. Sudhir. Competitive pricing behavior in the auto market: A structural analysis. Marketing Science, 20(1):42–60, 2001.
- Nonparametric estimates of demand in the California health insurance exchange. Econometrica, 91(1):107–146, 2023.
- L. L. Thurstone. A law of comparative judgment. Psychological review, 34(4):273, 1927.
- K. Train. A comparison of hierarchical Bayes and maximum simulated likelihood for mixed logit. University of California, Berkeley, pages 1–13, 2001.
- K. E. Train. Discrete choice methods with simulation. Cambridge university press, 2009.
- The demand for local telephone service: A fully discrete model of residential calling patterns and service choices. The RAND Journal of Economics, pages 109–123, 1987.
- Discrete choice in marketing through the lens of rational inattention. 2023.
- Retrieving unobserved consideration sets from household panel data. Journal of Marketing Research, 47(1):63–74, 2010.
- On the limitations of representing functions on sets. In International Conference on Machine Learning, pages 6487–6494. PMLR, 2019.
- M. J. Wainwright. High-dimensional statistics: A non-asymptotic viewpoint, volume 48. Cambridge university press, 2019.
- A. Wang. Sieve blp: A semi-nonparametric model of demand for differentiated products. Journal of Econometrics, 235(2):325–351, 2023.
- Deep neural networks for choice analysis: Extracting complete economic information for interpretation. Transportation Research Part C: Emerging Technologies, 118:102701, 2020.
- Y. Wei and Z. Jiang. Estimating parameters of structural models using neural networks. USC Marshall School of Business Research Paper, 2022.
- M. Weitzman. Optimal search for the best alternative, volume 78. Department of Energy, 1978.
- T. G. Wollmann. Trucks without bailouts: Equilibrium product characteristics for commercial vehicles. American Economic Review, 108(6):1364–1406, 2018.
- M. Wong and B. Farooq. Reslogit: A residual neural network logit model for data-driven choice modelling. Transportation Research Part C: Emerging Technologies, 126:103050, 2021.
- Deep sets. Advances in neural information processing systems, 30, 2017.