Papers
Topics
Authors
Recent
Search
2000 character limit reached

Bandit Profit-maximization for Targeted Marketing

Published 3 Mar 2024 in cs.LG, cs.GT, econ.GN, q-fin.EC, and q-fin.GN | (2403.01361v2)

Abstract: We study a sequential profit-maximization problem, optimizing for both price and ancillary variables like marketing expenditures. Specifically, we aim to maximize profit over an arbitrary sequence of multiple demand curves, each dependent on a distinct ancillary variable, but sharing the same price. A prototypical example is targeted marketing, where a firm (seller) wishes to sell a product over multiple markets. The firm may invest different marketing expenditures for different markets to optimize customer acquisition, but must maintain the same price across all markets. Moreover, markets may have heterogeneous demand curves, each responding to prices and marketing expenditures differently. The firm's objective is to maximize its gross profit, the total revenue minus marketing costs. Our results are near-optimal algorithms for this class of problems in an adversarial bandit setting, where demand curves are arbitrary non-adaptive sequences, and the firm observes only noisy evaluations of chosen points on the demand curves. For $n$ demand curves (markets), we prove a regret upper bound of $\tilde{O}(nT{3/4})$ and a lower bound of $\Omega((nT){3/4})$ for monotonic demand curves, and a regret bound of $\tilde{\Theta}(nT{2/3})$ for demands curves that are monotonic in price and concave in the ancillary variables.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (33)
  1. The multiplicative weights update method: A meta-algorithm and applications. Theory of Computing 8, 1 (2012), 121–164.
  2. Gambling in a rigged casino: The adversarial multi-armed bandit problem. Proceedings of IEEE 36th Annual Symposium on Foundations of Computer Science (1995), 322–331.
  3. The nonstochastic multiarmed bandit problem. SIAM J. Comput. 32, 1 (2002), 48–77.
  4. Omar Besbes and Assaf Zeevi. 2009. Dynamic pricing without knowing the demand function: Risk bounds and near-optimal algorithms. Operations Research 57, 6 (2009), 1407–1420.
  5. Omar Besbes and Assaf Zeevi. 2015. On the (surprising) sufficiency of linear models for dynamic pricing with demand learning. Management Science 61, 4 (2015), 723–739.
  6. Jean Bretagnolle and Catherine Huber. 1979. Estimation des densités: risque minimax. Zeitschrift für Wahrscheinlichkeitstheorie und verwandte Gebiete 47 (1979), 119–137.
  7. Regret analysis of stochastic and nonstochastic multi-armed bandit problems. Foundations and Trends® in Machine Learning 5, 1 (2012), 1–122.
  8. Kernel-based methods for bandit convex optimization. In Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing. 72–85.
  9. Dynamic pricing and demand learning with limited price experimentation. Operations Research 65, 6 (2017), 1722–1731.
  10. Online display advertising markets: A literature review and future directions. Information Systems Research 31, 2 (2020), 556–575.
  11. Thomas M Cover. 1999. Elements of information theory. John Wiley & Sons.
  12. Arnoud V Den Boer. 2015. Dynamic pricing and learning: Historical origins, current research, and new directions. Surveys in Operations Research and Management Science 20, 1 (2015), 1–18.
  13. Arnoud V den Boer and Bert Zwart. 2014. Simultaneously learning and optimizing using controlled variance pricing. Management Science 60, 3 (2014), 770–783.
  14. Online convex optimization in the bandit setting: gradient descent without a gradient. In Proceedings of the 16th Annual ACM-SIAM Symposium on Discrete Algorithms. 385–394.
  15. Website morphing 2.0: Switching costs, partial exposure, random exit, and when to morph. Management Science 60, 6 (2014), 1594–1616.
  16. Website morphing. Marketing Science 28, 2 (2009), 202–223.
  17. Elad Hazan et al. 2016. Introduction to online convex optimization. Foundations and Trends® in Optimization 2, 3-4 (2016), 157–325.
  18. Elad Hazan and Kfir Levy. 2014. Bandit convex optimization: Towards tight bounds. Advances in Neural Information Processing Systems 27 (2014), 784–792.
  19. Effective Adaptive Exploration of Prices and Promotions in Choice-Based Demand Models. Available at SSRN 4438537 (2023).
  20. Adel Javanmard. 2017. Perishability of data: dynamic pricing under varying-coefficient models. The Journal of Machine Learning Research 18, 1 (2017), 1714–1744.
  21. Adel Javanmard and Hamid Nazerzadeh. 2019. Dynamic pricing in high-dimensions. The Journal of Machine Learning Research 20, 1 (2019), 315–363.
  22. N Bora Keskin and Assaf Zeevi. 2014. Dynamic pricing with an unknown demand model: Asymptotically optimal semi-myopic policies. Operations Research 62, 5 (2014), 1142–1167.
  23. Robert Kleinberg and Tom Leighton. 2003. The value of knowing a demand curve: Bounds on regret for online posted-price auctions. In Proceedings of IEEE 44th Annual Symposium on Foundations of Computer Science. IEEE, 594–605.
  24. Tor Lattimore and Csaba Szepesvári. 2020. Bandit algorithms. Cambridge University Press.
  25. Gui Liberali and Alina Ferecatu. 2022. Morphing for consumer dynamics: Bandits meet hidden Markov models. Marketing Science 41, 4 (2022), 769–794.
  26. Dynamic online pricing with incomplete information using multiarmed bandit experiments. Marketing Science 38, 2 (2019), 226–252.
  27. Georgia Perakis and Divya Singhvi. 2023. Dynamic pricing with unknown nonparametric demand and limited price changes. Operations Research (2023).
  28. Ivan Png. 2022. Managerial economics. Routledge.
  29. Contextual multi-armed bandits for causal marketing. arXiv preprint arXiv:1810.01859 (2018).
  30. Customer acquisition via display advertising using multi-armed bandit experiments. Marketing Science 36, 4 (2017), 500–522.
  31. Morphing banner advertising. Marketing Science 33, 1 (2014), 27–46.
  32. Multimodal dynamic pricing. Management Science 67, 10 (2021), 6136–6152.
  33. Jianyu Xu and Yu-Xiang Wang. 2021. Logarithmic regret in feature-based dynamic pricing. Advances in Neural Information Processing Systems 34 (2021), 13898–13910.

Summary

No one has generated a summary of this paper yet.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.

Tweets

Sign up for free to view the 2 tweets with 0 likes about this paper.