Data-Driven Long-Term Asset Allocation with Tsallis Entropy Regularization
Abstract: This paper addresses the problem of dynamic asset allocation under uncertainty, which can be formulated as a linear quadratic (LQ) control problem with multiplicative noise. To handle exploration exploitation trade offs and induce sparse control actions, we introduce Tsallis entropy as a regularization term. We develop an entropy regularized policy iteration scheme and provide theoretical guarantees for its convergence. For cases where system dynamics are unknown, we further propose a fully data driven algorithm that estimates Q functions using an instrumental variable least squares approach, allowing efficient and stable policy updates. Our framework connects entropy-regularized stochastic control with model free reinforcement learning, offering new tools for intelligent decision making in finance and automation.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.