View : 318 Download: 0

Adaptive inventory replenishment using structured reinforcement learning by exploiting a policy structure

Title
Adaptive inventory replenishment using structured reinforcement learning by exploiting a policy structure
Authors
ParkHyungjunChoiDong GuMinDaiki
Ewha Authors
민대기
SCOPUS Author ID
민대기scopus
Issue Date
2023
Journal Title
International Journal of Production Economics
ISSN
0925-5273JCR Link
Citation
International Journal of Production Economics vol. 266
Keywords
Inventory replenishment policyReinforcement learningStochastic approximationStructural properties
Publisher
Elsevier B.V.
Indexed
SCIE; SCOPUS WOS scopus
Document Type
Article
Abstract
We consider an inventory replenishment problem with unknown and non-stationary demand. We design a structured reinforcement learning algorithm that efficiently adapts the replenishment policy to changing demand without any prior knowledge. Our proposed method integrates the known structural properties of a well-performing inventory replenishment policy with reinforcement learning. By exploiting the policy structure, we tune reinforcement learning to characterize the inventory replenishment policy and approximate the value function. In particular, we propose two methods for stochastic approximation on the gradient of the objective function. These novel reinforcement learning algorithms ensure an efficient convergence rate and lower algorithmic complexity for solving practical problems. The numerical results demonstrate that the proposed algorithms adaptively update the policy to changing demand and lower inventory costs compared to various benchmarks. We also conduct a numerical validation for a South Korean retail shop to validate the practical feasibility of the proposed method. Understanding the policy structure is beneficial for designing reinforcement learning algorithms that can address the inventory replenishment problem. These well-designed reinforcement learning algorithms are particularly promising when we require policy updates based on observations without precise knowledge of non-stationary demand. These research findings could be extended to address the various inventory decisions in which policy structures are available. © 2023
DOI
10.1016/j.ijpe.2023.109029
Appears in Collections:
경영대학 > 경영학전공 > Journal papers
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML


qrcode

BROWSE