Dynamic Assortment Optimization with Changing Contextual Information
October 31, 2018 Β· Declared Dead Β· π Journal of machine learning research
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Xi Chen, Yining Wang, Yuan Zhou
arXiv ID
1810.13069
Category
econ.EM
Cross-listed
cs.LG,
stat.ML
Citations
56
Venue
Journal of machine learning research
Last Checked
3 months ago
Abstract
In this paper, we study the dynamic assortment optimization problem under a finite selling season of length $T$. At each time period, the seller offers an arriving customer an assortment of substitutable products under a cardinality constraint, and the customer makes the purchase among offered products according to a discrete choice model. Most existing work associates each product with a real-valued fixed mean utility and assumes a multinomial logit choice (MNL) model. In many practical applications, feature/contexutal information of products is readily available. In this paper, we incorporate the feature information by assuming a linear relationship between the mean utility and the feature. In addition, we allow the feature information of products to change over time so that the underlying choice model can also be non-stationary. To solve the dynamic assortment optimization under this changing contextual MNL model, we need to simultaneously learn the underlying unknown coefficient and makes the decision on the assortment. To this end, we develop an upper confidence bound (UCB) based policy and establish the regret bound on the order of $\widetilde O(d\sqrt{T})$, where $d$ is the dimension of the feature and $\widetilde O$ suppresses logarithmic dependence. We further established the lower bound $Ξ©(d\sqrt{T}/K)$ where $K$ is the cardinality constraint of an offered assortment, which is usually small. When $K$ is a constant, our policy is optimal up to logarithmic factors. In the exploitation phase of the UCB algorithm, we need to solve a combinatorial optimization for assortment optimization based on the learned information. We further develop an approximation algorithm and an efficient greedy heuristic. The effectiveness of the proposed policy is further demonstrated by our numerical studies.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β econ.EM
R.I.P.
π»
Ghosted
R.I.P.
π»
Ghosted
Machine Learning Advances for Time Series Forecasting
R.I.P.
π»
Ghosted
Deep Neural Networks for Estimation and Inference
R.I.P.
π»
Ghosted
Take a Look Around: Using Street View and Satellite Images to Estimate House Prices
R.I.P.
π»
Ghosted
Discrete Choice and Rational Inattention: a General Equivalence Result
R.I.P.
π»
Ghosted
Estimating Heterogeneous Consumer Preferences for Restaurants and Travel Time Using Mobile Location Data
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted