Skip to main content

The big Data newsvendor: Practical insights from machine learning

Publication ,  Journal Article
Ban, GY; Rudin, C
Published in: Operations Research
January 1, 2019

We investigate the data-driven newsvendor problem when one has n observations of p features related to the demand as well as historical demand data. Rather than a two-step process of first estimating a demand distribution then optimizing for the optimal order quantity, we propose solving the “big data” newsvendor problem via single-step machine-learning algorithms. Specifically, we propose algorithms based on the empirical risk minimization (ERM) principle, with and without regularization, and an algorithm based on kernel-weights optimization (KO). The ERM approaches, equivalent to high-dimensional quantile regression, can be solved by convex optimization problems and the KO approach by a sorting algorithm. We analytically justify the use of features by showing that their omission yields inconsistent decisions. We then derive finite-sample performance bounds on the out-of-sample costs of the feature-based algorithms, which quantify the effects of dimensionality and cost parameters. Our bounds, based on algorithmic stability theory, generalize known analyses for the newsvendor problem without feature information. Finally, we apply the feature-based algorithms for nurse staffing in a hospital emergency room using a data set from a large UK teaching hospital and find that (1) the best ERM and KO algorithms beat the best practice benchmark by 23% and 24%, respectively, in the out-of-sample cost, and (2) the best KO algorithm is faster than the best ERM algorithm by three orders of magnitude and the best practice benchmark by two orders of magnitude.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Operations Research

DOI

EISSN

1526-5463

ISSN

0030-364X

Publication Date

January 1, 2019

Volume

67

Issue

1

Start / End Page

90 / 108

Related Subject Headings

  • Operations Research
  • 3507 Strategy, management and organisational behaviour
  • 1503 Business and Management
  • 0802 Computation Theory and Mathematics
  • 0102 Applied Mathematics
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Ban, G. Y., & Rudin, C. (2019). The big Data newsvendor: Practical insights from machine learning. Operations Research, 67(1), 90–108. https://doi.org/10.1287/opre.2018.1757
Ban, G. Y., and C. Rudin. “The big Data newsvendor: Practical insights from machine learning.” Operations Research 67, no. 1 (January 1, 2019): 90–108. https://doi.org/10.1287/opre.2018.1757.
Ban GY, Rudin C. The big Data newsvendor: Practical insights from machine learning. Operations Research. 2019 Jan 1;67(1):90–108.
Ban, G. Y., and C. Rudin. “The big Data newsvendor: Practical insights from machine learning.” Operations Research, vol. 67, no. 1, Jan. 2019, pp. 90–108. Scopus, doi:10.1287/opre.2018.1757.
Ban GY, Rudin C. The big Data newsvendor: Practical insights from machine learning. Operations Research. 2019 Jan 1;67(1):90–108.

Published In

Operations Research

DOI

EISSN

1526-5463

ISSN

0030-364X

Publication Date

January 1, 2019

Volume

67

Issue

1

Start / End Page

90 / 108

Related Subject Headings

  • Operations Research
  • 3507 Strategy, management and organisational behaviour
  • 1503 Business and Management
  • 0802 Computation Theory and Mathematics
  • 0102 Applied Mathematics