Scholars@Duke publication: Learning algorithms for online principal-agent problems (and selling goods online)

Learning algorithms for online principal-agent problems (and selling goods online)

Publication , Journal Article

Conitzer, V; Garera, N

Published in: ACM International Conference Proceeding Series

December 1, 2006

In a principal-agent problem, a principal seeks to motivate an agent to take a certain action beneficial to the principal, while spending as little as possible on the reward. This is complicated by the fact that the principal does not know the agent's utility function (or type). We study the online setting where at each round, the principal encounters a new agent, and the principal sets the rewards anew. At the end of each round, the principal only finds out the action that the agent took, but not his type. The principal must learn how to set the rewards optimally. We show that this setting generalizes the setting of selling a digital good online. We study and experimentally compare three main approaches to this problem. First, we show how to apply a standard bandit algorithm to this setting. Second, for the case where the distribution of agent types is fixed (but unknown to the principal), we introduce a new gradient ascent algorithm. Third, for the case where the distribution of agents' types is fixed, and the principal has a prior belief (distribution) over a limited class of type distributions, we study a Bayesian approach.

Duke Scholars

Author Vincent Conitzer Computer Science

Published In

ACM International Conference Proceeding Series

DOI

10.1145/1143844.1143871

Publication Date

December 1, 2006

Volume

148

Start / End Page

209 / 216

Citation

APA

Chicago

ICMJE

MLA

NLM

Conitzer, V., & Garera, N. (2006). Learning algorithms for online principal-agent problems (and selling goods online). ACM International Conference Proceeding Series, 148, 209–216. https://doi.org/10.1145/1143844.1143871

Conitzer, V., and N. Garera. “Learning algorithms for online principal-agent problems (and selling goods online).” ACM International Conference Proceeding Series 148 (December 1, 2006): 209–16. https://doi.org/10.1145/1143844.1143871.

Conitzer V, Garera N. Learning algorithms for online principal-agent problems (and selling goods online). ACM International Conference Proceeding Series. 2006 Dec 1;148:209–16.

Conitzer, V., and N. Garera. “Learning algorithms for online principal-agent problems (and selling goods online).” ACM International Conference Proceeding Series, vol. 148, Dec. 2006, pp. 209–16. Scopus, doi:10.1145/1143844.1143871.

Conitzer V, Garera N. Learning algorithms for online principal-agent problems (and selling goods online). ACM International Conference Proceeding Series. 2006 Dec 1;148:209–216.

Published In

ACM International Conference Proceeding Series

DOI

10.1145/1143844.1143871

Publication Date

December 1, 2006

Volume

148

Start / End Page

209 / 216