Skip to main content

The rate of convergence of AdaBoost

Publication ,  Conference
Mukherjee, I; Rudin, C; Schapire, RE
Published in: Journal of Machine Learning Research
January 1, 2011

The AdaBoost algorithm of Freund and Schapire (1997) was designed to combine many "weak" hypotheses that perform slightly better than a random guess into a "strong" hypo-thesis that has very low error. We study the rate at which AdaBoost iteratively converges to the minimum of the "exponential loss" with a fast rate of convergence. Our proofs do not require a weak-learning assumption, nor do they require that minimizers of the exponential loss are finite. Specifically, our first result shows that at iteration t, the exponential loss of AdaBoost 's computed parameter vector will be at most ε more than that of any parameter vector of ℓ1- norm bounded by B in a number of rounds that is bounded by a polynomial in B and 1/ε. We also provide rate lower bound examples showing a polynomial dependence on these parameters is necessary. Our second result is that within C/ε iterations, AdaBoost achieves a value of the exponential loss that is at most ε more than the best possible value, where C depends on the dataset. We show that this dependence of the rate on ε is optimal up to constant factors, i.e. at least Ω( 1/ε) rounds are necessary to achieve within ε of the optimal exponential loss. © 2011 I. Mukherjee, C. Rudin & R.E. Schapire.

Duke Scholars

Published In

Journal of Machine Learning Research

EISSN

1533-7928

ISSN

1532-4435

Publication Date

January 1, 2011

Volume

19

Start / End Page

537 / 557

Related Subject Headings

  • Artificial Intelligence & Image Processing
  • 17 Psychology and Cognitive Sciences
  • 08 Information and Computing Sciences
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Mukherjee, I., Rudin, C., & Schapire, R. E. (2011). The rate of convergence of AdaBoost. In Journal of Machine Learning Research (Vol. 19, pp. 537–557).
Mukherjee, I., C. Rudin, and R. E. Schapire. “The rate of convergence of AdaBoost.” In Journal of Machine Learning Research, 19:537–57, 2011.
Mukherjee I, Rudin C, Schapire RE. The rate of convergence of AdaBoost. In: Journal of Machine Learning Research. 2011. p. 537–57.
Mukherjee, I., et al. “The rate of convergence of AdaBoost.” Journal of Machine Learning Research, vol. 19, 2011, pp. 537–57.
Mukherjee I, Rudin C, Schapire RE. The rate of convergence of AdaBoost. Journal of Machine Learning Research. 2011. p. 537–557.

Published In

Journal of Machine Learning Research

EISSN

1533-7928

ISSN

1532-4435

Publication Date

January 1, 2011

Volume

19

Start / End Page

537 / 557

Related Subject Headings

  • Artificial Intelligence & Image Processing
  • 17 Psychology and Cognitive Sciences
  • 08 Information and Computing Sciences