Skip to main content

Exploiting product distributions to identify relevant variables of correlation immune functions

Publication ,  Journal Article
Hellerstein, L; Roseli, B; Bach, E; Ray, S; Page, D
Published in: Journal of Machine Learning Research
November 30, 2009

A Boolean function f is correlation immune if each input variable is independent of the output, under the uniform distribution on inputs. For example, the parity function is correlation immune. We consider the problem of identifying relevant variables of a correlation immune function, in the presence of irrelevant variables. We address this problem in two different contexts. First, we analyze Skewing, a heuristic method that was developed to improve the ability of greedy decision tree algorithms to identify relevant variables of correlation immune Boolean functions, given examples drawn from the uniform distribution (Page and Ray, 2003). We present theoretical results revealing both the capabilities and limitations of skewing. Second, we explore the problem of identifying relevant variables in the Product Distribution Choice (PDC) learning model, a model in which the learner can choose product distributions and obtain examples from them. We prove a lemma establishing a property of Boolean functions that may be of independent interest. Using this lemma, we give two new algorithms for finding relevant variables of correlation immune functions in the PDC model. © 2009 Lisa Hellerstein, Bernard Roseli, Eric Bach, Soumya Ray and David Page.

Duke Scholars

Published In

Journal of Machine Learning Research

EISSN

1533-7928

ISSN

1532-4435

Publication Date

November 30, 2009

Volume

10

Start / End Page

2375 / 2411

Related Subject Headings

  • Artificial Intelligence & Image Processing
  • 4905 Statistics
  • 4611 Machine learning
  • 17 Psychology and Cognitive Sciences
  • 08 Information and Computing Sciences
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Hellerstein, L., Roseli, B., Bach, E., Ray, S., & Page, D. (2009). Exploiting product distributions to identify relevant variables of correlation immune functions. Journal of Machine Learning Research, 10, 2375–2411.
Hellerstein, L., B. Roseli, E. Bach, S. Ray, and D. Page. “Exploiting product distributions to identify relevant variables of correlation immune functions.” Journal of Machine Learning Research 10 (November 30, 2009): 2375–2411.
Hellerstein L, Roseli B, Bach E, Ray S, Page D. Exploiting product distributions to identify relevant variables of correlation immune functions. Journal of Machine Learning Research. 2009 Nov 30;10:2375–411.
Hellerstein, L., et al. “Exploiting product distributions to identify relevant variables of correlation immune functions.” Journal of Machine Learning Research, vol. 10, Nov. 2009, pp. 2375–411.
Hellerstein L, Roseli B, Bach E, Ray S, Page D. Exploiting product distributions to identify relevant variables of correlation immune functions. Journal of Machine Learning Research. 2009 Nov 30;10:2375–2411.

Published In

Journal of Machine Learning Research

EISSN

1533-7928

ISSN

1532-4435

Publication Date

November 30, 2009

Volume

10

Start / End Page

2375 / 2411

Related Subject Headings

  • Artificial Intelligence & Image Processing
  • 4905 Statistics
  • 4611 Machine learning
  • 17 Psychology and Cognitive Sciences
  • 08 Information and Computing Sciences