Scholars@Duke publication: Exact and consistent interpretation for piecewise linear neural networks: A closed form solution

Exact and consistent interpretation for piecewise linear neural networks: A closed form solution

Publication , Conference

Chu, L; Hu, X; Hu, J; Wang, L; Pei, J

Published in: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

July 19, 2018

Strong intelligent machines powered by deep neural networks are increasingly deployed as black boxes to make decisions in risk-sensitive domains, such as finance and medical. To reduce potential risk and build trust with users, it is critical to interpret how such machines make their decisions. Existing works interpret a pre-trained neural network by analyzing hidden neurons, mimicking pre-trained models or approximating local predictions. However, these methods do not provide a guarantee on the exactness and consistency of their interpretations. In this paper, we propose an elegant closed form solution named OpenBox to compute exact and consistent interpretations for the family of Piecewise Linear Neural Networks (PLNN). The major idea is to first transform a PLNN into a mathematically equivalent set of linear classifiers, then interpret each linear classifier by the features that dominate its prediction. We further apply OpenBox to demonstrate the effectiveness of nonnegative and sparse constraints on improving the interpretability of PLNNs. The extensive experiments on both synthetic and real world data sets clearly demonstrate the exactness and consistency of our interpretation.

Duke Scholars

Author Jian Pei Computer Science

Published In

Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

DOI

10.1145/3219819.3220063

Publication Date

July 19, 2018

Start / End Page

1244 / 1253

Citation

APA

Chicago

ICMJE

MLA

NLM

Chu, L., Hu, X., Hu, J., Wang, L., & Pei, J. (2018). Exact and consistent interpretation for piecewise linear neural networks: A closed form solution. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 1244–1253). https://doi.org/10.1145/3219819.3220063

Chu, L., X. Hu, J. Hu, L. Wang, and J. Pei. “Exact and consistent interpretation for piecewise linear neural networks: A closed form solution.” In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 1244–53, 2018. https://doi.org/10.1145/3219819.3220063.

Chu L, Hu X, Hu J, Wang L, Pei J. Exact and consistent interpretation for piecewise linear neural networks: A closed form solution. In: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2018. p. 1244–53.

Chu, L., et al. “Exact and consistent interpretation for piecewise linear neural networks: A closed form solution.” Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2018, pp. 1244–53. Scopus, doi:10.1145/3219819.3220063.

Chu L, Hu X, Hu J, Wang L, Pei J. Exact and consistent interpretation for piecewise linear neural networks: A closed form solution. Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2018. p. 1244–1253.

Published In

Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

DOI

10.1145/3219819.3220063

Publication Date

July 19, 2018

Start / End Page

1244 / 1253