Random forests can hash

Publication, Journal Article
Qiu, Q; Sapiro, G; Bronstein, A
Published in: 3rd International Conference on Learning Representations, ICLR 2015 - Workshop Track Proceedings
January 1, 2015

© 2015 International Conference on Learning Representations, ICLR. All rights reserved.

Hash codes are an efficient data representation needed to cope with ever-growing amounts of data. We introduce a random forest semantic hashing scheme with information-theoretic code aggregation, showing for the first time how random forests, a technique that, together with deep learning, has shown spectacular results in classification, can also be extended to large-scale retrieval. A traditional random forest fails to enforce the consistency of the hashes that each tree generates for data of the same class, i.e., to preserve the underlying similarity, and it also lacks a principled way to aggregate codes across trees. We start with a simple hashing scheme, in which independently trained random trees in a forest act as hashing functions. We then propose a subspace model as the splitting function, and show that it enforces hash consistency within a tree for data from the same class. We also introduce an information-theoretic approach for aggregating the codes of individual trees into a single hash code, producing a near-optimal unique hash for each class. Experiments on large-scale public datasets are presented, showing that the proposed approach significantly outperforms state-of-the-art hashing methods for retrieval tasks.
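The "simple hashing scheme" mentioned in the abstract, where each tree in a forest acts as a hashing function, can be illustrated with a toy sketch. This is not the authors' method: the trees below are untrained and split on random hyperplanes through the origin (an assumption standing in for learned splits and for the subspace model), and the per-tree codes are simply concatenated rather than aggregated with the information-theoretic approach. The names `make_tree`, `leaf_index`, `forest_hash`, and `shared_leaves` are hypothetical.

```python
import random


def make_tree(dim, depth, rng):
    """Build one complete binary tree stored in heap order.

    Each internal node holds a (direction, threshold) split. Splits are random
    hyperplanes through the origin: an illustrative assumption, not a trained
    split and not the subspace model from the abstract.
    """
    n_internal = 2 ** depth - 1
    return [([rng.gauss(0.0, 1.0) for _ in range(dim)], 0.0)
            for _ in range(n_internal)]


def leaf_index(tree, depth, x):
    """Route x from the root to a leaf and return the leaf's index."""
    node = 0
    for _ in range(depth):
        direction, threshold = tree[node]
        projection = sum(d * v for d, v in zip(direction, x))
        # Children of node i sit at 2i + 1 (left) and 2i + 2 (right).
        node = 2 * node + (2 if projection > threshold else 1)
    return node - (2 ** depth - 1)  # leaf index in [0, 2**depth)


def forest_hash(forest, depth, x):
    """Hash x as the tuple of leaves it reaches, one entry per tree."""
    return tuple(leaf_index(tree, depth, x) for tree in forest)


def shared_leaves(code_a, code_b):
    """Similarity of two codes: number of trees where both reach the same leaf."""
    return sum(a == b for a, b in zip(code_a, code_b))


rng = random.Random(0)
depth, n_trees, dim = 3, 8, 4
forest = [make_tree(dim, depth, rng) for _ in range(n_trees)]

query = [0.5, -1.0, 2.0, 0.25]
nearby = [0.55, -0.9, 2.1, 0.2]
print(forest_hash(forest, depth, query))
print(shared_leaves(forest_hash(forest, depth, query),
                    forest_hash(forest, depth, nearby)))
```

Because nothing here ties the splits to class labels, two samples from the same class can easily land in different leaves of the same tree. That is the consistency problem the abstract attributes to a traditional random forest and addresses with the subspace splitting function.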

Citation

APA: Qiu, Q., Sapiro, G., & Bronstein, A. (2015). Random forests can hash. 3rd International Conference on Learning Representations, ICLR 2015 - Workshop Track Proceedings.
Chicago: Qiu, Q., G. Sapiro, and A. Bronstein. “Random forests can hash.” 3rd International Conference on Learning Representations, ICLR 2015 - Workshop Track Proceedings, January 1, 2015.
ICMJE: Qiu Q, Sapiro G, Bronstein A. Random forests can hash. 3rd International Conference on Learning Representations, ICLR 2015 - Workshop Track Proceedings. 2015 Jan 1;
MLA: Qiu, Q., et al. “Random forests can hash.” 3rd International Conference on Learning Representations, ICLR 2015 - Workshop Track Proceedings, Jan. 2015.
NLM: Qiu Q, Sapiro G, Bronstein A. Random forests can hash. 3rd International Conference on Learning Representations, ICLR 2015 - Workshop Track Proceedings. 2015 Jan 1;
