Skip to main content

PQ-VAE: Learning Hierarchical Discrete Representations with Progressive Quantization

Publication ,  Conference
Huang, L; Qiu, Q; Sapiro, G
Published in: IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops
January 1, 2024

Variational auto-encoders (VAEs) are widely used in generative modeling and representation learning, with applications ranging from image generation to data compression. However, conventional VAEs face challenges in balancing the tradeoff between compactness and informativeness of the learned latent codes. In this work, we propose Progressive Quantization VAE (PQ-VAE), which aims to learn a progressive sequential structure for data representation that maximizes the mutual information between the latent representations and the original data in a limited description length. The resulting representations provide a global, compact, and hierarchical understanding of the data semantics, making it suitable for high-level tasks while achieving high compression rates. The proposed model offers an effective solution for generative modeling and data compression while enabling improved performance in high-level tasks such as image understanding and generation.

Duke Scholars

Published In

IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops

DOI

EISSN

2160-7516

ISSN

2160-7508

Publication Date

January 1, 2024

Start / End Page

7550 / 7558
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Huang, L., Qiu, Q., & Sapiro, G. (2024). PQ-VAE: Learning Hierarchical Discrete Representations with Progressive Quantization. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (pp. 7550–7558). https://doi.org/10.1109/CVPRW63382.2024.00750
Huang, L., Q. Qiu, and G. Sapiro. “PQ-VAE: Learning Hierarchical Discrete Representations with Progressive Quantization.” In IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 7550–58, 2024. https://doi.org/10.1109/CVPRW63382.2024.00750.
Huang L, Qiu Q, Sapiro G. PQ-VAE: Learning Hierarchical Discrete Representations with Progressive Quantization. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops. 2024. p. 7550–8.
Huang, L., et al. “PQ-VAE: Learning Hierarchical Discrete Representations with Progressive Quantization.” IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 2024, pp. 7550–58. Scopus, doi:10.1109/CVPRW63382.2024.00750.
Huang L, Qiu Q, Sapiro G. PQ-VAE: Learning Hierarchical Discrete Representations with Progressive Quantization. IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops. 2024. p. 7550–7558.

Published In

IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops

DOI

EISSN

2160-7516

ISSN

2160-7508

Publication Date

January 1, 2024

Start / End Page

7550 / 7558