Skip to main content

INVERTIBLE VOICE CONVERSION WITH PARALLEL DATA

Publication ,  Conference
Cai, Z; Li, M
Published in: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
January 1, 2024

This paper introduces an innovative deep learning framework for parallel voice conversion to mitigate inherent risks associated with such systems. Our approach focuses on developing an invertible model capable of countering potential spoofing threats. Specifically, we present a conversion model that allows for the retrieval of source voices, thereby facilitating the identification of the source speaker. This framework is constructed using a series of invertible modules composed of affine coupling layers to ensure the reversibility of the conversion process. We conduct comprehensive training and evaluation of the proposed framework using parallel training data. Our experimental results reveal that this approach achieves comparable performance to non-invertible systems in voice conversion tasks. Notably, the converted outputs can be seamlessly reverted to the original source inputs using the same parameters employed during the forwarding process. This advancement holds considerable promise for elevating the security and reliability of voice conversion.

Duke Scholars

Published In

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

DOI

ISSN

1520-6149

Publication Date

January 1, 2024

Start / End Page

10041 / 10045
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Cai, Z., & Li, M. (2024). INVERTIBLE VOICE CONVERSION WITH PARALLEL DATA. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (pp. 10041–10045). https://doi.org/10.1109/ICASSP48485.2024.10447701
Cai, Z., and M. Li. “INVERTIBLE VOICE CONVERSION WITH PARALLEL DATA.” In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 10041–45, 2024. https://doi.org/10.1109/ICASSP48485.2024.10447701.
Cai Z, Li M. INVERTIBLE VOICE CONVERSION WITH PARALLEL DATA. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 2024. p. 10041–5.
Cai, Z., and M. Li. “INVERTIBLE VOICE CONVERSION WITH PARALLEL DATA.” ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2024, pp. 10041–45. Scopus, doi:10.1109/ICASSP48485.2024.10447701.
Cai Z, Li M. INVERTIBLE VOICE CONVERSION WITH PARALLEL DATA. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 2024. p. 10041–10045.

Published In

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

DOI

ISSN

1520-6149

Publication Date

January 1, 2024

Start / End Page

10041 / 10045