Enhancement and analysis of conversational speech: JSALT 2017

Publication, Conference
Ryanta, N; Bergelson, E; Church, K; Cristia, A; Du, J; Ganapathy, S; Khudanpur, S; Kowalski, D; Krishnamoorthy, M; Kulshreshta, R; Liberman, M ...
Published in: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
September 10, 2018

Automatic speech recognition is more and more widely and effectively used. Nevertheless, in some automatic speech analysis tasks the state of the art is surprisingly poor. One of these is 'diarization', the task of determining who spoke when. Diarization is key to processing meeting audio and clinical interviews, extended recordings such as police body cam or child language acquisition data, and any other speech data involving multiple speakers whose voices are not cleanly separated into individual channels. Overlapping speech, environmental noise and suboptimal recording techniques make the problem harder. During the JSALT Summer Workshop at CMU in 2017, an international team of researchers worked on several aspects of this problem, including calibration of the state of the art, detection of overlaps, enhancement of noisy recordings, and classification of shorter speech segments. This paper sketches the workshop's results, and announces plans for a 'Diarization Challenge' to encourage further progress.

Published In

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

DOI

10.1109/ICASSP.2018.8462468

ISSN

1520-6149

ISBN

9781538646588

Publication Date

September 10, 2018

Volume

2018-April

Start / End Page

5154 / 5158
 

Citation

APA: Ryanta, N., Bergelson, E., Church, K., Cristia, A., Du, J., Ganapathy, S., … Yu, Z. (2018). Enhancement and analysis of conversational speech: JSALT 2017. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (Vol. 2018-April, pp. 5154–5158). https://doi.org/10.1109/ICASSP.2018.8462468
Chicago: Ryanta, N., E. Bergelson, K. Church, A. Cristia, J. Du, S. Ganapathy, S. Khudanpur, et al. “Enhancement and analysis of conversational speech: JSALT 2017.” In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2018-April:5154–58, 2018. https://doi.org/10.1109/ICASSP.2018.8462468.
ICMJE: Ryanta N, Bergelson E, Church K, Cristia A, Du J, Ganapathy S, et al. Enhancement and analysis of conversational speech: JSALT 2017. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 2018. p. 5154–8.
MLA: Ryanta, N., et al. “Enhancement and analysis of conversational speech: JSALT 2017.” ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, vol. 2018-April, 2018, pp. 5154–58. Scopus, doi:10.1109/ICASSP.2018.8462468.
NLM: Ryanta N, Bergelson E, Church K, Cristia A, Du J, Ganapathy S, Khudanpur S, Kowalski D, Krishnamoorthy M, Kulshreshta R, Liberman M, Lu YD, Maciejewski M, Metze F, Profant J, Sun L, Tsao Y, Yu Z. Enhancement and analysis of conversational speech: JSALT 2017. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 2018. p. 5154–5158.