
Sequence-to-Sequence Neural Diarization With Automatic Speaker Detection and Representation

Publication, Journal Article
Cheng, M; Lin, Y; Li, M
Published in: IEEE Transactions on Audio, Speech and Language Processing
2025

Published In

IEEE Transactions on Audio, Speech and Language Processing

DOI

10.1109/taslpro.2025.3581032

EISSN

2998-4173

Publication Date

2025

Volume

33

Start / End Page

2719 / 2734

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Related Subject Headings

  • Speech-Language Pathology & Audiology
  • 4603 Computer vision and multimedia computation
  • 4602 Artificial intelligence
  • 4006 Communications engineering
  • 0906 Electrical and Electronic Engineering
  • 0801 Artificial Intelligence and Image Processing
 

Citation

APA
Cheng, M., Lin, Y., & Li, M. (2025). Sequence-to-Sequence Neural Diarization With Automatic Speaker Detection and Representation. IEEE Transactions on Audio, Speech and Language Processing, 33, 2719–2734. https://doi.org/10.1109/taslpro.2025.3581032

Chicago
Cheng, Ming, Yuke Lin, and Ming Li. “Sequence-to-Sequence Neural Diarization With Automatic Speaker Detection and Representation.” IEEE Transactions on Audio, Speech and Language Processing 33 (2025): 2719–34. https://doi.org/10.1109/taslpro.2025.3581032.

ICMJE
Cheng M, Lin Y, Li M. Sequence-to-Sequence Neural Diarization With Automatic Speaker Detection and Representation. IEEE Transactions on Audio, Speech and Language Processing. 2025;33:2719–34.

MLA
Cheng, Ming, et al. “Sequence-to-Sequence Neural Diarization With Automatic Speaker Detection and Representation.” IEEE Transactions on Audio, Speech and Language Processing, vol. 33, Institute of Electrical and Electronics Engineers (IEEE), 2025, pp. 2719–34. Crossref, doi:10.1109/taslpro.2025.3581032.

NLM
Cheng M, Lin Y, Li M. Sequence-to-Sequence Neural Diarization With Automatic Speaker Detection and Representation. IEEE Transactions on Audio, Speech and Language Processing. Institute of Electrical and Electronics Engineers (IEEE); 2025;33:2719–2734.
