Skip to main content

Joint Attention for Automated Video Editing

Publication ,  Conference
Wu, HY; Santarra, T; Leece, M; Vargas, R; Jhala, A
Published in: Imx 2020 Proceedings of the 2020 ACM International Conference on Interactive Media Experiences
June 17, 2020

Joint attention refers to the shared focal points of attention for occupants in a space. In this work, we introduce a computational definition of joint attention for the automated editing of meetings in multi-camera environments from the AMI corpus. Using extracted head pose and individual headset amplitude as features, we developed three editing methods: (1) a naive audio-based method that selects the camera using only the headset input, (2) a rule-based edit that selects cameras at a fixed pacing using pose data, and (3) an editing algorithm using LSTM (Long-short term memory) learned joint-attention from both pose and audio data, trained on expert edits. The methods are evaluated qualitatively against the human edit, and quantitatively in a user study with 22 participants. Results indicate that LSTM-trained joint attention produces edits that are comparable to the expert edit, offering a wider range of camera views than audio, while being more generalizable as compared to rule-based methods.

Duke Scholars

Published In

Imx 2020 Proceedings of the 2020 ACM International Conference on Interactive Media Experiences

DOI

Publication Date

June 17, 2020

Start / End Page

55 / 64
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Wu, H. Y., Santarra, T., Leece, M., Vargas, R., & Jhala, A. (2020). Joint Attention for Automated Video Editing. In Imx 2020 Proceedings of the 2020 ACM International Conference on Interactive Media Experiences (pp. 55–64). https://doi.org/10.1145/3391614.3393656
Wu, H. Y., T. Santarra, M. Leece, R. Vargas, and A. Jhala. “Joint Attention for Automated Video Editing.” In Imx 2020 Proceedings of the 2020 ACM International Conference on Interactive Media Experiences, 55–64, 2020. https://doi.org/10.1145/3391614.3393656.
Wu HY, Santarra T, Leece M, Vargas R, Jhala A. Joint Attention for Automated Video Editing. In: Imx 2020 Proceedings of the 2020 ACM International Conference on Interactive Media Experiences. 2020. p. 55–64.
Wu, H. Y., et al. “Joint Attention for Automated Video Editing.” Imx 2020 Proceedings of the 2020 ACM International Conference on Interactive Media Experiences, 2020, pp. 55–64. Scopus, doi:10.1145/3391614.3393656.
Wu HY, Santarra T, Leece M, Vargas R, Jhala A. Joint Attention for Automated Video Editing. Imx 2020 Proceedings of the 2020 ACM International Conference on Interactive Media Experiences. 2020. p. 55–64.

Published In

Imx 2020 Proceedings of the 2020 ACM International Conference on Interactive Media Experiences

DOI

Publication Date

June 17, 2020

Start / End Page

55 / 64