Skip to main content

A Computational Model of Comprehension in Manga Style Visual Narratives

Publication ,  Conference
Chen, YC; Jhala, A
Published in: Proceedings of the 43rd Annual Meeting of the Cognitive Science Society Comparative Cognition Animal Minds Cogsci 2021
January 1, 2021

Understanding a sequence of images as a visual narrative is challenging because it requires not only the understanding of what is shown at a particular moment but also what has changed, been omitted or is out of frame. The human cognitive system makes inferences about the state of the world based on transitions between sequential frames. In this paper, we present a principled analysis of the stylistic differences between two dominant styles of multi-modal narratives, western comics and manga. These two styles differ in terms of screening, ballooning, layout, language, and reading order. We first provide a systematic account of these differences based on an annotated dataset consisting of both comics and manga. We then annotate these datasets with a new feature set and evaluate the contributions of these features through development of a computational model of multi-modal comprehension. The model evaluation is presented through the cloze test that measures the accuracy of the model in predicting unseen next frames given the prior frames in a sequence. Our results provide initial benchmarks and insight into the fundamental challenges that the multi-modal narrative understanding task presents for computational models both for language and vision.

Duke Scholars

Published In

Proceedings of the 43rd Annual Meeting of the Cognitive Science Society Comparative Cognition Animal Minds Cogsci 2021

Publication Date

January 1, 2021

Start / End Page

1153 / 1158
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Chen, Y. C., & Jhala, A. (2021). A Computational Model of Comprehension in Manga Style Visual Narratives. In Proceedings of the 43rd Annual Meeting of the Cognitive Science Society Comparative Cognition Animal Minds Cogsci 2021 (pp. 1153–1158).
Chen, Y. C., and A. Jhala. “A Computational Model of Comprehension in Manga Style Visual Narratives.” In Proceedings of the 43rd Annual Meeting of the Cognitive Science Society Comparative Cognition Animal Minds Cogsci 2021, 1153–58, 2021.
Chen YC, Jhala A. A Computational Model of Comprehension in Manga Style Visual Narratives. In: Proceedings of the 43rd Annual Meeting of the Cognitive Science Society Comparative Cognition Animal Minds Cogsci 2021. 2021. p. 1153–8.
Chen, Y. C., and A. Jhala. “A Computational Model of Comprehension in Manga Style Visual Narratives.” Proceedings of the 43rd Annual Meeting of the Cognitive Science Society Comparative Cognition Animal Minds Cogsci 2021, 2021, pp. 1153–58.
Chen YC, Jhala A. A Computational Model of Comprehension in Manga Style Visual Narratives. Proceedings of the 43rd Annual Meeting of the Cognitive Science Society Comparative Cognition Animal Minds Cogsci 2021. 2021. p. 1153–1158.

Published In

Proceedings of the 43rd Annual Meeting of the Cognitive Science Society Comparative Cognition Animal Minds Cogsci 2021

Publication Date

January 1, 2021

Start / End Page

1153 / 1158