Close to the Action: Eye-Tracking Evaluation of Speaker-Following Subtitles

Kuno Kurzhals¹ Emine Cetinkaya¹ Yongtao Hu² Wenping Wang² Daniel Weiskopf¹

¹ Universität Stuttgart, Germany ² The University of Hong Kong, Hong Kong

The 35th ACM Conference on Human Factors in Computing Systems (CHI 2017)

Abstract

The incorporation of subtitles in multimedia content plays an important role in communicating spoken content. For example, subtitles in the respective language are often preferred to expensive audio translation of foreign movies. The traditional representation of subtitles displays text centered at the bottom of the screen. This layout can lead to large distances between text and relevant image content, causing eye strain and even that we miss visual content. As a recent alternative, the technique of speaker-following subtitles places subtitle text in speech bubbles close to the current speaker. We conducted a controlled eye-tracking laboratory study (n = 40) to compare the regular approach (center-bottom subtitles) with content-sensitive, speaker-following subtitles. We compared different dialog-heavy video clips with the two layouts. Our results show that speaker-following subtitles lead to higher fixation counts on relevant image regions and reduce saccade length, which is an important factor for eye strain.

Downloads

Paper (PDF, 9.3 MB)

Bibtex

@inproceedings{kurzhals2017close,
  title={{Close to the Action: Eye-Tracking Evaluation of Speaker-Following Subtitles}},
  author={Kurzhals, Kuno and Cetinkaya, Emine and Hu, Yongtao and Wang, Wenping and Weiskopf, Daniel},
  booktitle={Proceedings of the 35th ACM Conference on Human Factors in Computing Systems},
  pages={6559--6568},
  year={2017},
  organization={ACM}
}