Acoustic features of multimodal prominences: Do visual beat gestures affect verbal pitch accent realization?

Forskningsoutput: Kapitel i bok/rapport/Conference proceedingKonferenspaper i proceeding


The interplay of verbal and visual prominence cues has attracted recent attention, but previous findings are inconclusive as to whether and how the two modalities are integrated in the production and perception of prominence. In particular, we do not know whether the phonetic realization of pitch accents is influenced by co-speech beat gestures, and previous findings seem to generate different predictions. In this study, we investigate acoustic properties of
prominent words as a function of visual beat gestures in a corpus of read news from Swedish television. The corpus was annotated for head and eyebrow beats as well as sentence-level pitch accents. Four types of prominence cues occurred
particularly frequently in the corpus: (1) pitch accent only, (2) pitch accent plus head, (3) pitch accent plus head plus eyebrows, and (4) head only. The results show that (4) differs from (1-3) in terms of a smaller pitch excursion and shorter syllable duration. They also reveal significantly larger pitch excursions in (2) than in (1), suggesting that the realization of a pitch accent is to some extent influenced by the presence of visual prominence cues. Results are discussed in terms of the interaction between beat gestures and prosody with a potential functional difference between head and eyebrow beats.


  • Gilbert Ambrazaitis
  • David House
Enheter & grupper
Externa organisationer
  • KTH Royal Institute of Technology

Ämnesklassifikation (UKÄ) – OBLIGATORISK

  • Jämförande språkvetenskap och lingvistik


  • Audio-visual prosody , Co-speech gestures, News speech, Swedish, Multimodality
Titel på värdpublikationProceedings of The 14th International Conference on Auditory-Visual Speech Processing (AVSP2017)
RedaktörerSlim Ouni, Chris Davis, Alexandra Jesse, Jonas Beskow
Antal sidor6
StatusPublished - 2017
Peer review utfördJa
EvenemangInternational Conference on Auditory-Visual Speech Processing - KTH Department of Speech Music and Hearing, Stockholm, Sverige
Varaktighet: 2017 aug 252017 aug 26
Konferensnummer: 14


ISSN (elektroniskt)2308-975X


KonferensInternational Conference on Auditory-Visual Speech Processing
Förkortad titelAVSP 2017


Ingen tillgänglig data

Relaterad forskningsoutput

Ambrazaitis, G. & House, D., 2017, I : Speech Communication. s. 110-113

Forskningsoutput: TidskriftsbidragArtikel i vetenskaplig tidskrift

Ambrazaitis, G. & House, D., 2016, s. 319-319. 1 s.

Forskningsoutput: KonferensbidragPoster

Ambrazaitis, G., Malin Svensson Lundmark & House, D., 2015, Proceedings from Fonetik 2015: Lund, June 8-10, 2015. Working Papers 55. 2015. . Svensson Lundmark, M., Ambrazaitis, G. & van de Weijer, J. (red.). Centre for Languages and Literature, Lund University, Vol. 55. s. 11-16 6 s. (Working Papers in General Linguistics and Phonetics; vol. 55).

Forskningsoutput: Kapitel i bok/rapport/Conference proceedingKonferenspaper i proceeding

Visa alla (3)

Related projects

Visa alla (1)