Towards classification of head movements in audiovisual recordings of read news

Research output: Chapter in book/report/conference proceeding, Paper in conference proceeding

Abstract

In this paper we develop a system for detecting word-related head movements in audiovisual recordings of read news. Our materials consist of Swedish television news broadcasts and comprise audiovisual recordings of five news readers (two female, three male). The corpus was manually labelled for head movement using a simple annotation scheme: a binary decision about the absence or presence of a movement in relation to each word. We use OpenCV for frontal face detection and, based on this, calculate velocity and acceleration features. We then train a machine learning system to predict the absence or presence of head movement and achieve an accuracy of 0.892, which is better than the baseline. The system may thus be helpful for head movement labelling.
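As a rough illustration of how velocity and acceleration features might be derived from face detections, the sketch below applies finite differences to a track of face-centre positions, one per video frame. The abstract does not specify the exact feature definitions, so the function names, the 25 fps frame rate, and the sample coordinates are assumptions made for illustration only, not the authors' implementation. In practice the positions would come from a face detector such as OpenCV's; here they are hard-coded.

```python
import math


def finite_difference(points, rate):
    """Forward difference of a sequence of (x, y) pairs, scaled to units per second."""
    return [((x1 - x0) * rate, (y1 - y0) * rate)
            for (x0, y0), (x1, y1) in zip(points, points[1:])]


def motion_features(centres, fps=25.0):
    """Return (speed, acceleration magnitude) lists for a face-centre track.

    centres: one (x, y) pixel position per frame (hypothetical detector output).
    fps: frame rate of the recording (assumed value, not taken from the paper).
    """
    velocity = finite_difference(centres, fps)           # pixels per second
    acceleration = finite_difference(velocity, fps)      # pixels per second squared
    speed = [math.hypot(vx, vy) for vx, vy in velocity]
    accel = [math.hypot(ax, ay) for ax, ay in acceleration]
    return speed, accel


# Hypothetical track: the face moves down, speeds up, then stops.
centres = [(100.0, 50.0), (100.0, 52.0), (100.0, 56.0), (100.0, 56.0)]
speed, accel = motion_features(centres, fps=25.0)
print(speed)  # one value per frame transition
print(accel)  # one value per pair of consecutive transitions
```

Per-word features could then be obtained by summarising these values (for example, taking the maximum) over the frames that fall within each word's time interval, which would again be a design choice rather than something stated in the abstract.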

Details

External organisations
  • KTH Royal Institute of Technology
Research areas

Subject classification (UKÄ)

  • Language Technology (Computational Linguistics)
  • Comparative Language Studies and Linguistics
Original language: English
Title of host publication: Proceedings of the 4th European and 7th Nordic Symposium on Multimodal Communication (MMSYM 2016)
Editors: Patrizia Paggio, Costanza Navarretta
Publisher: Linköping University Electronic Press, Linköpings universitet
Pages: 4-9
Number of pages: 6
ISBN (electronic): 978-91-7685-423-5
Status: Published - 2017 Sep 25
Peer-reviewed: Yes
Event: Copenhagen, Denmark

Publication series

Name: Linköping Electronic Conference Proceedings
Publisher: Linköping University Electronic Press
ISSN (print): 1650-3686
ISSN (electronic): 1650-3740

Conference

Conference: 4th European and 7th Nordic Symposium on Multimodal Communication
Country: Denmark
City: Copenhagen
Period: 2016/09/29 to 2016/09/30

Related projects

Johan Frid & Marianne Gullberg

2014/01/01 to 2018/12/31

Project: Network, National collaboration
