DAVID: An open-source platform for real-time transformation of infra-segmental emotional cues in running speech

Research output: Contribution to journalArticle

Abstract

We present an open-source software platform that transforms emotional cues expressed by speech signals using audio effects like pitch shifting, inflection, vibrato, and filtering. The emotional transformations can be applied to any audio file, but can also run in real time, using live input from a microphone, with less than 20-ms latency. We anticipate that this tool will be useful for the study of emotions in psychology and neuroscience, because it enables a high level of control over the acoustical and emotional content of experimental stimuli in a variety of laboratory situations, including real-time social situations. We present here results of a series of validation experiments aiming to position the tool against several methodological requirements: that transformed emotions be recognized at above-chance levels, valid in several languages (French, English, Swedish, and Japanese) and with a naturalness comparable to natural speech.

Details

Authors
Organisations
External organisations
  • University College London
  • Waseda University
  • University of Tokyo
  • Pierre and Marie Curie University
  • Swedish Collegium for Advanced Study (SCAS)
Research areas and keywords

Subject classification (UKÄ) – MANDATORY

  • General Language Studies and Linguistics
  • Human Computer Interaction

Keywords

  • Emotional transformations, Infra-segmental cues, Nonverbal behavior, Real-time, Software, Voice
Original languageEnglish
Pages (from-to)323-343
JournalBehavior Research Methods
Volume50
Issue number1
Early online date2017 Apr 3
Publication statusPublished - 2018 Feb
Publication categoryResearch
Peer-reviewedYes