Systematic variation of features

Manipulation of voice quality, f0 height and f0 range

F0 height 80%
range/vqneutralbreathycreakyfalsettotense
50% wav wav wav wav wav
100% wav wav wav wav wav
150% wav wav wav wav wav
F0 height 100%
range/vqneutralbreathycreakyfalsettotense
50% wav wav wav wav wav
100% wav wav wav wav wav
150% wav wav wav wav wav
F0 height 130%
range/vqneutralbreathycreakyfalsettotense
50% wav wav wav wav wav
100% wav wav wav wav wav
150% wav wav wav wav wav

Manipulation of voice quality, speech rate and vowel precision

Speech rate original
vwl prec/vqneutralbreathycreakyfalsettotense
orig wav wav wav wav wav
overshoot wav wav wav wav wav
undershoot wav wav wav wav wav
Speech rate 20% faster
vwl prec/vqneutralbreathycreakyfalsettotense
orig wav wav wav wav wav
overshoot wav wav wav wav wav
undershoot wav wav wav wav wav
Speech rate 20 % slower
vwl prec/vqneutralbreathycreakyfalsettotense
orig wav wav wav wav wav
overshoot wav wav wav wav wav
undershoot wav wav wav wav wav
Speech rate mixed model (slow stressed, fasten unstressed syllables)
vwl prec/vqneutralbreathycreakyfalsettotense
orig wav wav wav wav wav
overshoot wav wav wav wav wav
undershoot wav wav wav wav wav