On the Relative Importance of Different Prosodic Factors in Improving Speech Synthesis (1999) [1 citation, 0 self]
Abstract:
We present results of perceptual experiments that assess the relative importance of several prosodic factors in synthetic speech, showing that naturalness, relative to a target speaking style, can be significantly improved through both symbolic label prediction and better F0 and duration generation. The experiments used a novel perceptual paradigm in which each test subject is given two reference utterances, so as to obtain reliable absolute scores that indicate the magnitude of improvement. This approach gives ratings that are comparable across experiments. Results also show a strong interaction between detailed F0 and duration controls.
Citations
| 74 | Automatically clustering similar units for unit selection in speech synthesis – Black, Taylor - 1997 |
| 61 | Evaluation of prosodic transcription labeling reliability in the ToBI framework – Pitrelli, Beckman, et al. - 1994 |
| 19 | Perceptual experiments for diagnostic testing of text-to-speech systems. Computer Speech and Language – van Santen - 1993 |
| 10 | A dynamical system model for generating fundamental frequency for speech synthesis – Ross, Ostendorf - 1999 |
| 5 | Data driven formant synthesis – Högberg - 1997 |
| 4 | Can we perceive attitudes before the end of sentences? The gating paradigm for prosodic contours – Aubergé, Grépillat, et al. - 1997 |
| 3 | Factors affecting perceived quality and intelligibility in the CHATR concatenative speech synthesizer – Campbell, Itoh, et al. - 1997 |
| 2 | Prediction of abstract prosodic labels for speech synthesis. Computer Speech and Language – Ross, Ostendorf - 1996 |

