56 citations found. Retrieving documents...
A. Black, P. Taylor, and R. Caley. The Festival Speech Synthesis System: System Documentation Edition 1.4, for Festival Version 1.4.2. Available from http://festvox.org/festival/ or http://www.cstr.ed.ac.uk/projects/festival/.

 Home/Search   Document Not in Database   Summary   Related Articles   Check  

This paper is cited in the following contexts:

First 50 documents  Next 50

Augmented Reality in a Wide Area Sentient Environment - Joseph Newman David (2001)   (10 citations)  (Correct)

....server runs on the Batportal. Mono 16 bit, 16 khz samples are generated and sent to it via the WaveLAN. A mixer runs on the iPAQ hardware to provide one key access to mute and low volume settings (suitable for headphones) Speech output has been added, using the Festival text tospeech engine [5]. Festival runs continuously on the backend machine, synthesizing utterances in its client server mode to reduce per invocation overheads. Speech is currently used to identify which room the user is in, and to provide feedback when the current user or mode changes. Status information that can be ....

A. W. Black and P. Taylor. The Festival Speech Synthesis System: System Documentation. Technical Report HCRC/TR-83, University of Edinburgh, Scotland, University of Edinburgh, University of Edinburgh, Scotland, UK, 1997.


Design Principles for Intelligent Environments - Coen (1998)   (42 citations)  (Correct)

....other than preface spoken utterances with the cue Computer to enable verbal interaction. Thus, a user can interact with the room easily, regardless of her proximity to a keyboard or monitor. The Intelligent Room is capable of addressing users via the Festival Speech Synthesis System (Black et al. [2]) Utterances spoken by the room are also displayed on a scrollable LCD sign in case a user was unable to understand what was said. The room uses its speech capability for a variety of purposes that include conducting dialogs with users and getting its occupant s attention without resorting to use ....

Black, A. and Taylor, P. Festival Speech Synthesis System: system documentation (1.1.1) Human Communication Research Centre Technical Report HCRC/TR-83. University of Edinburgh. 1997.


Evaluation Of A System For Concatenative Articulatory Visual.. - Engwall   (Correct)

....were stored in a concatenation database (the mean number of entries per diart was 11.8 and the median 5) 2.3. Text to visual speech algorithm The algorithm to create tongue movements from text uses the letter to phoneme conversion, phoneme duration calculation and acoustic synthesis in Festival [6], whereas the diart selection algorithm is specific for the current application. The selection algorithm aims at minimizing the differences at the joins of concatenated units and between the selected diarts and the target. This is done using a weighted sum C of the concatenation and target ....

A. Black and P. Taylor, "Festival speech synthesis system: system documentation (1.1.1)," Tech. Rep., Centre for Speech Technology Research, University of Edinburgh, 1997.


Tools For Researchand Education In Speech Science - Ronald Cole Center (1999)   (3 citations)  (Correct)

....has been developed for PROFER in which students learn to develop a conversational system for retrieving movie times and locations from a Web site. Festival Speech Synthesis System. The toolkit integrates the Festival text to speech synthesis system, developed at the University of Edinburgh [11]. Festival provides a complete environment for learning, researching and developing synthetic speech, including modules for normalizing text (e.g. dealing with abbreviations) transforming text into a sequence of phonetic segments with appropriate durations, assigning prosodic contours (e.g. ....

Black, A., and Taylor, P., "Festival Speech Synthesis System: System documentation (1.1.1)," Human Communication Research Centre Technical Report HCRC/TR-83, Edinburgh, 1997.


A Computer-Based Course in Spectrogram Reading - Carmell, Hosom, Cole (1999)   (Correct)

....is one such extension. 2.2 BaldiSync The BaldiSync application combines the playback of speech with visible articulator movements. BaldiSync integrates (a) SpeechView s waveform, spectrogram, and label displays, b) the 3D talking face called Baldi [4] c) the Festival speech synthesis server [5], and (d) the Toolkit forced alignment package. To synchronize recorded speech with Baldi s facial Figure 1. A sample SpeechView window, showing the waveform, spectrogram, and label windows. movements, users record an utterance or read in a waveform file, then supply the text of the utterance as ....

Black, A., Taylor, P., Festival Speech Synthesis System: System documentation (1.1.1), Human Communication Research Centre Technical Report HCRC/TR-83, Edinburgh, 1997.


Fully Automatic Prosody Generator for Text-to-Speech - Malfrère, Dutoit..   (Correct)

....its size (in phonemes) at the phoneme contextual level: the phonetic class of the following phoneme; at the rhythmic level: the position (in syllables) of the last accented syllable. The CART tree has been trained with WAGON, a tool available with the FESTIVAL Speech Synthesis system of CSTR [14]. The training of the CART on 90 of the corpus and its testing on the other 10 give a mean duration prediction error of less than 20 ms. FO Curve Generation To convert the symbolic representation of intonation into an f0 curve represented by a set of pitch targets, an intonation pattern ....

Black A.W., Taylor P. and Caley R., "The Festival Speech Synthesis System: System Documentation" University of Edinburgh, 1997.


Prominence Prediction For Super-Sentential Prosodic Modeling.. - On New Database   Self-citation (Black)   (Correct)

No context found.

A. W. Black and P. Taylor, "The Festival Speech Synthesis System: system documentation, " Tech. Rep. HCRC/TR-83, Human Communciation Research Centre, University of Edinburgh, Scotland, UK, January 1997, Available at http://www.cstr.ed.ac.uk/projects/festival/.


Using Decision Trees within the Tilt Intonation Model to.. - Kurt Dusterhoff Alan (1999)   (7 citations)  Self-citation (Black Taylor)   (Correct)

No context found.

A.W. Black, P. Taylor, and R. Caley. The Festival Speech Synthesis System: system documentation. The Centre for Speech Technology Research, University of Edinburgh, 1.3 edition, 1998. http://www.cstr.ed.ac.uk/projects/festival/manual1. 3.0/festival toc.html.


Domain Action Classification and Argument Parsing - For Interlingua-Based Spoken   (Correct)

No context found.

A. Black, P. Taylor, and R. Caley. The Festival Speech Synthesis System: System Documentation Edition 1.4, for Festival Version 1.4.2. Available from http://festvox.org/festival/ or http://www.cstr.ed.ac.uk/projects/festival/.


The Caterpillar System For Data-Driven Concatenative Sound.. - Diemo Schwarz Ircam (2003)   (Correct)

No context found.

Alan Black, Paul Taylor, and Richard Caley, "The Festival Speech Synthesis System: System Documentation," Technical Report HCRC/TR-83, Human Communication Research Centre, 1998.


Emotion in Speech Synthesis - Hoult (2004)   (Correct)

No context found.

A. Black and P. Taylor, The Festival Speech Synthesis System: system documentation, Human Communications Research Centre, University of Edinburgh, Scotland, January 1997.


Tools For The Development Of A Hindi Speech Synthesis.. - Bali, Ramakrishnan.. (2004)   (Correct)

No context found.

A. Black and P. Taylor, "Festival speech synthesis system: system documentation (1.1.1)," Tech. Rep. HCRC/TR-83, Human Communication Research Centre, 1997.


Duration Models and the Perceptual Evaluation of Spoken.. - Hyunsong Chung Department (2002)   (Correct)

No context found.

Black, A.; Taylor, P.; Caley, R., 1999. The Festival Speech Synthesis System: system documentation, edition 1.4, for Festival Version 1.4.0., CSTR web page, University of Edinburgh.


A Natural Human-Computer Interface for Controlling Wheeled.. - Flippo (2003)   (Correct)

No context found.

Alan W. Black, Paul Taylor, and Richard Caley. The Festival Speech Synthesis System --- System documentation, 1.4, for festival version 1.4.0 edition, Jun 1999.


Assigning Prosodic Structure for Speech Synthesis: a.. - Michaela Atterer.. (2002)   (2 citations)  (Correct)

No context found.

Black, A. W.; Taylor, P., 1999. The festival speech synthesis system: system documentation. Available from


The Caterpillar System for Data-Driven Concatenative Sound.. - Schwarz (2003)   (Correct)

No context found.

Alan Black, Paul Taylor, and Richard Caley, "The Festival Speech Synthesis System: System Documentation," Technical Report HCRC/TR-83, Human Communication Research Centre, 1998.


Dunedin New Zealand - Development Of Mori (2000)   (Correct)

No context found.

Black, A. and Taylor, P. (1997). Festival Speech Synthesis System: system documentation. Technical Report HCRC/TR83, Human Communication Research Centre. University of Edinburgh, Scotland.


Forced Alignment For Speech Synthesis Databases Using.. - Prosodic Phrase Breaks   (Correct)

No context found.

A. W. Black and P. Taylor, "The Festival Speech Synthesis System: system documentation, " Tech. Rep. HCRC/TR-83, Human Communciation Research Centre, University of Edinburgh, Scotland, UK, January 1997, Available at http://www.cstr.ed.ac.uk/projects/festival/.


Demonstrations of Dialogue Design Tools in the CSLU Toolkit - Ron Cole Jacques   (Correct)

No context found.

A. Black, and P. Taylor. Festival Speech Synthesis System: System documentation (1.1.1), Human Communication Research Centre Technical Report HCRC/TR-83, Edinburgh, 1997.


The Caterpillar System for Data-Driven Concatenative Sound.. - Schwarz (2003)   (Correct)

No context found.

Alan Black, Paul Taylor, and Richard Caley, "The Festival Speech Synthesis System: System Documentation," Technical Report HCRC/TR-83, Human Communication Research Centre, 1998.


Perceptual And Objective Detection Of Discontinuities In - Concatenative Speech..   (Correct)

No context found.

A. Black and P. Taylor, "The Festival Speech Synthesis System: system documentation," Technical Report HCHC/TR83, 1997.


MBone2Tel - Telephone Users Meeting the MBone - Ackermann, Pommnitz, Wolf..   (Correct)

No context found.

A. Black and P. Taylor. "Festival Speech Synthesis System: system documentation (1.1.1)" Human Communication Research Centre, Technical Report HCRC/TR-83, 1997, http://www.cstr.ed.ac.uk/projects/festival/festival.html


MBone2Tel - Telephone Users Meeting the MBone - Ackermann, Pommnitz, Wolf..   (Correct)

No context found.

A. Black and P. Taylor. "Festival Speech Synthesis System: system documentation (1.1.1)" Human Communication Research Centre, Technical Report HCRC/TR-83, 1997, http://www.cstr.ed.ac.uk/projects/festival/festival.html


XIMERA: A New TTS from ATR Based on Corpus-Based.. - Kawai, Toda, Ni..   (Correct)

No context found.

A. Black and P. Taylor, "Festival Speech Synthesis System: system documentation (1.1.1)," Human Communication Research Centre Technical Report HCRC/TR-83, 1997. See also http://www.cstr.ed.ac.uk/projects/festival/


Paired Speech and Gesture Generation in Embodied Conversational.. - Yan (2000)   (4 citations)  (Correct)

No context found.

Black, A. and Taylor, P., Festival Speech Synthesis System: system documentation (1.1.1) Human Communication Research Centre Technical Report HCRC/TR-83, 1997.

First 50 documents  Next 50

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC