See this document in CiteSeerX!

Text Preprocessing for Speech Synthesis  (Make Corrections)  
Uwe D. Reichel, Hartmut R. Pfitzinger Department of Phonetics and Speech...



  Home/Search   Context   Related

 
View or download:
phonetik.unimuenc...fitzingerTCS06.pdf
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  phonetik.unimuenchen.de/Publi... (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: In this paper we describe our text preprocessing modules for English text-to-speech synthesis. These modules comprise rule-based text normalization subsuming sentence segmentation and normalization of non-standard words, statistical part-of-speech tagging, and statistical syllabification, grapheme-to-phoneme conversion, and word stress assignment relying in parts on rule-based morphological analysis. (Update)

Active bibliography (related documents):   More   All
1.1:   Improving Data Driven Part-of-Speech Tagging - Morphologic Knowledge Induction   (Correct)
0.6:   Using Morphology and Phoneme History - To Improve Grapheme-To-Phoneme   (Correct)
0.5:   Tokenization of Portuguese: resolving thr hard cases - Branco, Silva (2003)   (Correct)

Similar documents based on text:
0.0:   Unknown -   (Correct)

BibTeX entry:   (Update)

@misc{ hartmut-text,
  author = "Uwe Reichel Hartmut",
  title = "Text Preprocessing for Speech Synthesis",
  url = "citeseer.ist.psu.edu/765253.html" }
Citations (may not include all citations):
2528   Maximum likelihood from incomplete data via the EM algorithm (context) - Dempster, Laird et al. - 1977
2177   Programs for Machine Learning (context) - Quinlan - 1993
475   Building a large annotated corpus of English: The Penn treeb.. - Marcus, Santorini et al. - 1995
372   An algorithm for suffix stripping (context) - Porter - 1980
337   Error bounds for convolutional codes and an asymptotically o.. (context) - Viterbi - 1967
187   Transformation-based error-driven learning and natural langu.. - Brill - 1995
70   The festival speech synthesis system (context) - Black, Taylor et al. - 1999
53   TnT -- a statistical part-of-speech tagger - Brants - 2000
40   Markov source modeling of text generation (context) - Jelinek - 1985
34   On stress and linguistic rhythm (context) - Liberman, Prince - 1977
29   MITRE: Description of the alembic system used for muc (context) - Aberdeen, Burger et al. - 1995
24   LanguageIndependent Data-Oriented Grapheme-to-PhonemeConvers.. - Daelemans, van den Bosch - 1997
13   Some applications of tree-based modelling to speech and lang.. (context) - Riley - 1989
11   A syntax-based part of speech analyser - Voutilainen - 1995
11   Connectionist models and linguistic theory: Investigations o.. - Gupta, Touretzky - 1994
5   Normalization of non-standard words - Sproat, Black et al. - 2001
4   Writing tools -- the STYLE and DICTION programs (context) - Cherry, Vesterman - 1991
4   Self-learning techniques for graphemeto -phoneme conversion - Yvon - 1994
3   Automated Morphological Segmentation and Evaluation (context) - Reichel, Weilhammer - 2004
2   Computational Linguistics (context) - Mikheev, capitalized et al. - 2002
2   An experiment stemming non-traditional text (context) - Nascimento, Cunha - 1998
2   Using morphology and phoneme history to improve grapheme-to-.. (context) - Reichel, Schiel - 1940
1   Improving data driven part-of-speech tagging by morphologic .. (context) - Reichel - 2005

Documents on the same site (http://www.phonetik.uni-muenchen.de/Publications/):   More
SpeechRecorder - a Universal Platform Independent.. - Draxler, Jänsch (2004)   (Correct)
End-to-End Evaluation of Multimodal Dialogue.. - Beringer, Louka.. (2002)   (Correct)
WebTranscribe - An Extensible Web-Based Speech Annotation Framework - Draxler (2005)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC